Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Data Pipeline Intern: Build Clean Data for AI Agents

XPENG

XPENG is seeking an intern in Santa Clara to assist in building a data foundation for an LLM-powered agent. This role involves cleaning, organizing, and connecting various data sources, particularly from team communications and experiments. The ideal candidate will have strong Python and SQL skills, experience with data processing, and an interest in LLM development. Interns can expect a supportive work environment with opportunities to contribute to innovative autonomous driving technologies. #J-18808-Ljbffr

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the AI Data Pipeline Intern: Build Clean Data for AI Agents in Santa Clara, CA vacancy
  •  ...AI Agent Data Pipeline Intern Santa Clara, CA XPENG is a leading smart technology company at the...  ...learning, and smart connectivity. Our team builds platform capabilities that support...  ...for this agent, with a focus on cleaning, organizing, and connecting various data... 
    Internship
    Pipeline
    Data

    XPENG

    Santa Clara, CA
    6 days ago
  • Gigamon is seeking an AI Agent Intern to design, build, and refine AI-powered agents that streamline revenue workflows. You will work closely with...  ...efficiency using modern AI/ML tools, APIs, and data pipelines. Ideal candidates are pursuing a degree in a technical... 
    Internship
    Pipeline
    Data

    Gigamon

    Santa Clara, CA
    4 days ago
  • XPENG & Volkswagen Group is looking for an intern to develop data pipelines and clean various data sources for LLM-powered agent support. Responsibilities include organizing experiment-related data and evaluating agent performance using curated datasets. Candidates should... 
    Internship
    Pipeline
    Data

    XPENG & Volkswagen Group

    Santa Clara, CA
    4 days ago
  • $19 - $65 per hour

     ...located in Santa Clara, is seeking a Simulation/ML Engineer Intern to help build an internal AI assistant that facilitates instant access to company...  ...AI chatbots, implement RAG architecture, and create data pipelines. This role requires strong Python skills, knowledge of... 
    Internship
    Pipeline
    Data
    Hourly pay

    PlusAI, Inc.

    Santa Clara, CA
    1 day ago
  • $123k - $151k

     ...Applied AI Engineer We are a cybersecurity company building a next-generation AI-driven...  ...orchestration, retrieval pipelines, and evaluation...  .... Help design data flows (event...  ...capabilities. Write clean, well-documented...  ...learning about multi-agent architectures,... 
    Internship
    Pipeline
    Data
    Full time
    Summer work
    Night shift

    Edelman

    Sunnyvale, CA
    3 days ago
  •  ...integrating advanced AI and autonomous driving...  ...we’re looking for an intern to support data analysis, vector...  ...‑specific scenarios. Build workflows for vector...  ...engineering, and RAG‑style pipelines to organize...  ...Support data mining, data cleaning, and scenario curation... 
    Internship
    Pipeline
    Data

    XPENG

    Santa Clara, CA
    1 day ago
  • $19 - $65 per hour

    PlusAI is a Physical AI company pioneering...  ...Scania, MAN, and International brands, Hyundai...  ...role, you’ll help build an internal AI assistant...  ..., and Slack data, it uses retrieval...  ...Generation (RAG) pipeline to pull contextual...  ...pipelines to ingest, clean, and structure... 
    Internship
    Pipeline
    Data
    Hourly pay

    PlusAI, Inc.

    Santa Clara, CA
    6 hours ago
  • $31 per hour

     ...innovation empowers us to build a world where...  ...architecture. Interns will work on a variety...  ...software, perception, AI/ML, UI/UX, and...  ...algorithms, perception pipelines, AI/ML models, user...  ...platforms. Write clean, efficient, and maintainable...  ..., Computer Vision, Data Science, Automation... 
    Internship
    Pipeline
    Data
    Hourly pay
    Permanent employment
    Full time
    Local area
    Worldwide

    Wheaton

    Santa Clara, CA
    4 days ago
  • XPENG is seeking an intern to support data analysis for next-generation autonomous driving products...  ...involves analyzing global driving data, building workflows for vector search, and...  ...well as experience with machine learning pipelines. The position offers a competitive compensation... 
    Internship
    Pipeline
    Data

    XPENG

    Santa Clara, CA
    1 day ago
  • $19 - $65 per hour

    PlusAI is seeking a Simulation/ML Engineer Intern to help develop an AI assistant that utilizes natural language processing for querying...  ...lunch, and competitive benefits. The intern will work on building data pipelines, fine-tuning open-source models, and developing secure... 
    Internship
    Pipeline
    Data
    Hourly pay

    Medium

    Santa Clara, CA
    16 hours ago
  •  ...transforming scientific research with novel AI algorithms. Our mission is to empower...  ...full-time Software Engineering Summer Intern position and the second is a full-time...  ...AI / ML engineering intern will help build data pipelines for ingestion of experimental data and... 
    Internship
    Pipeline
    Data
    Full time
    Work experience placement
    Summer work
    Summer internship
    Work from home

    LabAlly

    Santa Clara, CA
    12 hours ago
  • $126.8k - $220.9k

     ...Learning Engineer - Visual Agents - Special Projects...  ...States Machine Learning and AI Description The Special...  ...Engineer to help us build, fine-tune, and...  ...Develop automated evaluation pipelines including LLM-as-judge...  ...large‑scale multimodal data and annotation pipelines... 
    Internship
    Pipeline
    Data
    Relocation

    Apple

    Cupertino, CA
    1 day ago
  •  ...Product Manager For Physical Ai Agents As Product Manager for Physical...  ...agent can do, how developers build with it, what "good behavior"...  ...them. Revisit them when the data says you were wrong....  ...evaluation methodology, data pipelines, harness engineering, RAG, tool... 
    Pipeline
    Data
    Shift work

    Dexmate

    Santa Clara, CA
    6 days ago
  •  ...WorksHub is looking for a versatile Software Developer in Palo Alto, California. The role involves developing AI-native data solutions, working hands-on with LLMs and a variety of AI use cases, including data analysis and code generation. The ideal candidate will have... 
    Pipeline
    Data

    WorksHub

    Palo Alto, CA
    2 days ago
  •  ...product innovations through the use of AI by making it trusted, safe and...  ...platform that enables users to build AI agents that can analyze, reason, act and...  ...Help implement and manage big data systems, including Kafka and data pipelines, to support data-driven decision-... 
    Internship
    Pipeline
    Data

    Brevian.ai

    Sunnyvale, CA
    5 days ago
  • $150k - $400k

     ...Engineer Core Agent OS At Boson AI, we are not just building AI solutions; we are pioneering the...  ...and vector retrieval pipelines for production use. Agentic...  ...and stateful memory. Internal SDK/Framework...  ..., isolation boundaries, data protection protocols, and... 
    Pipeline
    Data

    Boson AI

    Santa Clara, CA
    4 days ago
  • $147.4k - $272.1k

    A leading technology company in Santa Clara seeks a research engineer to build AI infrastructure, design data pipelines, and work alongside world-class researchers. The ideal candidate will have over 5 years of software engineering experience with strong skills in Python... 
    Pipeline
    Data

    Apple Inc.

    Santa Clara, CA
    3 days ago
  •  ...Build and Deploy AI the right way, anywhere. The FlexAI Compute Infrastructure...  ...for a Software Engineering Intern with an interest in...  ...for model serving, inference pipelines, and data processing Work on infrastructure...  ...‑grade systems Write clean, efficient, and well‑tested... 
    Internship
    Pipeline
    Data

    FlexAI

    San Jose, CA
    1 day ago
  • $19 - $65 per hour

     ...PlusAI is a Physical AI company pioneering AI-based...  ...’s Scania, MAN, and International brands, Hyundai Motor Company...  ...video search indexing pipelines to maximize...  ...minimize operational costs. Build Iterative Search Features...  ...to intelligently select data based on contextual factors... 
    Internship
    Pipeline
    Data
    Hourly pay

    PlusAI, Inc.

    Santa Clara, CA
    1 day ago
  • $19 - $65 per hour

     ...PlusAI is a Physical AI company pioneering AI-based...  ...’s Scania, MAN, and International brands, Hyundai Motor Company...  ...‑world, large‑scale data challenges? We’re...  ...Engineer Intern to help build and improve an event mining...  ...our data processing pipelines faster and more efficient... 
    Internship
    Pipeline
    Data
    Hourly pay

    Medium

    Santa Clara, CA
    1 day ago
  • $20.1 - $70.4 per hour

     ...AI/ML Model Development Intern Number of Position(s): 1 Duration: 4 Months Date:...  ...analyze and interpret complex data. Design and implement...  ...machine learning models and pipelines for performance, scalability...  ...Resource Groups (NERGs) and build connections across the... 
    Internship
    Pipeline
    Data
    Hourly pay
    Full time
    Apprenticeship
    Fixed term contract
    Traineeship
    Flexible hours

    Nokia

    Sunnyvale, CA
    5 days ago
  •  ...the industry on cutting-edge AI technology, revolutionizing performance...  ...for AI and a deep desire to build the best AI platform possible....  ...of tensor compute and tensor data movement optimizations kernels...  ...learning frameworks and pipelines. Performance Profiling: Identify... 
    Internship
    Pipeline
    Data
    Permanent employment

    Tenstorrent

    Santa Clara, CA
    5 days ago
  • $168k - $258.75k

     ...product efforts for local AI on Linux and developers...  ...— that enables AI and agents, content creation, and...  ..., and enterprise teams build, run, and deploy AI on...  ...against their private data on-prem. Inference stacks...  ...agent frameworks, RAG pipelines, and evaluation. ~ Working... 
    Pipeline
    Data
    Local area
    Shift work

    NVIDIA

    Santa Clara, CA
    5 days ago
  •  ...Infrastructure Engineer Intern Santa Clara, CA...  ...advanced AI and autonomous driving...  ...connectivity. XPENG is building the next generation...  ..., multi-agent coordination, and automated...  ...interoperability, pipeline automation, and MCP...  ...outputs, structured data validation, or workflow... 
    Internship
    Pipeline
    Data

    XPENG

    Santa Clara, CA
    5 days ago
  • $72.8k

     ...features and bug fixes. Write clean, maintainable, well-documented...  ...Help improve development tools, build pipelines, and automation used by the...  ...computer science fundamentals (data structures, algorithms, OOP)....  ...learning, deep learning, or applied AI. Experience using AI/ML... 
    Internship
    Pipeline
    Data

    Dormont Manufacturing Company

    Mountain View, CA
    1 day ago
  • $123.2k - $189.1k

     ...focus on automation, internal tooling, and AI-assisted...  ...architected, automated pipelines and tools that...  ...agentic workflows to build internal tools...  ...underlying code and data pipelines....  ...Building simple agents or scripts that chain...  ...Experience writing clean, well-tested, and... 
    Internship
    Pipeline
    Data
    Flexible hours

    General Motors

    Sunnyvale, CA
    1 day ago
  • $19 - $65 per hour

    PlusAI is a Physical AI company pioneering AI-based...  ...’s Scania, MAN, and International brands, Hyundai Motor Company...  ...overnight, and new buildings reshape the horizon....  ...end Map Change Detection Pipeline. Your mission will be to process incoming data from our active testing... 
    Internship
    Pipeline
    Data
    Hourly pay
    Permanent employment
    Summer internship
    Night shift

    Medium

    Santa Clara, CA
    16 hours ago
  • $20 - $71 per hour

     ...unlimited potential of AI to define the next era...  ...for a software engineer intern with a strong passion in...  ...learning, and 3D data visualization. NVIDIA plays...  ...team laser focused on building globally scaled HD mapping...  ...algorithms, data pipelines and visualization tools... 
    Internship
    Pipeline
    Data
    Hourly pay

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $170k - $275k

     ...Software Engineer, Agent Harnessing Sunnyvale, California...  ...at scale. At Scout AI, we're developing Fury,...  ...Harnessing team, you will build the core architecture...  ...communication across complex pipelines. Building simulation...  ...AI models and build the data acquisition pipelines... 
    Pipeline
    Data
    Full time
    Relocation package

    Scout AI

    Sunnyvale, CA
    4 days ago
  • $115k

     ...Laboratories is seeking a Data Scientist to advance...  ...intelligent automation, and AI-enabled decision support...  ...learning, and AI agent development to complex,...  ...techniques. Design, build, and deploy AI agents that...  ...scalable data workflows, pipelines, and agent-based systems... 
    Pipeline
    Data
    Work at office
    Local area

    Eurofins USA Material Sciences

    Sunnyvale, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Data Pipeline Intern: Build Clean Data for AI Agents. Be the first to apply!