Sr. AI Data Engineer

$105k - $110k

iSoftStone

Sr. AI Data Engineer (Image Generation Data)

iSoftStone, Inc. is seeking a Sr. AI Data Engineer to join our team! This is a contract onsite opportunity in Menlo Park, CA. This is a one-year contract role, and candidates must have permanent authorization to work in the United States. Visa sponsorship is not available for this role and third-party vendor candidates cannot be considered.

Summary:

Generative AI models are only as good as the data they consume. Unlike traditional data engineering, building data pipelines for generative AI requires orchestrating ML model invocations (content understanding classifiers, embedding models, LLM-based cleaners) alongside standard SQL-based transformations, all at billion-row scale. This role sits at the intersection of Data Engineering and ML Systems. The Senior AI Data Engineer will own end-to-end data pipelines that don't just move and transform data, but enrich it through remote model inference, managing the systems complexity of async execution, capacity allocation, retry/fallback logic, and throughput optimization that comes with it. This is not a pure ETL-with-SQL role; it demands hands-on systems experience with distributed inference infrastructure.

Our team develops comprehensive data curation and evaluation solutions for image generation models across quality dimensions including visual quality, prompt adherence, identity preservation, naturalness, and visual text generation.

Responsibilities:

  • AI-Augmented Data Pipelines: Design and maintain AI-augmented, large-scale data pipelines (billions of images) integrating traditional transformations with ML models (classifiers, embeddings, LLMs) for cleaning and annotation.
  • Remote Inference Orchestration: Own the systems for remote ML model inference orchestration within pipelines, managing batching, retries, async jobs, and ensuring graceful degradation.
  • Feature Pipelines: Build and maintain scalable pipelines for generating, storing, and serving vector embeddings, including nearest-neighbor index management and quality validation.
  • Data Curation at Scale: Source, filter, and curate training datasets using a combination of SQL and model-derived signals (e.g., aesthetic scores, NSFW classifiers), owning the end-to-end data flow and maintaining governance, quality, and compliance.
  • LLM-Assisted Annotation: Design and operate pipelines that use LLMs and vision models for automated annotation of training data, including auditing workflows to measure and improve annotation model performance.
  • Tooling & Frameworks: Contribute to shared tooling and frameworks that make it easier for the broader team to build AI-augmented data pipelines — e.g., reusable operators for model invocation, standard patterns for async job management.
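
For candidates gauging what "remote inference orchestration" means in practice, the batching, retry, async, and graceful-degradation pattern named above can be sketched roughly as follows. This is an illustrative sketch only, not the team's actual framework: `infer_with_retry`, `run_pipeline`, and `FlakyModel` are hypothetical names, and the model endpoint is simulated with an in-process stand-in.

```python
import asyncio
from typing import Awaitable, Callable, List, Optional, Sequence

# Hypothetical type: an async callable that scores one batch of items.
InferFn = Callable[[Sequence[str]], Awaitable[List[float]]]


async def infer_with_retry(
    infer: InferFn,
    batch: Sequence[str],
    max_attempts: int = 3,
    base_delay: float = 0.01,
) -> List[Optional[float]]:
    """Score one batch, retrying with exponential backoff.

    If every attempt fails, degrade gracefully by returning None per item
    so downstream steps can skip or re-queue those rows.
    """
    for attempt in range(max_attempts):
        try:
            return list(await infer(batch))
        except Exception:
            if attempt < max_attempts - 1:
                await asyncio.sleep(base_delay * (2 ** attempt))
    return [None] * len(batch)


async def run_pipeline(
    infer: InferFn,
    items: Sequence[str],
    batch_size: int = 4,
    max_concurrency: int = 2,
) -> List[Optional[float]]:
    """Split items into batches and score them with bounded concurrency."""
    sem = asyncio.Semaphore(max_concurrency)
    batches = [items[i:i + batch_size] for i in range(0, len(items), batch_size)]

    async def guarded(batch: Sequence[str]) -> List[Optional[float]]:
        async with sem:  # cap the number of in-flight requests
            return await infer_with_retry(infer, batch)

    # gather preserves batch order, so outputs line up with inputs.
    results = await asyncio.gather(*(guarded(b) for b in batches))
    return [score for batch_result in results for score in batch_result]


class FlakyModel:
    """Simulated endpoint (assumption): fails its first N calls, then
    returns the string length of each item as a fake score."""

    def __init__(self, fail_first_n: int) -> None:
        self.calls = 0
        self.fail_first_n = fail_first_n

    async def __call__(self, batch: Sequence[str]) -> List[float]:
        self.calls += 1
        if self.calls <= self.fail_first_n:
            raise RuntimeError("simulated endpoint failure")
        return [float(len(x)) for x in batch]


if __name__ == "__main__":
    model = FlakyModel(fail_first_n=1)
    print(asyncio.run(run_pipeline(model, ["a", "bb", "ccc"], batch_size=2)))
```

Returning `None` for items whose batch never succeeds keeps the output aligned with the input, which is one simple way to let the rest of a pipeline continue while failed rows are retried later.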

Qualifications:

  • Advanced SQL & data pipeline expertise. Complex queries, query optimization, pipeline orchestration frameworks (Airflow, Dataswarm, or equivalent).
  • Experience integrating ML models into data pipelines. Calling inference endpoints, managing model versions, batching requests, handling inference failures at scale.
  • Proficiency with AI-assisted coding agents (e.g., Copilot, Cursor, Codex). Expected to leverage AI tools as a force multiplier for writing, debugging, and reviewing code, building pipelines faster, and accelerating day-to-day engineering workflows.
  • Strong verbal and written communication skills, problem-solving ability, and cross-functional collaboration.

Preferred:

  • Working knowledge of embeddings and vector representations, including generating, storing, indexing, and querying embeddings (FAISS, Milvus, or equivalent).
  • Familiarity with content-understanding models such as image classifiers, object detection, OCR, NSFW detection, and aesthetic scoring.
  • Experience with LLMs for data tasks such as prompt engineering for annotation, data cleaning, or evaluation using LLM APIs.
  • Knowledge of generative AI, including diffusion models, image generation, and evaluation metrics (FID, CLIP score, etc.).
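
As a rough illustration of what "querying embeddings" involves, the sketch below runs a brute-force nearest-neighbor search by cosine similarity in plain Python. It is an assumption-laden toy: the function names and the tiny in-memory `index` are hypothetical, and a production system at this scale would use an approximate index such as the libraries named above rather than a linear scan.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(
    query: Sequence[float],
    index: Dict[str, Sequence[float]],
    k: int = 2,
) -> List[Tuple[str, float]]:
    """Return the k entries most similar to `query`, best first (linear scan)."""
    scored = [(key, cosine_similarity(query, vec)) for key, vec in index.items()]
    scored.sort(key=lambda kv: kv[1], reverse=True)
    return scored[:k]


if __name__ == "__main__":
    # Hypothetical 2-D embeddings keyed by item ID.
    index = {"cat": [1.0, 0.0], "dog": [0.9, 0.1], "car": [0.0, 1.0]}
    print(top_k([1.0, 0.0], index, k=2))
```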

Education / Experience:

  • Bachelor's degree or higher in Computer Science, Data Engineering, Machine Learning, or a related STEM field.
  • 5+ years of industry experience in data engineering, ML engineering, or a hybrid role involving both data pipelines and model serving/inference.
  • Demonstrated track record of building and operating production data pipelines that invoke ML models at scale.
  • Previous experience at Meta is preferred but not required.

Additional Requirements:

  • Work onsite in Menlo Park (MPK) 5 days per week, working closely with engineers and researchers.

Primary Location Pay Range: $105,000 - $110,000 per year

Benefits:

  • 1099/Contractors: No benefits
  • Temp salaried employee benefits, if scheduled to work at least 30 hours per week: medical, dental, vision, 401k, holidays.

Vacancy posted 1 day ago
