Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Sr. AI Data Engineer

$105k - $110k

iSoftStone

Sr. AI Data Engineer (Image Generation Data)

iSoftStone, Inc. is seeking a Sr. AI Data Engineer (Image Generation Data) to join our team! This is a contract onsite opportunity in Menlo Park, CA. This is a one-year contract role, and candidates must have permanent authorization to work in the United States. Visa sponsorship is not available for this role and third-party vendor candidates cannot be considered.

Summary: Generative AI models are only as good as the data they consume. Unlike traditional data engineering, building data pipelines for generative AI requires orchestrating ML model invocations (content understanding classifiers, embedding models, LLM-based cleaners) alongside standard SQL-based transformations, all at billion-row scale. This role sits at the intersection of Data Engineering and ML Systems. The Senior AI Data Engineer will own end-to-end data pipelines that don't just move and transform data, but enrich it through remote model inference, managing the systems complexity of async execution, capacity allocation, retry/fallback logic, and throughput optimization that comes with it. This is not a pure ETL-with-SQL role; it demands hands-on systems experience with distributed inference infrastructure. Our team develops comprehensive data curation and evaluation solutions for image generation models across quality dimensions including visual quality, prompt adherence, identity preservation, naturalness, and visual text generation.

Responsibilities:

Main Responsibilities:

  • AI-Augmented Data Pipelines: Design and maintain AI-augmented, large-scale data pipelines (billions of images) integrating traditional transformations with ML models (classifiers, embeddings, LLMs) for cleaning and annotation.
  • Remote Inference Orchestration: Own the systems for remote ML model inference orchestration within pipelines, managing batching, retries, async jobs, and ensuring graceful degradation.
  • Feature Pipelines: Build and maintain scalable pipelines for generating, storing, and serving vector embeddings, including nearest-neighbor index management and quality validation.
  • Data Curation at Scale: Source, filter, and curate training datasets using a combination of SQL and model-derived signals (e.g., aesthetic scores, NSFW classifiers), owning the end-to-end data flow and maintaining governance, quality, and compliance.

Additional Responsibilities:

  • LLM-Assisted Annotation: Design and operate pipelines that use LLMs and vision models for automated annotation of training data, including auditing workflows to measure and improve annotation model performance.
  • Tooling & Frameworks: Contribute to shared tooling and frameworks that make it easier for the broader team to build AI-augmented data pipelines — e.g., reusable operators for model invocation, standard patterns for async job management.

Qualifications:

  • Advanced SQL & data pipeline expertise. Complex queries, query optimization, pipeline orchestration frameworks (Airflow, Dataswarm, or equivalent).
  • Experience integrating ML models into data pipelines. Calling inference endpoints, managing model versions, batching requests, handling inference failures at scale.
  • Proficiency with AI-assisted coding agents (e.g., Copilot, Cursor, Codex). Expected to leverage AI tools as a force multiplier for writing, debugging, and reviewing code, building pipelines faster, and accelerating day-to-day engineering workflows.
  • Strong verbal and written communication skills, problem-solving ability, and cross-functional collaboration.

Preferred:

  • Working knowledge of embeddings and vector representations like generating, storing, indexing, and querying embeddings (FAISS, Milvus, or equivalent).
  • Familiarity with content-understanding models like image classifiers, object detection, OCR, NSFW detection, aesthetic scoring.
  • Experience with LLMs for data tasks like prompt engineering for annotation, data cleaning, or evaluation using LLM APIs.
  • Knowledge of generative AI like diffusion models, image generation, evaluation metrics (FID, CLIP score, etc.).

Education / Experience:

  • Bachelor's degree or higher in Computer Science, Data Engineering, Machine Learning, or a related STEM field.
  • 5+ years of industry experience in data engineering, ML engineering, or a hybrid role involving both data pipelines and model serving/inference.
  • Demonstrated track record of building and operating production data pipelines that invoke ML models at scale.
  • Previous experience at Meta is preferred but not required.

Additional Requirements:

  • Work onsite in MPK 5 days per week, working closely with engineers and researchers.

Primary Location Pay Range: $105,000 - $110,000 per year Benefits: 1099/Contractors: No benefits Temp salaried employee benefits, if scheduled to work at least 30 hours per week: medical, dental, vision, 401k, holidays.

iSoftStone is a global IT service and consulting company that creates value and drives success through technology solutions, service excellence, and digital innovation. We specialize in web and application development, software testing and support, data and content management, digital experience, accessibility, and data for machine learning and AI. With 20 delivery centers and more than 90,000 employees worldwide, iSoftStone is proud to serve some of the world's most well-known businesses, including 90+ Fortune Global 500 companies.

Vacancy posted 6 hours ago
Similar jobs that could be interesting for youBased on the Sr. AI Data Engineer in Menlo Park, CA vacancy
  • $100k - $300k

     ...We are seeking a forward-thinking AI Data Engineer to bridge the gap between our user data assets and advanced AI capabilities. In this role, you will be the architect of our user data foundation, building a robust data warehouse and a dynamic tagging system. Crucially... 
    Senior
    Full time

    OPPO US Research Center

    Palo Alto, CA
    7 hours ago
  • A leading AI technology company in Palo Alto seeks an AI Data Engineer to design a robust User Data Warehouse and integrate it with advanced AI capabilities. Responsibilities include establishing a tagging system and implementing pipelines for real-time data availability... 
    Senior

    OPPO US Research Center

    Palo Alto, CA
    2 days ago
  • $119k - $299.93k

     ...processes and related controls. Those in data, analytics and technology solutions at PwC...  ...the design and deployment of enterprise AI/ML solutions, setting architecture standards...  ...years of professional AI/ML development, engineering, or testing experience. What Sets You... 
    Senior
    Full time
    H1b

    PwC

    Palo Alto, CA
    2 days ago
  •  ...AI-Native Data Engineer @ TrueMeter SF Bay Area | Hybrid (3 days onsite, 2 remote) About Us We’re building the AI Energy Agent that’s becoming the default way any business pays for power and saves on energy. The grid is breaking under the weight of AI and electrification... 
    Suggested
    Immediate start
    Remote work

    Pear VC

    Palo Alto, CA
    2 days ago
  •  ...A global consulting firm is seeking a Senior AI Native Engineer in California. You will research and implement AI systems tailored to diverse business environments while collaborating with a talented team. The ideal candidate has a Bachelor's degree, strong Python skills... 
    Senior

    Ernst & Young Oman

    Palo Alto, CA
    2 days ago
  • $203.45k - $344.3k

     ...Senior Staff Physical AI Data Algorithm Engineer Santa Clara, CA XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electric... 
    Senior
    Full time
    Temporary work
    Work experience placement

    XPENG

    Santa Clara, CA
    3 days ago
  • $240k - $280k

    Pantera Capital is hiring a Data Engineer / AI Engineer in Palo Alto, California. This role focuses on developing systems for data acquisition and quality evaluation, ensuring high-quality training data for AI models. Responsibilities include building scalable data pipelines... 

    Pantera Capital

    Palo Alto, CA
    2 days ago
  • Maxxd see Tesla is looking for a Data Engineer / AI Engineer to develop systems for data acquisition, preparation, and delivery to enhance model training. You will analyze data performance, build scalable pipelines, and ensure high-quality training data. The ideal candidate... 

    Maxxd see Tesla

    Palo Alto, CA
    14 hours ago
  •  ...A pioneering energy management company in the SF Bay Area is seeking an AI-Native Data Engineer to own the data and AI infrastructure. The candidate will build high-reliability data pipelines, design GCP infrastructure for scalability, and have a strong background in production... 

    Pear VC

    Palo Alto, CA
    2 days ago
  • $228.4k - $303.55k

     ...A leading data and AI company in Mountain View is seeking an experienced Sr. Staff Software Engineer to design and run the Data Intelligence Platform. The candidate must have extensive experience in building large-scale distributed systems and technical leadership. Responsibilities... 
    Senior
    Full time

    Databricks

    Mountain View, CA
    2 days ago
  • $169.8k - $233.5k

     ...Uniphore is one of the largest B2B AI-native companies-decades-proven, built-for-scale...  ...and text and how to analyze all types of data. As AI becomes more powerful,...  ...employees. Job Description: SR AI Engineer Uniphore is a leading B2B AI-native... 
    Senior

    Uniphore

    Palo Alto, CA
    4 days ago
  • $116.2k - $269.1k

     ...Sr. AI Software Engineer – Coding Agent Tencent Overseas IT supports rapid global growth with future‑ready IT platforms and leads strategy, architecture, and execution to empower game studios. Responsibilities Architect and lead the development of a robust, scalable... 
    Senior
    Overseas
    Relocation package

    Tencent

    Palo Alto, CA
    2 days ago
  •  ...A technology firm is seeking a Senior Data / AI / ML Software Engineer in Palo Alto, California. The ideal candidate will have over 7 years of experience in AI/ML engineering and strong software fundamentals. Responsibilities include designing data extractors, working... 
    Senior
    Full time

    Next Ventures

    Palo Alto, CA
    2 days ago
  • $162.8k - $203.5k

     ...Rivian AI Engineer Position Rivian is on a mission to keep the world adventurous forever. This goes for the emissions-free electric adventure...  ...As an AI Engineer, you will contribute across the stack, from data pipelines and retrieval to prompt/agent logic, evaluation/... 
    Senior
    Full time
    Contract work
    Temporary work
    Part time
    Local area
    Shift work

    Rivian

    Palo Alto, CA
    3 days ago
  •  ...Senior It Ai/Ml Engineer At Palo Alto Networks®, we're united by a shared mission—to protect our digital way of life. We thrive at the...  ...Finance & Marketing). You will work closely with Principal and Data leads to translate high-level strategies into concrete AI solution... 
    Senior
    Full time
    Work at office
    Visa sponsorship
    Work visa

    Palo Alto Networks

    Palo Alto, CA
    3 days ago
  • $66.52 - $88.14 per hour

     ...This is a Stanford Health Care job. This role will develop data ingestion processes to the SHC Enterprise Data Platform (Lake),...  ...operational results. A Brief Overview The Senior Enterprise Data Engineer is responsible for designing and creating complex pipelines and... 
    Senior
    Hourly pay

    Stanford Health Care

    Palo Alto, CA
    2 days ago
  • $181.1k - $318.4k

     ...Cupertino, California is searching for a Senior Software Engineer to design and implement AI-powered software systems in compliance with regulatory standards...  ...in modern engineering practices, AI integration, and data pipeline development. Collaborate closely with domain... 
    Senior

    Apple Inc.

    Cupertino, CA
    1 day ago
  • $120.1k - $225.7k

     ...design technical roadmaps and mentor team members to build a robust AI inference technical ecosystem. Who We Look For...  ...Experience: Master's or Ph.D. in Computer Science, Electronic Engineering, AI, or related fields; significant professional experience in... 
    Senior
    Relocation package

    Tencent

    Palo Alto, CA
    4 days ago
  • $174.72k - $295.68k

     ...Senior AI Data Infrastructure/Pipeline Engineer Santa Clara, CA XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electric... 
    Senior
    Full time
    Overseas

    XPENG

    Santa Clara, CA
    1 day ago
  • $133k - $254k

     ...United States 42dot Full-time About Us 42dot is a mobility AI company committed to solving mobility challenges with...  ...self‑managing urban transportation operating system. Our AI Data Pipeline Engineers build up the core data processing pipelines and datasets readiness... 
    Senior
    Full time
    Work experience placement

    42dot Inc.

    Sunnyvale, CA
    2 days ago
  • $145.1k - $273.2k

     ...-depth research into the underlying hardware logic of various AI accelerators ; evaluate the power-efficiency ratio and suitability...  ...For 1.Education: Master's or Ph.D. degree in Computer Engineering, Electronic Engineering, Microelectronics, or a related field.... 
    Senior
    Relocation package

    Tencent

    Palo Alto, CA
    4 days ago
  • $116.2k - $269.1k

     ...execution, aiming to support our game studios and become a world‑class global IT team. Position Overview We are seeking a Sr. AI Software Engineer to architect and lead the development of a groundbreaking in‑house AI Coding Agent. This tool will accelerate our game developers... 
    Senior
    Overseas
    Relocation package

    Lightspeed Studios

    Palo Alto, CA
    1 day ago
  • $168k - $230k

    Sr. Security Software Engineer, Applied Computing (Starshield) Starshield leverages SpaceX’s Starlink technology...  ...Software Engineer, you will leverage AI to automate security‑related efforts...  ...prompt injection, jailbreaking, and data exfiltration Experience deploying and... 
    Senior
    Permanent employment
    Temporary work
    Immediate start
    Flexible hours
    Weekend work

    Latent AI

    Palo Alto, CA
    2 days ago
  •  ...combining our expertise across connectivity, AI, security and more, we'll map a new way...  ...the gap between hardware validation and big data. You'll develop the software that controls...  ...designers, validation and reliability engineers to define data schemas and standards for ECU... 
    Senior
    Full time
    Contract work
    Flexible hours

    Rivian VW Group

    Palo Alto, CA
    1 day ago
  • $100k - $216k

    What To Expect We are seeking a skilled and collaborative Data Engineer to join our team. In this role, you will architect and implement a cutting data platform while leading the development of data pipelines, data warehousing, and reporting infrastructure to support the... 
    Senior
    Hourly pay
    Full time
    Temporary work
    Flexible hours

    Tesla

    Palo Alto, CA
    14 hours ago
  • $148.6k - $306.3k

     ...Learning powerhouse, utilizing the power of AI to build platform, for current day...  ...responsibilities of an AI / Machine Learning Engineer within SAP BTP Fabric will include work at...  ...Master’s degree in Computer Science, Data Science, Machine Learning, with equivalent... 
    Senior
    Permanent employment
    Full time
    Work experience placement
    Worldwide
    Flexible hours

    SAP

    Stanford, CA
    2 days ago
  • $22 - $29 per hour

     ...Profession (Job Category): IT, Telecom & Internet Job Schedule: Full time Remote: No Job Description: AI and Data Engineer Company: Hitachi America, Ltd. Division: Business Innovation & Digital Transformation Location: Santa Clara,... 
    Hourly pay
    Permanent employment
    Full time
    Temporary work
    Work at office
    Remote work
    Worldwide

    Hitachi

    Santa Clara, CA
    8 hours ago
  •  ...Illumio is hiring a Senior Software Engineer in Sunnyvale, California, to architect high-scale distributed systems, focusing on data processing and real-time analytics. The role requires expertise in backend engineering with Java, Python, or Go, and strong knowledge in... 
    Senior

    Illumio

    Sunnyvale, CA
    2 days ago
  •  ...A fintech company in Menlo Park is seeking a Senior Software Engineer, Data Engineering to build and maintain data pipelines, collaborating with various teams to empower analytics and decision-making. The ideal candidate has over 5 years of experience with a strong command... 
    Senior

    Robinhood

    Menlo Park, CA
    2 days ago
  •  ...Snowflake is seeking a Senior Software Engineer for the Data Clean Rooms team in Menlo Park, California. The role involves architecting scalable infrastructure for secure multi-party collaboration and requires 7+ years of experience with large-scale distributed systems... 
    Senior

    Snowflake Computing

    Menlo Park, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Sr. AI Data Engineer. Be the first to apply!