Sr. AI Data Engineer
$105k - $110k
iSoftStone
Sr. AI Data Engineer (Image Generation Data)
iSoftStone, Inc. is seeking a Sr. AI Data Engineer (Image Generation Data) to join our team! This is a contract onsite opportunity in Menlo Park, CA. This is a one-year contract role, and candidates must have permanent authorization to work in the United States. Visa sponsorship is not available for this role and third-party vendor candidates cannot be considered.
Summary: Generative AI models are only as good as the data they consume. Unlike traditional data engineering, building data pipelines for generative AI requires orchestrating ML model invocations (content understanding classifiers, embedding models, LLM-based cleaners) alongside standard SQL-based transformations, all at billion-row scale. This role sits at the intersection of Data Engineering and ML Systems. The Senior AI Data Engineer will own end-to-end data pipelines that don't just move and transform data, but enrich it through remote model inference, managing the systems complexity of async execution, capacity allocation, retry/fallback logic, and throughput optimization that comes with it. This is not a pure ETL-with-SQL role; it demands hands-on systems experience with distributed inference infrastructure. Our team develops comprehensive data curation and evaluation solutions for image generation models across quality dimensions including visual quality, prompt adherence, identity preservation, naturalness, and visual text generation.
Responsibilities:
Main Responsibilities:
- AI-Augmented Data Pipelines: Design and maintain AI-augmented, large-scale data pipelines (billions of images) integrating traditional transformations with ML models (classifiers, embeddings, LLMs) for cleaning and annotation.
- Remote Inference Orchestration: Own the systems for remote ML model inference orchestration within pipelines, managing batching, retries, async jobs, and ensuring graceful degradation.
- Feature Pipelines: Build and maintain scalable pipelines for generating, storing, and serving vector embeddings, including nearest-neighbor index management and quality validation.
- Data Curation at Scale: Source, filter, and curate training datasets using a combination of SQL and model-derived signals (e.g., aesthetic scores, NSFW classifiers), owning the end-to-end data flow and maintaining governance, quality, and compliance.
Additional Responsibilities:
- LLM-Assisted Annotation: Design and operate pipelines that use LLMs and vision models for automated annotation of training data, including auditing workflows to measure and improve annotation model performance.
- Tooling & Frameworks: Contribute to shared tooling and frameworks that make it easier for the broader team to build AI-augmented data pipelines — e.g., reusable operators for model invocation, standard patterns for async job management.
Qualifications:
- Advanced SQL & data pipeline expertise. Complex queries, query optimization, pipeline orchestration frameworks (Airflow, Dataswarm, or equivalent).
- Experience integrating ML models into data pipelines. Calling inference endpoints, managing model versions, batching requests, handling inference failures at scale.
- Proficiency with AI-assisted coding agents (e.g., Copilot, Cursor, Codex). Expected to leverage AI tools as a force multiplier for writing, debugging, and reviewing code, building pipelines faster, and accelerating day-to-day engineering workflows.
- Strong verbal and written communication skills, problem-solving ability, and cross-functional collaboration.
Preferred:
- Working knowledge of embeddings and vector representations: generating, storing, indexing, and querying embeddings (FAISS, Milvus, or equivalent).
- Familiarity with content-understanding models such as image classifiers, object detection, OCR, NSFW detection, and aesthetic scoring.
- Experience using LLMs for data tasks, such as prompt engineering for annotation, data cleaning, or evaluation via LLM APIs.
- Knowledge of generative AI, including diffusion models, image generation, and evaluation metrics (FID, CLIP score, etc.).
Education / Experience:
- Bachelor's degree or higher in Computer Science, Data Engineering, Machine Learning, or a related STEM field.
- 5+ years of industry experience in data engineering, ML engineering, or a hybrid role involving both data pipelines and model serving/inference.
- Demonstrated track record of building and operating production data pipelines that invoke ML models at scale.
- Previous experience at Meta is preferred but not required.
Additional Requirements:
- Work onsite in Menlo Park (MPK) 5 days per week, working closely with engineers and researchers.
Primary Location Pay Range: $105,000 - $110,000 per year
Benefits:
- 1099/Contractors: No benefits
- Temp salaried employee benefits, if scheduled to work at least 30 hours per week: medical, dental, vision, 401k, holidays.
iSoftStone is a global IT service and consulting company that creates value and drives success through technology solutions, service excellence, and digital innovation. We specialize in web and application development, software testing and support, data and content management, digital experience, accessibility, and data for machine learning and AI. With 20 delivery centers and more than 90,000 employees worldwide, iSoftStone is proud to serve some of the world's most well-known businesses, including 90+ Fortune Global 500 companies.

