Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

NEW JOB OPENING DATA ENGINEER IN Menlo Park, CA, USA!

Rose International

Job Description
Client Location: Onsite 5 days a week in Menlo Park, CA

REQUIRED

- Bachelor's degree or higher in Computer Science, Data Engineering, Machine Learning, or a related STEM field.

- 5+ years of industry experience in data engineering, ML engineering, or a hybrid role involving both data pipelines and model serving/inference.

- Demonstrated track record of building and operating production data pipelines that invoke ML models at scale.

- Strong software engineering fundamentals. Python, data structures, concurrency/async programming.

- Advanced SQL & data pipeline expertise. Complex queries, query optimization, pipeline orchestration frameworks (Airflow, Dataswarm, or equivalent).

- Experience integrating ML models into data pipelines. Calling inference endpoints, managing model versions, batching requests, handling inference failures at scale.

- Proficiency with AI-assisted coding agents (e.g., Copilot, Cursor, Codex). Expected to leverage AI tools as a force multiplier for writing, debugging, and reviewing code, building pipelines faster, and accelerating day-to-day engineering workflows

- Strong verbal and written communication skills, problem-solving ability, and cross-functional collaboration.

PREFERRED

- Working knowledge of embeddings and vector representations like generating, storing, indexing, and querying embeddings (FAISS, Milvus, or equivalent).

- Familiarity with content-understanding models like image classifiers, object detection, OCR, NSFW detection, aesthetic scoring.

- Experience with LLMs for data tasks like prompt engineering for annotation, data cleaning, or evaluation using LLM APIs.

- Knowledge of generative AI like diffusion models, image generation, evaluation metrics (FID, CLIP score, etc.).

Job Description:

Generative AI models are only as good as the data they consume. Unlike traditional data engineering, building data pipelines for generative AI requires orchestrating ML model invocations (content understanding classifiers, embedding models, LLM-based cleaners) alongside standard SQL-based transformations, all at billion-row scale.

This role sits at the intersection of Data Engineering and ML Systems. The Senior AI Data Engineer will own end-to-end data pipelines that don't just move and transform data, but enrich it through remote model inference, managing the systems complexity of async execution, capacity allocation, retry/fallback logic, and throughput optimization that comes with it. This is not a pure ETL-with-SQL role; it demands hands-on systems experience with distributed inference infrastructure.

Our team develops comprehensive data curation and evaluation solutions for image generation models across quality dimensions including visual quality, prompt adherence, identity preservation, naturalness, and visual text generation.

Job Responsibilities

Main Responsibilities

AI-Augmented Data Pipelines: Design and maintain AI-augmented, large-scale data pipelines (billions of images) integrating traditional transformations with ML models (classifiers, embeddings, LLMs) for cleaning and annotation.

Remote Inference Orchestration: Own the systems for remote ML model inference orchestration within pipelines, managing batching, retries, async jobs, and ensuring graceful degradation.

Feature Pipelines: Build and maintain scalable pipelines for generating, storing, and serving vector embeddings, including nearest-neighbor index management and quality validation.

Data Curation at Scale: Source, filter, and curate training datasets using a combination of SQL and model-derived signals (e.g., aesthetic scores, NSFW classifiers), owning the end-to-end data flow and maintaining governance, quality, and compliance.

Additional Responsibilities

LLM-Assisted Annotation: Design and operate pipelines that use LLMs and vision models for automated annotation of training data, including auditing workflows to measure and improve annotation model performance.

Tooling & Frameworks: Contribute to shared tooling and frameworks that make it easier for the broader team to build AI-augmented data pipelines - e.g., reusable operators for model invocation, standard patterns for async job management.

Education / Experience

Bachelor's degree or higher in Computer Science, Data Engineering, Machine Learning, or a related STEM field.

5+ years of industry experience in data engineering, ML engineering, or a hybrid role involving both data pipelines and model serving/inference.

Pursuant to the California Fair Chance Act, Los Angeles County Fair Chance Ordinance for Employers, Los Angeles Fair Chance Initiative for Hiring Ordinance, and San Francisco Fair Chance Ordinance, qualified applicants will be considered for assignment with arrest and conviction records. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness, meet client expectations, standards, and accompanying requirements, and safeguard business operations and company reputation.
  • **Only those lawfully authorized to work in the designated country associated with the position will be considered.**
  • **Please note that all Position start dates and duration are estimates and may be reduced or lengthened based upon a client's business needs and requirements.**


Benefits:
For information and details on employment benefits offered with this position, please visit here . Should you have any questions/concerns, please contact our HR Department via our secure website .

California Pay Equity:
For information and details on pay equity laws in California, please visit the State of California Department of Industrial Relations' website here .

Rose International is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, age, sex, sexual orientation, gender (expression or identity), national origin, arrest and conviction records, disability, veteran status or any other characteristic protected by law. Positions located in San Francisco and Los Angeles, California will be administered in accordance with their respective Fair Chance Ordinances.

If you need assistance in completing this application, or during any phase of the application, interview, hiring, or employment process, whether due to a disability or otherwise, please contact our HR Department .

Rose International has an official agreement (ID #132522), effective June 30, 2008, with the U.S. Department of Homeland Security, U.S. Citizenship and Immigration Services, Employment Verification Program (E-Verify). (Posting required by OCGA 13/10-91.).
Vacancy posted 8 hours ago
Similar jobs that could be interesting for youBased on the NEW JOB OPENING DATA ENGINEER IN Menlo Park, CA, USA! in Menlo Park, CA vacancy
  • $140k - $200k

     ...office. These include frontend and backend engineers, AI research scientists, and others from...  ...We're looking to hire for our Data side of our AI team at Speechify. This role...  ...What You'll Do Be scrappy to find new sources of audio data and bring it into our... 
    Suggested
    Full time
    Work at office
    Shift work

    Speechify

    Menlo Park, CA
    16 days ago
  •  ...This position operates on-site in Menlo Park, CA. This position is not remote or hybrid....  ...must be able to effectively comprehend data and compose clear and effective communications...  ...analytical skills; experience monitoring open sources to proactively identify physical... 
    Suggested
    Work experience placement
    Shift work
    Night shift

    Crisis24

    Menlo Park, CA
    3 days ago
  • $150k - $250k

     ...Machine Learning Engineer Goldman Sachs is a leading...  ...firm is headquartered in New York and maintains...  ...engineering, and application of data science and machine...  ...framework for open-source and foundational...  ...026, 04:17 PM Locations Menlo Park, CA, United States... 
    Suggested
    Full time
    Temporary work
    Part time
    Worldwide

    The Goldman Sachs Group, Inc.

    Menlo Park, CA
    3 days ago
  •  ...I have an opportunity for "Sr Java Developer with Microservices and Kubernetes - Menlo Park, CA - HYBRID - Locals" and I am looking for a candidate who can join Immediately if you are interested, reply to me with your updated resume or if you could refer someone I would... 
    Suggested
    Local area
    Immediate start

    Navtech

    Atherton, CA
    2 days ago
  • $140k - $200k

     ...setting - Speechify has no office. These include frontend and backend engineers, AI research scientists, and others from Amazon, Microsoft, and...  ...find the need for a Senior iOS Engineer to help us support the new user base as well as work on new and exciting projects to push... 
    Suggested
    Work at office
    Remote work

    Speechify

    Menlo Park, CA
    3 days ago
  • $196k - $230k

     ...and so are the rewards. The Data Engineering team builds and maintains...  ...This role is based in our Menlo Park, CA office, with in-person attendance...  ...-scale data pipelines using open source frameworks (Spark,...  ...: Zone 1 (Menlo Park, CA; New York, NY; Bellevue, WA; Washington... 
    Work at office
    Flexible hours
    Shift work
    3 days per week

    Robinhood

    Menlo Park, CA
    14 hours ago
  •  ...Days a Week Onsite ~ Open to Alpharetta, GA or Menlo Park, CA locations. ~ Please put...  ...responsible to stream terabytes of data daily. ~ We have built...  ...for streaming, data engineer ~ Understand...  ...teams. ~ Able to learn new technologies and work independently... 
    Permanent employment
    Work experience placement
    Local area
    3 days per week

    3B Staffing LLC

    Menlo Park, CA
    3 days ago
  • $140k - $200k

     ...Senior Software Engineer, Windows/Desktop Applications The mission of Speechify is to make sure that reading is never a barrier to learning...  ...software engineering fundamentals: OOP, design patterns, data structures, algorithms, memory management, multi-threading or asynchronous... 
    Work at office

    Speechify

    Menlo Park, CA
    2 days ago
  • $35 - $39 per hour

    Data Engineer III itD is seeking a Senior AI Data Engineer III to build and scale AI-augmented...  ...workflows. Location: Hybrid Onsite - Menlo Park, CA (required onsite collaboration with...  ...Description About itD: We are part of a new generation of consulting and software development... 
    Hourly pay
    Remote work

    itD Tech

    Menlo Park, CA
    8 hours ago
  • $140k - $200k

     ...Android Core Product Palo Alto, CA, USA The mission of Speechify is...  ...include frontend and backend engineers, AI research scientists, and...  ...to help us support the new user base as well as work on new...  ...or otherwise contributing to open source projects in Android... 
    Work at office
    Remote work
    Night shift

    Speechify

    Palo Alto, CA
    4 days ago
  • $179k - $210k

     ...the rewards. Robinhood’s Analytics Engineering team, part of the Data Science organization, is the backbone...  .... This role is based in our Menlo Park, CA office, with in-person attendance expected...  ...Pay Range: Zone 1 (Menlo Park, CA; New York, NY; Bellevue, WA; Washington,... 
    Work at office
    Flexible hours
    Shift work
    3 days per week

    Robinhood

    Menlo Park, CA
    4 days ago
  • $187k - $198k

     ...metrics driven company and data is foundational to all key decisions...  .... We are looking for a Data Engineer to build and maintain...  ...-scale data pipelines using open source frameworks (Spark, Flink...  ...Base Pay Range: Zone 1 (Menlo Park, CA; New York, NY; Bellevue, WA;... 
    Work at office
    Shift work

    Robinhood

    Menlo Park, CA
    more than 2 months ago
  •  ...Data Systems Engineer - ELK/Kafka/Linux Alpharetta, GA or Menlo Park, CA - Hybrid 3 Days a Week Onsite 12 months+ Interview Process: # Screening of Technical Background...  ...communication, Team Player and someone who is open to learning and getting their hands dirty.... 
    3 days per week

    Veterans Sourcing Group LLC

    Menlo Park, CA
    14 hours ago
  • $156k - $187k

     ...healthcare company, pioneering new technologies to advance...  ...of scientists, engineers, and physicians and we...  ...art computer science and data science to overcome one...  ...a hybrid role based in Menlo Park, CA (moving to Sunnyvale,...  ...accommodation to apply for an open position. GRAIL... 
    16 hours
    Full time
    Local area
    Flexible hours

    GRAIL, Inc.

    Menlo Park, CA
    3 days ago
  • $156k - $226k

     ...located in our Playa Vista, CA campus. Applicants in...  ...window will be open until at least May 29,...  ...location from the following: New York, NY, USA; Atlanta, GA, USA;...  ...of experience designing data pipelines, and dimensional...  ...Organization (GBO) is the engine that powers Google’s... 
    Full time

    Google

    Mountain View, CA
    3 days ago
  •  ...security and more, we’ll map a new way forward. Working...  ...We are seeking an AI Engineer Intern to contribute to...  ...models. About the About the Data Engineering, AI &...  ...8 Location: Palo Alto, CA By submitting a single...  ...projects, publications, or open-source software. Familiarity... 
    Full time
    Contract work
    Internship
    Local area

    Rivian and Volkswagen Group Technologies

    Palo Alto, CA
    7 hours ago
  •  ...Join to apply for the Junior Data Annotation Engineer (Remote) role at Jobright....  ...connect you with verified openings from employers you can trust...  ...is headquartered in Wayne, New Jersey, USA, with a team of 201-500...  ...Engineer" roles.Palo Alto, CA $95,000.00-$215,000.00 2 weeks... 
    Remote work

    jobright.com

    Santa Clara, CA
    3 days ago
  •  ...and help to build a better working world. Technology – Data and Decision Science – Data Engineering – Senior We are seeking a highly skilled Senior Consultant...  ...EY is building a better working world by creating new value for clients, people, society and the planet, while... 

    Ernst & Young Oman

    Palo Alto, CA
    4 days ago
  •  ...Junior Data Engineer Location - 5 Days Onsite in Alpharetta, GA or Menlo Park, CA Locals Only Our client, a top tier IT Consulting firm is looking for a Junior Data Engineer to join a Top-Tier Investment Bank. Essential Requirements and Responsibilities ~... 
    Local area

    RIT Solutions

    Menlo Park, CA
    14 hours ago
  •  ...Data Engineer (Splunk) Job Location: Menlo Park, CA Job Type: Full-Time / Contract Must Haves: ~10+ years of business and data engineering/analysis experience in Cloud Data Warehouse platforms. ~ Strong experience working directly with business teams to... 
    Full time
    Contract work

    InterSources

    Menlo Park, CA
    4 days ago
  • $117k - $234k

     ...critical business initiatives with Data. You love writing and owning...  ...on this data. Partner with engineering, AI/ML, and product teams to...  ...across cloud-native and open-source platforms. ~ Familiarity...  ...1395 Crossman Ave, Sunnyvale, CA 94089-1114, United States of America... 
    Full time
    Temporary work
    Part time

    Walmart

    Sunnyvale, CA
    3 days ago
  •  ...experience in design, implementation, and support of solutions big data solution in Hadoop using Hive, Spark, Drill, Impala, HBase3....  ...skills. Remote is fine but preferably hybrid in Sunnyvale. Data engineer with more than 4-5 yrs.’ of experience. Required Qualifications... 
    Remote work

    Samprasoft

    Sunnyvale, CA
    14 hours ago
  • Test & Software Engineer / On-Site (CA, USA) What are we looking for? Jetson is a category-defining eVTOL company with a mission to change the way...  ...-based transportation up into the air. Jetson is enabling new and exciting ways of travelling and we are committed to making... 
    Weekend work

    Jetson

    Palo Alto, CA
    2 days ago
  •  ...Job Title: Senior AI Data Engineer Location: Menlo Park, CA (Hybrid) Duration: 6 months (potential extensions to long term) Our client is looking for a Senior AI Data Engineer to build and scale next-generation data pipelines powering image generation systems... 
    Remote work

    Intelliswift

    Menlo Park, CA
    9 hours ago
  •  ...Unchain Data in Menlo Park, CA, is seeking a Senior Analytics Engineer to design and develop high-performance ETL pipelines, data models, and analytics tools. The role involves partnering with product, engineering, and data science teams to ensure data-driven decision... 

    Unchain Data

    Menlo Park, CA
    9 hours ago
  • $165.2k - $223.6k

     ...Come build the future of data streaming with the...  ...a Software Development Engineer for the Amazon Data Firehose...  ..., and experience with open-source data processing...  ...reliability and scaling) of new and existing systems...  ...our benefits at . USA, CA, East Palo Alto - 165,2... 
    Internship
    Local area
    Flexible hours

    Amazon

    Palo Alto, CA
    2 days ago
  • $125.5k - $230.2k

     ...help to build a better working world. Technology – Data and Decision Science – Data Engineering – Manager We are looking for a dynamic and experienced...  ...US is $125,500 to $230,200. The base salary range for New York City Metro Area, Washington State and California (... 
    Summer holiday
    Flexible hours

    Ernst & Young Oman

    Palo Alto, CA
    2 days ago
  • $280.71k

     ...Senior AI Engineer I Menlo Park, CA A Rare Opportunity to Shape the Future of...  ...building a world-class Commercial Data & Applied AI team. We are...  ...the opportunity to join a new team and help shape the vision...  ...millions of patients Open, transparent culture that includes... 
    Full time
    Worldwide
    2 days per week
    3 days per week

    BillionToOne

    Menlo Park, CA
    1 day ago
  • $125 per hour

     ...Manager (Change Mgmt & Comms Program Manager) (26046-1) Santa Clara, CA Program Manager (Change Mgmt & Comms Program Manager) (26046-1...  ...tools (Asana, Smartsheets, Confluence, GDrive). Working Place: Santa Clara, CA, USA Company : ESR Healthcare... 
    Hourly pay
    Visa sponsorship
    Relocation package

    ESR Healthcare

    Santa Clara, CA
    3 days ago
  • $167.8k - $227k

     ...database of the year 3 times by DB-Engines DBMS of the year. It is one of...  ...2), to consistently released new product innovations that...  ...You will interact with the open-source community via forums, conferences...  ...about our benefits at . USA, CA, East Palo Alto - 167,800.00 -... 
    Local area
    Flexible hours

    Amazon

    Palo Alto, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to NEW JOB OPENING DATA ENGINEER IN Menlo Park, CA, USA!. Be the first to apply!