Research Scientist: Pretraining

Seer

ML Engineer — Robot Foundation Model Pretraining

We are building large-scale embodied intelligence systems designed to operate in complex real-world environments. Our work focuses on training robot foundation models using massive multimodal datasets spanning video, language, proprioception, action traces, and related sensor modalities.

We are seeking ML Engineers to design and execute large-scale pretraining efforts that develop general-purpose capabilities across embodiments, tasks, and environments. This role focuses on turning raw robotic interaction data into scalable, generalizable intelligence.

What You’ll Do

Design and Execute Large-Scale Pretraining

  • Lead or contribute to large-scale training runs for robot foundation models
  • Work with transformer-based and diffusion-based architectures for multimodal learning
  • Define model architectures, training objectives, and curricula for multimodal robotic data

Develop Data and Training Strategies

  • Design scalable data mixtures and sampling strategies across petabyte-scale datasets
  • Structure training curricula across vision, action, state, language, and other modalities
  • Convert raw robotic interaction data into usable signals for model training

Improve Model Performance Through Analysis

  • Run ablation studies to understand scaling laws, data quality effects, and architecture tradeoffs
  • Analyze training dynamics and large-model failure modes
  • Iterate on data and training design to improve generalization and robustness

Collaborate on Large-Scale Systems

  • Work closely with ML infrastructure and systems teams to improve cluster utilization, throughput, and reliability
  • Contribute to multi-node, multi-GPU distributed training efforts
  • Ensure efficient execution of large-scale training workflows

Shape Data Collection and Model Direction

  • Guide data collection efforts toward high-impact capabilities and gaps
  • Identify and integrate new datasets to improve model coverage and generalization
  • Bridge raw data generation with downstream model performance

What We’re Looking For

  • Experience training large-scale transformer or diffusion models (e.g., language, video, audio, or generative models)
  • Experience with multi-node, multi-GPU distributed training systems
  • Strong understanding of:
    • Scaling laws
    • Optimization dynamics
    • Large-model training behavior and failure modes
  • Strong proficiency in PyTorch and ability to debug across the full ML stack
  • Comfort working with large-scale datasets and rapid experimentation
  • Strong empirical rigor combined with fast iteration speed

Preferred Experience

  • Experience with multimodal or generative model training at scale
  • Background in robotics, embodied AI, or sequential decision-making systems
  • Experience with large distributed training infrastructure
  • Familiarity with curriculum learning or dataset mixture design
  • Experience analyzing large-scale training runs and extracting actionable insights

Why This Role Matters

  • Build the core intelligence layer for general-purpose robotic systems
  • Directly shape how robots learn from large-scale real-world data
  • Work at the intersection of large-scale AI training, systems, and robotics
  • Develop foundation models that generalize across embodiments and environments

About the Company

We are a research-driven AI and robotics company focused on building scalable embodied intelligence systems. By combining advances in machine learning, large-scale training infrastructure, and robotics, we aim to develop systems capable of operating robustly in the physical world.

We are committed to building an inclusive and diverse workplace and encourage applicants from all backgrounds to apply.

Vacancy posted 14 hours ago