Senior Scientist, Synthetic Data Generation

$168k - $264.5k
Full-time

NVIDIA

NVIDIA is at the forefront of the AI revolution, and our research is shaping the future of large language models. We are looking for a Senior Scientist to join our team and help advance our capabilities in synthetic data generation for training frontier models. You will contribute to open-source libraries within the NVIDIA NeMo ecosystem that generate synthetic datasets across text, code, structured, and multimodal data, directly feeding the pre- and post-training of LLMs such as Nemotron. This role combines hands-on software engineering with applied research in generative methods, and you will collaborate with research, engineering, product, and model teams as well as external labs.

What you'll be doing:
  • Build synthetic data generation pipelines using LLM-based methods and automated quality evaluation, producing datasets that improve the pre- and post-training of LLMs such as Nemotron in reasoning, coding, structured output, and multimodal understanding.
  • Advance multimodal synthetic data generation (image, document, video, and audio) in partnership with NVIDIA's model teams.
  • Design and maintain open-source libraries and SDKs with clean APIs and strong documentation.
  • Drive software excellence with modern tooling, configuration-driven architecture, and professional Git and CI/CD practices.
  • Publish original research at top machine learning and AI conferences to maintain NVIDIA's technical leadership.
  • Mentor interns and junior researchers to foster technical growth within the team.

What we need to see:
  • PhD in Computer Science, Machine Learning, Statistics, or a related field, or equivalent experience.
  • A research background of 3+ years in synthetic data generation, generative modeling, multimodal machine learning, or related areas. Comparable experience is also considered.
  • Deep technical understanding of LLMs, how data shapes their pre- and post-training, and inference frameworks such as vLLM or TGI.
  • Proven track record of developing or maintaining software libraries used by a broad developer community.
  • Strong publication record at premier venues such as NeurIPS, ICML, ICLR, ACL, or similar.

Ways to stand out from the crowd:
  • Open-source contributions in ML or data tooling.
  • Experience with multimodal generation or understanding (vision-language, document AI, video, or audio).
  • Building and optimizing scalable data pipelines for large-scale model training (throughput, distributed inference).
  • Experience generating data for agentic, tool-use, or reinforcement-learning post-training.

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and talented people in the world working with us. If you are creative, autonomous, and passionate about building open-source tools that make AI safer and more private, we want to hear from you.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 168,000 USD - 264,500 USD for Level 3, and 192,000 USD - 304,750 USD for Level 4. You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until June 14, 2026. This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

NVIDIA pioneered accelerated computing. Today, our AI infrastructure powers global intelligence, transforming every industry.

Vacancy posted 19 hours ago