Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Research Manager, World Model Evaluation

$272k - $431.25k

NVIDIA

Senior Research Manager – World‑Model Evaluation Lead world‑model evaluation and benchmarking across NVIDIA’s Physical AI model portfolio. Responsibilities Lead a team of Research Scientists focused on world‑model evaluation, benchmarking, and diagnostics for NVIDIA Physical AI models, including world foundation models, world‑action models, synthetic data generation systems, robotics, simulation, and embodied AI workflows. Define the scientific roadmap for closed‑system and open‑system evaluation, including open‑loop and closed‑loop benchmarks, metrics, failure taxonomy, model comparison, and evaluation‑to‑training feedback loops. Develop benchmarks for physical plausibility, temporal consistency, scene dynamics, object permanence, spatial reasoning, action conditioning, affordances, controllability, long‑horizon coherence, SDG quality, and WAM usefulness. Develop open‑system and mechanistic evaluation methods using model internals, including representation probing, causal interventions, activation analysis, ablations, sparse autoencoders, attention and feature analysis, and circuit‑style diagnostics. Drive evaluation‑to‑model‑improvement loops with training, post‑training, data curation, simulation, robotics, SDG, WAM, and applied research teams, including failure discovery, data generation, post‑training priorities, model roadmap feedback, and re‑evaluation. Publish high‑quality papers, technical reports, benchmarks, and open‑source evaluation artifacts while establishing rigorous standards for validity, reproducibility, dataset hygiene, leakage prevention, and model comparison. Qualifications Strong research background in machine learning, computer vision, multimodal AI, robotics, world models, representation learning, model evaluation, or mechanistic interpretability. Experience leading research teams, research programs, or cross‑functional technical initiatives with measurable scientific and product impact. Deep understanding of modern foundation models, including video models, vision‑language‑action models, diffusion or flow models, self‑supervised learning, or world‑model architectures. Experience designing serious benchmarks, evaluation datasets, metrics, diagnostic tools, or model analysis frameworks for complex ML systems. Familiarity with world‑model evaluation and open‑system analysis techniques, such as physical plausibility, temporal consistency, action conditioning, counterfactual reasoning, representation probing, activation patching, causal interventions, sparse autoencoders, or feature attribution. PhD, or equivalent experience in Computer Science, Electrical Engineering, Robotics, Machine Learning, AI, or a related field, with 12+ overall years of relevant research or engineering experience as well as 5+ years of management experience. Ability to work onsite at NVIDIA’s Santa Clara headquarters; this is not a remote position. Ways to Stand Out Built influential benchmarks, evaluation suites, model diagnostics, or interpretability tools used by research or production teams. Published in areas such as world models, video generation, physical AI, embodied AI, robotics, representation learning, mechanistic interpretability, self‑supervised learning, or model evaluation. Experience evaluating generative video models, action‑conditioned world models, robotics foundation models, world‑action models, synthetic data generation systems, simulation systems, or vision‑language‑action models. Strong point of view on what current benchmarks miss, and excitement to build the next generation of evaluation science for Physical AI. Benefits Base salary range: $272,000 – $431,250 USD; additional equity and benefits. Eligible for equity and benefits. Equal Employment Opportunity NVIDIA is committed to fostering an inclusive work environment and is proud to be an equal opportunity employer. NVIDIA does not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. #J-18808-Ljbffr NVIDIA

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Senior Research Manager, World Model Evaluation in Santa Clara, CA vacancy
  • $272k - $431.25k

    NVIDIA AI is seeking a Senior Research Manager based in Santa Clara, California, to lead world-model evaluation in Physical AI. The role involves spearheading a team of research scientists to define scientific roadmaps and benchmarks, focusing on precise diagnostics and... 
    Senior

    NVIDIA AI

    Santa Clara, CA
    3 days ago
  • $272k - $431.25k

     ...Requisition ID: JR2019461 Job Category: Research. Time Type: Full time. At...  ..., we’re generating it! Our world model team is pushing the boundaries of...  ...Physical AI. We are looking for a Senior Research Manager to lead world-model evaluation and benchmarking across NVIDIA’s... 
    Senior
    Full time

    NVIDIA AI

    Santa Clara, CA
    3 days ago
  • $224k - $356.5k

     ...driving cars that can understand the world. Doing what’s never been done...  ...-performance computing. As a Senior / Principal Deep Learning Engineer — Model Evaluation & AI Systems, you will play a meaningful...  ...communicate effectively across research, engineering, and product teams.... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • NVIDIA is seeking a Senior Research Manager to lead world-model evaluation and benchmarking in Santa Clara, California. The ideal candidate will have a strong research background in machine learning, deep understanding of modern foundation models, and extensive leadership... 
    Suggested

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $165k - $185k

     ...Senior AI Research Scientist- World Model Full-time The Bosch Research and Technology Center North America with offices in Sunnyvale, California, Pittsburgh...  ...events, and seminars. Offer expert insights to the management team in relevant technology sectors, aiding in... 
    Senior
    Full time
    Work experience placement
    Worldwide

    Robert Bosch Group

    Sunnyvale, CA
    4 days ago
  • $165k - $185k

     ...Robert Bosch Group is seeking a Senior AI Research Scientist to join our team in Sunnyvale, California. The role involves conducting research on GenAI and Foundation Models, working alongside an international team to develop scalable AI solutions in automotive and other... 
    Senior

    Robert Bosch Group

    Sunnyvale, CA
    3 days ago
  • Apple Inc. is seeking a Senior Machine Learning Engineer in Cupertino, California, to evaluate and refine Apple's AI systems. You will design and develop key infrastructures for model and agent evaluations, contribute to quality improvements, and work closely with product... 
    Senior

    Apple Inc.

    Cupertino, CA
    2 days ago
  • $128.7k - $261.3k

     ...repeatable, high-velocity model deployments through...  ...perception, trajectory,etc.) to evaluate compressed models,...  ...similar) ~ Published research, open-source...  ...frequency dictated by your manager}. The selected candidate...  ...GM Our vision is a world with Zero Crashes, Zero... 
    Senior
    Local area
    Remote work
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    1 day ago
  • $244.14k - $413.16k

     ...Senior Staff Machine Learning Engineer - Foundation Model Santa Clara, CA XPENG is a leading smart technology...  ...Learning Engineer / Research Scientist to drive...  ...will work closely with world-class researchers,...  ...Conduct systematic ablation, evaluation, and visualization of... 
    Senior
    Full time

    XPENG

    Santa Clara, CA
    1 day ago
  • $172.43k - $230.95k

     ...Senior Software Engineer For The Ai Model Lifecycle Team Crusoe is on a mission to accelerate...  ...to tokens — to power the world's most ambitious AI...  ...building a comprehensive managed platform for the entire application...  ...: versioning, lineage, evaluation, and reproducible fine-... 
    Senior
    Temporary work

    Crusoe

    Sunnyvale, CA
    2 days ago
  •  ...the Institute of Foundation Models  We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our...  ...model training, alongside world-class researchers, data scientists...  ..., experimentation, and evaluation workflows.  ~ This role... 
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    13 days ago
  • $220k - $320k

     ...the Institute of Foundation Models We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our...  ...model training, alongside world-class researchers, data scientists...  ...pipelines, scaling our evaluation frameworks, and ensuring... 
    Visa sponsorship
    Flexible hours

    Institute of Foundation Models

    Sunnyvale, CA
    11 days ago
  •  ...Senior AI/ML Research Engineer – Model Development It started with a simple idea: what if surgery could be...  ...faster, improving outcomes around the world. The problems we solve demand creativity...  ...Develop, fine-tune, and evaluate the AI/ML models—including foundation... 
    Senior
    Local area
    Worldwide
    Flexible hours

    Intuitive

    Sunnyvale, CA
    22 hours ago
  • $300k

     ...Institute of Foundation Models We are a dedicated research lab for building,...  ..., using, and risk-managing foundation models. Our...  ...our general-purpose world modeling efforts....  ...performance, safety and evaluation benchmarks, driving...  ...and key insights to senior leadership and... 
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    3 days ago
  •  ...Research Intern Applied Intuition, Inc. is powering the future of physical AI....  ...flexibility and trust our employees to manage their schedules responsibly. This may...  ...Conduct research on pretraining world-action foundation model with various world modalities including... 
    For contractors
    For subcontractor
    Casual work
    Internship
    Work at office
    Immediate start
    Remote work
    Day shift

    Applied Intuition

    Sunnyvale, CA
    3 days ago
  • $198.3k - $342.8k

    Senior Research Manager, Multimodal Reasoning - SIML Cupertino, California, United...  .../ alignment, automatic evaluation, distributed training & inference...  ...to partner with world class system engineers to prototype...  ...interface with large scale modeling & data infrastructure is a... 
    Senior
    Relocation

    Apple Inc.

    Cupertino, CA
    1 day ago
  •  ...ability to measure it. At Sanas, model quality spans dimensions that automated...  ...translation fluency under real‑world disfluency. We are looking for a Research Scientist who can define what "...  ...Sanas's model families, build the evaluation infrastructure to measure it rigorously... 

    Sanas

    Palo Alto, CA
    14 hours ago
  • $136k - $218.5k

    ## Senior Program Manager, NVIDIA ResearchApplylocations: US, CA, Santa Claratime...  ...technical areas of NVIDIA Research. This role works with...  ...and Faculty Fellow planning, evaluation, and selection* Track and report...  ...be one of the technology world’s most desirable employers,... 
    Senior
    Work at office

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $224k - $356.5k

    NVIDIA is searching for a senior or principal engineer...  ...large‑scale foundation model training in the Generalist Embodied Agent Research (GEAR) group. Our team...  ...programming, and cluster management tools like Kubernetes....  ...one of the technology world's most desirable employers... 
    Senior
    Full time

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $152k - $241.5k

    Responsibilities Build performance modeling and prediction tools for AI workloads at Data-center scale Develop production tools and workflows...  ...to propose new features or improve existing features with real world use cases Qualifications BS+ in Computer Science or related (or... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $100.08k - $118.81k

    Overview Research and Program Manager, Disease Ecology in a Changing World (DECO) is a fixed‑term, two‑year position at Stanford Doerr School of Sustainability. The...  ...outcomes. Develop new protocols; test and evaluate a variety of approaches. Conceptualize and draft... 
    Full time
    Fixed term contract
    Remote work

    Stanford University

    Palo Alto, CA
    2 days ago
  • $190k - $250k

     ...that doesn't just perceive the world, it learns how the physics of...  ...large-scale generative world models that learn to predict...  ...trucks. We are looking for a research scientist to lead the design...  ...and radar outputs Design evaluation frameworks that measure world... 
    Temporary work
    Work at office
    Visa sponsorship
    Flexible hours

    Kodiak

    Mountain View, CA
    26 days ago
  • $281k - $356k

     ...Senior Staff Software Engineer, Model Post Training Waymo is an autonomous driving technology...  ...the mission to be the world's most trusted driver....  ...a world-class team of researchers and engineers to develop...  ...bar for how Waymo trains, evaluates, and deploys LLM models in... 
    Senior
    Full time
    Remote work

    Waymo

    Mountain View, CA
    3 days ago
  • Apple Inc. is seeking an expert to evaluate machine learning and deep learning models, playing a crucial role in creating robust evaluation frameworks. The ideal candidate will collaborate with multidisciplinary teams, utilizing statistical methods and Python expertise... 

    Apple Inc.

    Sunnyvale, CA
    22 hours ago
  • $34 per hour

     ...Welo Global is seeking a Data Quality Associate based in Sunnyvale, CA. The role involves evaluating AI model outputs, providing structured feedback, and performing audits on data quality. Candidates should possess a university degree and have critical thinking, attention... 
    Full time

    Welo Global

    Sunnyvale, CA
    3 days ago
  •  ...Summary MathWorks has a hybrid work model that enables staff members to split their...  ...and with greater confidence. As a Senior Application Engineer, you will build...  ...public events. Assist customers in evaluating and adopting our solutions.Establish rapport... 
    Senior
    Full time
    Work experience placement
    Work at office
    Flexible hours

    The MathWorks Inc

    Santa Clara, CA
    3 days ago
  • $300k

    A leading research institute in Sunnyvale is seeking a visionary who will lead world modeling efforts and manage a multidisciplinary team. This role involves designing innovative simulators based on cutting-edge research and requires a Ph.D. or M.S. with substantial experience... 

    Institute of Foundation Models

    Sunnyvale, CA
    3 days ago
  • $200k - $350k

     ...About the Institute of Foundation Models We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research...  ...edge foundation model training, alongside world‑class researchers, data scientists, and... 
    Work experience placement
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    3 days ago
  • $204k - $259k

     ...the mission to be the world's most trusted driver....  ...mission of the Waymo Applied Research team is to develop...  ...report to a Technical Lead Manager. You will: Conduct applied foundation model research and...  ...experiments by training and evaluating large deep learning models... 
    Senior
    Full time
    Remote work

    Waymo

    Mountain View, CA
    4 days ago
  • $184k - $287.5k

     ...robots, and self‑driving cars that can perceive and understand the world. We’re looking to grow our company, and form teams with the...  ...architect, and implement new features related to CUDA’s memory model and multi‑node scalability geared towards next‑gen AI applications... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Research Manager, World Model Evaluation. Be the first to apply!