Research Scientist, Vision-Language-Action Models

Neara

Return to jobs list Overview Research Scientist, Vision-Language-Action Models Job type: Full Time · Department: Manufacturing Engineering · Work type: On-Site Menlo Park, California, United States About Matter Matter is building the AI-native autonomy stack for physical manufacturing in the United States. We operate as a contract manufacturer, deploying software and autonomy in our own factories, which gives us something most AI companies don’t have: a live production environment as a training ground. Our long-term vision is to become the infrastructure layer for American manufacturing, the way AWS became infrastructure for software. The Role We are hiring a Research Scientist to lead the development and deployment of Vision-Language-Action (VLA) models for robotic manipulation in live manufacturing work cells. This is not a lab role. You will train models, close the Sim2Real loop, and deploy them on physical robots running production programs. Matter’s Sim2Real pipeline spans NVIDIA Isaac Sim, physics-accurate virtual builds of our modular assembly equipment, and 100% data collection from real factory operations. You will operate at the center of this flywheel design, improving models with every production run. What You'll Do Develop and fine-tune VLA models for precision assembly tasks, including dexterous manipulation, part handling, and test operations Design and manage the Sim2Real training pipeline: domain randomization, synthetic data generation, physics simulation (NVIDIA Isaac Sim, MuJoCo), and sim-to-physical transfer Build evaluation frameworks to benchmark real-world manipulation performance against manufacturing tolerances and repeatability requirements Collaborate with controls and automation engineers to fuse learned policies with traditional control architectures for production safety Contribute to the Physical AI architecture decisions: model selection, data strategy, training infrastructure, and deployment protocols Publish novel research in top tier 1 conferences — though shipping production systems is the primary measure of success What We’re Looking For PhD, Graduate or equivalent research depth in robotics, machine learning, or a related field Hands-on experience training and deploying VLA, VLM, or generalist robot policies on physical hardware (not just simulation) Strong foundation in imitation learning, reinforcement learning, and general machine learning methods Proficiency in PyTorch; experience with NVIDIA Isaac Sim, MuJoCo, or similar physics engines Ability to debug the full stack: model architecture, training data quality, sim calibration, sensor noise, and hardware edge cases Comfort operating in a high-velocity, ambiguous environment where you own systems end-to-end Nice to Have Experience with MARL or multi-robot coordination Background in manufacturing, industrial automation, or robotic assembly Why Matter Most VLA research is validated in a lab or on a tabletop. At Matter, your models run on a production factory floor, handling real parts for real customers. The feedback loop is immediate and grounded. The training data is yours because the factory is yours. No one else in this space has that combination at the stage we’re at. #J-18808-Ljbffr

Apply

Vacancy posted 1 day ago

Similar jobs that could be interesting for youBased on the Research Scientist, Vision-Language-Action Models in Menlo Park, CA vacancy

Senior Research Scientist- Vision-Language-Action (VLA) Models
$185k - $215k
...Description The Bosch Research and Technology Center... ...focuses on Foundation Models, Big Data Visual Analytics... ...AI (XAI), Natural Language Processing, Computer Vision & Mixed Reality, Cloud... ...As a Senior Research Scientist- Vision- Language- Action (VLA) Models, you contribute...
Language
Full time
Work experience placement
Local area
Worldwide
Robert Bosch Group
Sunnyvale, CA
1 day ago
Research Scientist: Vision-Language-Action for Embodied AI
$165k - $185k
...Robert Bosch Group seeks a motivated Research Scientist specializing in Vision-Language-Action Models in Sunnyvale, California. The role emphasizes cutting-edge research in AI, focusing on autonomous systems and collaboration across global teams. The successful candidate...
Language
Robert Bosch Group
Sunnyvale, CA
1 day ago
Research Scientist - Vision Language Model
$150k
...About the Institute of Foundation Models We are a dedicated research lab for building, understanding, using... ...world-class researchers, data scientists, and engineers, tackling the most... ...Summary As a Research Scientist in the Vision Language Model (VLM) team, your role will...
Language
Institute of Foundation Models
Sunnyvale, CA
1 day ago
Research Scientist - World Action Foundation Model, Robotics
$126k - $423k
...looking for multiple passionate Research Scientists to join the Research Group... ...on pretraining world-action foundation model with various world modalities including vision and physics associated with... ..., human data incorporation, language modality, and spatial reasoning...
Language
Full time
For contractors
For subcontractor
Casual work
Work at office
Immediate start
Remote work
Day shift
Applied Intuition
Sunnyvale, CA
15 days ago
Senior Research Scientist, Multimodal Foundation Models and Robotics
$192k - $304.75k
...We are now looking for a Senior Research Scientist focused on Multimodal Foundation Models and Robotics! NVIDIA is... ...following topics: LLMs; Large vision‑language models; Video generative models and diffusion algorithms; or Action‑based transformers. Outstanding...
Language
University of Georgia- FACS
Santa Clara, CA
1 day ago
Machine Learning Research Scientist: Generative Modeling for Planning
$160.36k - $240.54k
...Machine Learning Research Scientist: Generative Modeling for Planning Mountain View, California (HQ)... ...foundation models. Leverage large language models and world foundation models... ...autonomous driving. Experiences in vision-language-action models, reinforcement learning...
Language
Nuro
Mountain View, CA
3 days ago
Research Scientist (Model Evaluation)
...as our ability to measure it. At Sanas, model quality spans dimensions that automated... ...-world disfluency. We’re looking for a Research Scientist who can define what "better" actually... ...Noise Cancellation, Speech Enhancement, Language Translation, and more — ensuring each captures...
Language
Sanas
Palo Alto, CA
1 day ago
Research Scientist, Vision-Language Multimodal AI
...The Institute of Foundation Models in Sunnyvale, California is seeking a Research Scientist for their Vision Language Model team. This role involves advancing state-of-the-art multimodal foundation models and developing large-scale systems combining visual understanding...
Language
Institute of Foundation Models
Sunnyvale, CA
1 day ago
NLP Research Scientist
$100k - $300k
...Overview OPPO Research Center is seeking a passionate and innovative Research Scientist to advance our next-generation AI hardware platforms... ...deployment of multimodal intelligence models that seamlessly integrate language, vision and action. As part of our collaborative...
Language
Full time
OPPO US Research Center
Palo Alto, CA
2 days ago
Senior Vision-Language-Action AI Scientist
$185k - $215k
...Robert Bosch Group is seeking a Senior Research Scientist specializing in Vision-Language-Action models in Sunnyvale, California. This role involves conducting cutting-edge research to advance AI technologies in autonomous driving systems. Candidates should have a Ph....
Language
Robert Bosch Group
Sunnyvale, CA
1 day ago
AI Engineer, Vision & Foundation Models, Optimus
$176k - $420k
...You will join the team building the vision and multimodal foundation models that allow Optimus to understand,... ...compression, and fusion of vision, language, audio, and tactile data ~... ...video generation, vision-language-action models ~ Human/Object Reconstruction...
Language
Hourly pay
Full time
Temporary work
Flexible hours
Tesla
Palo Alto, CA
4 days ago
Senior Research Scientist, Multi-Modal Language Models
$192k - $304.75k
...We’re now looking for a Senior Research Scientist, Multi-Modal Language Models! NVIDIA is seeking a Senior Research Scientist passionate about multi modal... ...or related areas 4+ years of experience in computer vision, especially multi‑modal LLMs Proficiency in Python with...
Language
NVIDIA Gruppe
Santa Clara, CA
2 days ago
Staff Vision-Language-Action Robotics ML Engineer
A leading AI research company in California is seeking an experienced Machine Learning Engineer to develop Vision-Language-Action models for robotics. The ideal candidate has over 5 years of expertise in machine learning, a commitment to data quality, and strong production...
Language
GenesisAI
San Carlos, CA
4 days ago
Research Scientist - Computer Vision
$150k
...the Institute of Foundation Models We are a dedicated research lab for building,... ...world-class researchers, data scientists, and engineers, tackling the... ...specializing in Computer Vision your role will be crucial... ...AI‑related concepts (e.g., language modeling, computer vision)...
Language
Visa sponsorship
Institute of Foundation Models
Sunnyvale, CA
1 day ago
Senior Applied Research Scientist - Foundation Models
...role Ambient.ai is hiring a Senior Applied Research Scientist to build the next generation of foundation models for computer vision. You will join a team responsible for building... ...from pre‑training and fine‑tuning on image‑language data to applying distillation and...
Language
Full time
Local area
Flexible hours
Ambient
Redwood City, CA
1 day ago
Machine Learning Research Scientist, Mechanical Intuition in Multimodal Models
...At Toyota Research Institute (TRI), we're on a mission to improve the quality of human... ...Design and implement end-to-end modeling pipelines for machine assembly tasks,... ...Qualifications Familiarity with large language models, vision-language models, or agentic AI frameworks...
Language
Work experience placement
Internship
Local area
Shift work
Toyota Research Institute
Los Altos, CA
4 days ago
Staff Research Scientist: Vision-Language AI for Autonomy
...General Motors is seeking a Staff Research Scientist specializing in Vision-Language Models to redefine mobility. You will lead advancements in AI for autonomous driving at the Mountain View Technical Center. This remote position requires a Ph.D. and 5+ years of experience...
Language
Remote work
General Motors
Mountain View, CA
1 day ago
AI Research Scientist- Multimodal Foundational Models
$165k - $195k
...AI Research Scientist- Multimodal Foundational Models Full-time The Bosch Research and Technology Center North America with offices in Sunnyvale... ...Valley focuses on Foundation Models,Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics, Big Data...
Language
Full time
Work experience placement
Local area
Worldwide
Robert Bosch Group
Sunnyvale, CA
1 day ago
AI Research Scientist, Robotics
$219k - $301k
...intelligent robots and advanced AI models, we’re hiring talent... ...world. We’re seeking a Research Scientist ready to use their skills... ..., Robotics, Computer Vision, Machine Learning, Language, a related field, or equivalent... ..., imitation learning, action‑conditioned world models,...
Language
Hourly pay
Work experience placement
Internship
Local area
NLP PEOPLE
Menlo Park, CA
1 day ago
Embodied Robotics AI Scientist - Vision-Language Action
$147k - $211k
Google Inc. in Mountain View, CA is seeking a Roboticist to develop Vision Language Action models that enhance robot capabilities. This role involves designing algorithms, implementing research ideas, and collaborating with teams in a cutting-edge AI environment. A PhD...
Language
Google Inc.
Mountain View, CA
2 days ago
Machine Learning Research Scientist, Mechanical Intuition in Multimodal Models
$176k - $253k
At Toyota Research Institute (TRI), we’re on a mission to improve... ...are looking for a Research Scientist to join us in building intelligent... ...to explore how large language models and agentic infrastructure can... ...with large language models, vision‑language models, or agentic...
Language
Work experience placement
Internship
Local area
Shift work
Toyota Research Institute
Los Altos, CA
1 day ago
Senior AI Research Scientist- World Model
$165k - $185k
...Senior AI Research Scientist- World Model Full-time The Bosch Research and Technology Center North America with offices in Sunnyvale... ...Data Visual Analytics, Explainable AI (XAI), Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics, Data Science, AI...
Language
Full time
Work experience placement
Worldwide
Robert Bosch Group
Sunnyvale, CA
1 day ago
Machine Learning Researcher / Engineer (Foundational Models)
...first post-transformer model that adapts and thinks... ...Stamirowska, a complexity scientist who created a team... ...in ChatGPT) and a key researcher behind OpenAI’s reasoning... ...with a track record in Language Models and/or RL (candidates with a Vision or Robotics ML background...
Language
Permanent employment
Full time
Contract work
Immediate start
Remote work
Flexible hours
Pathway Genomics
Palo Alto, CA
12 days ago
Senior Vision Language Model Engineer
$184k - $287.5k
...application is built. We are seeking a senior vision language model engineer to design and build agentic... ...you'll be doing: Partner with our researchers to develop and evaluate prototypes of... ...(e.g., video, sensor, language/action traces) tailored for end‑to‑end physical...
Language
NVIDIA
Santa Clara, CA
2 days ago
Research Scientist - Agentic AI & Foundation Models
$150k
...A leading AI research lab in Sunnyvale is seeking a professional to work on advanced agentic language models. The role involves developing algorithms, contributing to research, and collaborating on state-of-the-art models. Ideal candidates hold a degree in Computer Science...
Language
Visa sponsorship
Institute of Foundation Models
Sunnyvale, CA
1 day ago
Research Scientist: Data for Foundation Models
$150k
...A leading AI research institution in Sunnyvale is seeking a Research Scientist to curate web-scale data crucial for developing foundation models. This role involves pioneering data collection methods... ...and have experience with large language models. Competitive salary...
Language
Institute of Foundation Models
Sunnyvale, CA
1 day ago
Senior Machine Learning Researcher, Large Behavior Models & Diffusion Policy
$200k - $287.5k
...At Toyota Research Institute (TRI), we're on a mission... ...Policy and Large Behavior Models (LBM). The... ...-of-the-art, pixels-to-action, end-to-end system for... ...and integrating visual-language-action modalities. Beyond... ...with a focus on computer vision as the primary sensing...
Language
Local area
Shift work
Toyota Research Institute
Los Altos, CA
4 days ago
Research Scientist
...platform to build custom AI models in hours, not months –... ...Oumi also develops an open research stack and models in collaboration... ...Overview The Research Scientist will be an integral part... ...state-of-the-art in large language models (LLMs), vision language models (VLMs), and...
Language
Worldwide
Flexible hours
Oumi
Palo Alto, CA
3 days ago
Research Scientist - Salesforce AI Research
$117.2k - $313.7k
...About the Role Salesforce AI Research is seeking outstanding AI Research Scientists / Research Engineers to build and deploy high‑impact AI solutions... ..., autonomous workflows Multimodal & Computer Vision – Vision‑language models, video understanding, visual grounding for...
Language
Full time
100 Salesforce, Inc.
Palo Alto, CA
2 days ago
Research Scientist - Robotics
$158k - $304k
...role We are looking for a passionate Research Scientist to join the Research Team at Applied... ...and tools to develop cutting‑edge models at scale. In addition to your research... ...robotic foundation model and vision‑language‑action model, reinforcement learning and real...
Language
Full time
For contractors
For subcontractor
Casual work
Work at office
Immediate start
Remote work
Flexible hours
Decisive Point
Mountain View, CA
1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Research Scientist, Vision-Language-Action Models. Be the first to apply!