Research Scientist, Vision-Language-Action Models
Neara
Job type: Full Time · Department: Manufacturing Engineering · Work type: On-Site
Menlo Park, California, United States

About Matter
Matter is building the AI-native autonomy stack for physical manufacturing in the United States. We operate as a contract manufacturer, deploying software and autonomy in our own factories, which gives us something most AI companies don’t have: a live production environment as a training ground. Our long-term vision is to become the infrastructure layer for American manufacturing, the way AWS became infrastructure for software.

The Role
We are hiring a Research Scientist to lead the development and deployment of Vision-Language-Action (VLA) models for robotic manipulation in live manufacturing work cells. This is not a lab role. You will train models, close the Sim2Real loop, and deploy them on physical robots running production programs.

Matter’s Sim2Real pipeline spans NVIDIA Isaac Sim, physics-accurate virtual builds of our modular assembly equipment, and 100% data collection from real factory operations. You will operate at the center of this flywheel design, improving models with every production run.

What You'll Do
- Develop and fine-tune VLA models for precision assembly tasks, including dexterous manipulation, part handling, and test operations
- Design and manage the Sim2Real training pipeline: domain randomization, synthetic data generation, physics simulation (NVIDIA Isaac Sim, MuJoCo), and sim-to-physical transfer
- Build evaluation frameworks to benchmark real-world manipulation performance against manufacturing tolerances and repeatability requirements
- Collaborate with controls and automation engineers to fuse learned policies with traditional control architectures for production safety
- Contribute to Physical AI architecture decisions: model selection, data strategy, training infrastructure, and deployment protocols
- Publish novel research at top-tier conferences, though shipping production systems is the primary measure of success

What We’re Looking For
- PhD, graduate degree, or equivalent research depth in robotics, machine learning, or a related field
- Hands-on experience training and deploying VLA, VLM, or generalist robot policies on physical hardware (not just simulation)
- Strong foundation in imitation learning, reinforcement learning, and general machine learning methods
- Proficiency in PyTorch; experience with NVIDIA Isaac Sim, MuJoCo, or similar physics engines
- Ability to debug the full stack: model architecture, training data quality, sim calibration, sensor noise, and hardware edge cases
- Comfort operating in a high-velocity, ambiguous environment where you own systems end-to-end

Nice to Have
- Experience with MARL or multi-robot coordination
- Background in manufacturing, industrial automation, or robotic assembly

Why Matter
Most VLA research is validated in a lab or on a tabletop. At Matter, your models run on a production factory floor, handling real parts for real customers. The feedback loop is immediate and grounded. The training data is yours because the factory is yours. No one else in this space has that combination at the stage we’re at.

