Research Scientist- Vision-Language-Action (VLA) Models

$165k - $185k

Bosch Group

Job Description

Company Description

The Bosch Research and Technology Center North America with offices in Sunnyvale, California, Pittsburgh, Pennsylvania, and Cambridge, Massachusetts is a part of the global Bosch Group ( a company with over 70 billion euro revenue, 400,000 employees worldwide, a very diverse product portfolio, and a history spanning over 125 years. The Research and Technology Center North America (RTC-NA) is dedicated to providing technologies and system solutions for various Bosch business fields, primarily in the field of artificial intelligence, energy technologies, internet technologies, circuit design, semiconductors and wireless, as well as advanced MEMS design.

As a part of the global research, our AI research in Silicon Valley focuses on Foundation Models, Big Data Visual Analytics, Explainable AI (XAI), Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics, Data Science, AI System Engineering, Time-series Analysis. We develop scalable, intelligent, and trustworthy AIoT solutions for Bosch products and services in application areas such as automated driving, advanced driver assistance systems (ADAS), robotics, smart manufacturing, enterprise AI, health care, smart home and building solutions.

Originating from the AI research in Silicon Valley, our Intelligent Autonomous Systems group is responsible for enabling future autonomous Bosch products by pushing the boundaries of automated driving, advanced driver assistance systems (ADAS), robotics and automation through key innovations that encompass system architecture and AI components. These include methods for motion planning, high level task planning and decision making as well as systems for making these technologies work on real products by building frameworks that take advantage of technologies in the field of reliable distributed computing. We work with internal partners of different Bosch business units to transfer our solutions into future products. We also actively collaborate with leading groups in academia and industry to promote research ideas and publish research findings in internationally renowned conferences and journals such as CVPR, ICRA, IROS, RSS, NeurIPS and CoRL.

Job Description

As a Research Scientist- Vision-Language-Action (VLA) Models, you contribute to research projects at the forefront of the ADAS/AD industry. Key responsibilities include:

Conduct research and engineering in core AI and machine learning fields to enable Embodied AI (including computer vision, autonomous planning, open-world learning, and so on) for related business domains of ADAS/AD, industrial automation, robotics etc.
Push the boundaries in (modular) end-to-end perception and planning for ADAS/AD, incorporating advancements in large vision-language-(action) models to aid reasoning capabilities and explainability.
Collaborate cross-functionally with global research and engineering teams to ensure seamless technology transfer and system integration.
Implement research results to solve real-world challenges, ensuring high-quality system integration within Bosch's existing platforms.
Stay at the forefront of innovation by actively engaging with academic and industry communities through conferences, workshops, and technical events.
Document and disseminate research findings through high-caliber publications and/or patent submissions.

Qualifications

Basic Qualifications

Ph.D. in Computer Science, Robotics or a related discipline or Master's degree with >= 2 years industry experience after graduation.
A minimum of 3 years of R&D experience, or an equivalent graduate research background, primarily in AI technologies including Computer Vision and Robotic or Automotive Motion and Behavioral Planning.
Proficiency in one or more programming languages commonly used in machine learning (e.g., Python, C++, Rust).
Strong interpersonal, communication, and teamwork capabilities.
Knowledge of major machine learning frameworks like TensorFlow or PyTorch.
Hands-on experience in reinforcement learning for behavior or motion planning or other applicable contexts and familiarity with common RL techniques (e.g. PPO, DQN, DDPG).
A strong portfolio of publications in premier machine learning, deep learning, robotics and computer vision journals and conferences.

Preferred Qualifications

Experience with real-world product development and deployment of autonomous systems.
Hands-on experience building and applying multimodal transformer-based sequence-to-sequence models, especially multimodal vision-language-action models.
Hands-on experience in computer vision and deep learning, with work in any of the following areas: multimodal transformers, multimodal language models, diffusion models, NeRF, gaussian splatting, object detection / segmentation, 3D scene understanding, sensor calibration, SfM, voxel/BEV grid-based feature representation.

Additional Information

We offer a competitive base salary for this position with a range in US-California of --$165,000 - $185,000 along with an annual corporate bonus, and a long-term incentive bonus designed to reward sustained impact and contribution over time. Within the salary range, the individual pay is determined based on several factors, including, but not limited to, work experience and job knowledge, complexity of the role, job location, etc.

Your well-being matters at Bosch! We offer a a benefits package designed to empower you in every area of your life. This includes premium health coverage, a 401(k) with generous matching, resources for financial planning and goal setting, ample paid time off, parental leave, and comprehensive life and disability protection. Your Recruiter can share more details for this position during the interview process.

Learn more about our full benefits offerings by visiting:

Equal Opportunity Employer, including disability / veterans.

*Bosch adheres to Federal, State, and Local laws regarding drug-testing. Employment is contingent upon the successful completion of a drug screen and background check. Candidates who have been offered the position must pass both screenings before their start date.

#LI-JM1

Apply

Vacancy posted 12 days ago

Similar jobs that could be interesting for youBased on the Research Scientist- Vision-Language-Action (VLA) Models in Sunnyvale, CA vacancy

Senior Research Scientist- Vision-Language-Action (VLA) Models
$185k - $215k
...Description The Bosch Research and Technology Center... ...focuses on Foundation Models, Big Data Visual Analytics... ...AI (XAI), Natural Language Processing, Computer Vision & Mixed Reality, Cloud... ...As a Senior Research Scientist - Vision‑Language‑Action (VLA) Models, you contribute...
Language
Work experience placement
Local area
Worldwide
Bosch USA
Sunnyvale, CA
12 hours ago
VLA Vision Scientist for Autonomous Systems
The Bosch USA Research and Technology Center North America in Sunnyvale, CA invites a Research Scientist- Vision-Language-Action (VLA) Models to join our AI research efforts focused on ADAS/AD, robotics and intelligent systems. You will conduct cutting-edge AI research,...
Language
Bosch USA
Sunnyvale, CA
5 days ago
Research Scientist - World-Action Foundation Model, Robotics
$126k - $423k
...looking for multiple passionate Research Scientists to join the Research Group... ...on pretraining world-action foundation model with various world modalities including vision and physics associated with... ..., human data incorporation, language modality, and spatial reasoning...
Language
Full time
For contractors
For subcontractor
Casual work
Work at office
Immediate start
Remote work
Day shift
Applied Intuition
Sunnyvale, CA
12 hours ago
Research Scientist - Vision Language Model
$150k
About the Institute of Foundation Models We are a dedicated research lab for building, understanding, using... ...world-class researchers, data scientists, and engineers, tackling the most... ...Summary As a Research Scientist in the Vision Language Model (VLM) team, your role will...
Language
Institute of Foundation Models
Sunnyvale, CA
2 days ago
Vision-Language-Action AI Scientist
$165k - $185k
Bosch Group in Sunnyvale, California is seeking a Research Scientist specialized in Vision-Language-Action (VLA) Models. The role involves conducting cutting-edge research in AI and machine learning, focusing on Embodied AI. Candidates should hold a Ph.D. or a Master’s...
Language
Bosch Group
Sunnyvale, CA
12 hours ago
Research Intern - World-Action Model / VLA for Autonomous Driving
$39 - $66 per hour
...Description The Bosch Research and Technology Center... ...Valley focuses on Foundation Models, Big Data Visual... ...Explainable AI (XAI), Natural Language Processing, Computer Vision & Mixed Reality, Cloud... ...Intern for World-Action Models / VLA for Autonomous Driving,...
Language
Work experience placement
Internship
Local area
Worldwide
Bosch Group
Sunnyvale, CA
12 days ago
Senior Vision-Language-Action AI Scientist (Autonomy)
Bosch USA in Sunnyvale, California, is seeking a Senior Research Scientist - Vision-Language-Action (VLA) Models to push the envelope in Embodied AI for ADAS/AD, robotics, and automation. You will conduct research spanning computer vision, autonomous planning, and open...
Language
Bosch USA
Sunnyvale, CA
12 hours ago
Senior Research Scientist, Multimodal Foundation Models and Robotics
$192k - $304.75k
We are now looking for a Senior Research Scientist focused on Multimodal Foundation Models and Robotics! NVIDIA is... ...following topics: LLMs; Large vision‑language models; Video generative models and diffusion algorithms; or Action‑based transformers. Outstanding...
Language
University of Georgia- FACS
Santa Clara, CA
1 day ago
Senior / Staff AI Research Scientist, Foundation Models
...‑office collaboration. We are looking for a Senior ML Research Engineer, Embodied Intelligence to advance robotic embodied... ...with humans. Responsibilities Design and deploy vision-language(-action) models (VLM/VLA) for contextual understanding and generalized robot action...
Language
Work at office
Visa sponsorship
RoboForce
Milpitas, CA
12 hours ago
Research Scientist, Vision-Language Multimodal AI
The Institute of Foundation Models in Sunnyvale, California is seeking a Research Scientist for their Vision Language Model team. This role involves advancing state-of-the-art multimodal foundation models and developing large-scale systems combining visual understanding...
Language
Institute of Foundation Models
Sunnyvale, CA
2 days ago
Senior Research Scientist, Multi-Modal Language Models
$192k - $304.75k
Senior Research Scientist, Multi‑Modal Language Models NVIDIA is seeking a Senior Research Scientist passionate about multi‑modal language models. Our team... ...related areas. 4+ years of experience in computer vision, especially multi‑modal LLMs. Proficiency in Python...
Language
NVIDIA
Santa Clara, CA
12 hours ago
Research Scientist [ Computer Vision ]
$175k - $250k
...are building foundation models trained on rich,... ...at the frontier of AI research, and we believe the highest... ...'re seeking Research Scientists to join our growing AI... ...— including computer vision, neuroscience, NLP, and... ...video generation. Vision‑Language‑Action models. 3D...
Language
Visa sponsorship
Flexible hours
Metamorphic
Palo Alto, CA
12 hours ago
Machine Learning Research Scientist: Generative Modeling for Planning
$160.36k - $240.54k
...flexible, partner-led business model, Nuro is working toward a... ...models. Leverage large language models and world foundation... ..., in industry, or both. Research experiences in generative models... ...driving. Experiences in vision-language-action models, reinforcement learning...
Language
Immediate start
Flexible hours
Nuro
Mountain View, CA
29 days ago
Senior AI Research Scientist: Foundation Models in Robotics
RoboForce in Milpitas, CA is seeking a Senior / Staff AI Research Scientist, Foundation Models to advance robotic embodied intelligence. You will... ...across physical tasks, with emphasis on vision-language-action models, multi-modal data, and deployment on RoboForce...
Language
RoboForce
Milpitas, CA
4 days ago
VLA Vision Architect & Research Leader
Dormont Manufacturing Co in Mountain View, California, is seeking a PhD graduate to serve as a technical pathfinder for Vision-Language-Action models. You will guide a talented engineering team while bridging academic theory with practical applications in a hybrid work...
Language
Dormont Manufacturing Co
Mountain View, CA
1 day ago
Senior Vision-Language Model Architect
Aurora is seeking a skilled Aurora Driver to lead the technical vision for Vision-Language-Action models in Mountain View, California. You will identify weaknesses in existing architectures and mentor a high-performing engineering team. This role requires a PhD in AI or...
Language
Aurora CO
Mountain View, CA
12 hours ago
AI Research Scientist- World Model (Hiring Immediately)
$165k - $185k
...Company Description The Bosch Research and Technology Center North America with offices... ...Silicon Valley focuses on Foundation Models, Big Data Visual Analytics, Explainable AI (XAI), Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics, Data...
Language
Part time
Work experience placement
Immediate start
Worldwide
Bosch Group
Sunnyvale, CA
4 days ago
AI Research Scientist- Multimodal Foundational Models
$165k - $195k
AI Research Scientist- Multimodal Foundational Models Full-time The Bosch Research and Technology Center North America with offices in Sunnyvale... ...Valley focuses on Foundation Models, Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics, Big Data...
Language
Full time
Work experience placement
Local area
Worldwide
Robert Bosch Group
Sunnyvale, CA
2 days ago
Senior Staff Research Scientist, Robot Intelligence
$197.8k - $296.6k
...Robot Intelligence Lab at Samsung Research America is a new facility dedicated... ...looking for a Senior Staff Research Scientist with solid technical skills and rich... ...topics such as robotics foundation models, vision‑language‑action (VLA) models, vision language models (...
Language
Samsung-Research-America
Mountain View, CA
4 days ago
Research Scientist / Engineer, Foundation Model Evaluation
$184.7k - $324.8k
Research Scientist / Engineer, Foundation Model Evaluation Cupertino, California, United States Software and Services... ...‑ended creative tasks to precise, action‑taking workflows. If you’re drawn... ...or a related area (e.g., natural language generation, information retrieval,...
Language
Relocation
Apple Inc.
Cupertino, CA
2 days ago
Senior Machine Learning Researcher, Large Behavior Models & Diffusion Policy
$200k - $287.5k
...At Toyota Research Institute (TRI), we’re on a mission... ...Policy and Large Behavior Models (LBM). The Opportunity... ...‑of‑the‑art, pixels‑to‑action, end‑to‑end system for... ...and integrating visual‑language‑action modalities. Beyond... ...a focus on computer vision as the primary sensing...
Language
Local area
Shift work
TRI
Los Altos, CA
4 days ago
AI Experience Researcher, Product Evaluation, Vision Products Group
$130.2k - $237.6k
AI Experience Researcher, Product Evaluation, Vision Products Group Boulder,... ...collaborating with ML and data scientists, software engineers... ...patterns in model behaviors and... ...translating results into actionable recommendations for... ..., preferably Large Language Models Familiarity...
Language
Relocation
Apple Inc.
Sunnyvale, CA
2 days ago
Research Scientist, Wayve Labs
...software and foundation models enable vehicles to... ...systems. Our vision is to create... ...looking for Applied Scientists to join Wayve Labs... ...a high‑conviction research team with the strategic... ...and costs of actions. Representation Learning... ..., using vision, language, and active...
Language
Full time
Work at office
Work from home
Visa sponsorship
Relocation package
Flexible hours
Icehouseventures
Sunnyvale, CA
2 days ago
Research Scientist, Gemini Tool Use, DeepMind
$262k - $365k
Research Scientist, Gemini Tool Use, DeepMind corporate_fare DeepMind place... .... Experience with Large Language Models, Natural Language Processing... ...research conferences and visioning activities. Deliver full projects... ...and affirmative action employer. We are committed...
Language
Google Inc.
Mountain View, CA
1 day ago
Senior ML Engineer, Autonomous Driving Foundation Models
$244.14k - $413.16k
...lead the development of large-scale Vision-Language-Action models for autonomous vehicles. The role involves... ..., driving generative model research, and refining driving policies through... ...learning experience, especially with VLA and embedded AI technologies, alongside...
Language
Full time
XPENG
Santa Clara, CA
12 hours ago
Applied Research Scientist, Proactive Intelligence, — Agentic Systems and Generative Modeling
$184.7k - $324.8k
Applied Research Scientist, Proactive Intelligence, — Agentic Systems and Generative Modeling Cupertino, California, United States Machine Learning and AI AI represents a... ...from Human Preferences and Training Large Language Models (LLMs). Working knowledge of large-scale...
Language
Relocation
Apple Inc.
Cupertino, CA
2 days ago
Senior AI/ML Research Scientist - Vision & Multimodal
...that enhance content discovery. The role requires a strong foundation in AI/ML, with experience in training and deploying vision-language models. You will work closely with cross-functional teams and need to demonstrate excellent communication skills. The compensation...
Language
Flexible hours
Netflix
Los Gatos, CA
12 hours ago
Research Scientist 5 Content Representation Models CRM
...Team The Content Representation Models team creates a single, unified “language” for Netflix’s entire library by... ...About the Role We are looking for a Research Scientist specializing in embeddings and... ...in LLMs Experience in computer vision or multimodal AI Industry...
Language
Hourly pay
Full time
Immediate start
Flexible hours
Netflix
Los Gatos, CA
12 hours ago
Senior Research Scientist- Robotics AI
$185k - $215k
Company Description Bosch Research and Technology... ...focuses on foundation models, big data visual... ...explainable AI (XAI), natural language processing, computer vision and mixed reality,... ...a Senior Research Scientist - Robotics AI, you... ...vision‑language (action) models to aid reasoning...
Language
Work experience placement
Local area
Worldwide
Bosch USA
Sunnyvale, CA
12 hours ago
Machine Learning Research Scientist, Mechanical Intuition in Multimodal Models
$176k - $253k
...Job Description At Toyota Research Institute (TRI), we’re on a... ...are looking for a Research Scientist to join us in building intelligent... ...to explore how large language models and agentic infrastructure can... ...with large language models, vision-language models, or agentic...
Language
Work experience placement
Internship
Local area
Shift work
Toyota Research Institute
Los Altos, CA
15 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Research Scientist- Vision-Language-Action (VLA) Models. Be the first to apply!