Research Scientist- Vision-Language-Action (VLA) Models

$165k - $185k

Full-time

Bosch Group

Company DescriptionThe Bosch Research and Technology Center North America with offices in Sunnyvale, California, Pittsburgh, Pennsylvania, and Cambridge, Massachusetts is a part of the global Bosch Group ( a company with over 70 billion euro revenue, 400,000 employees worldwide, a very diverse product portfolio, and a history spanning over 125 years. The Research and Technology Center North America (RTC-NA) is dedicated to providing technologies and system solutions for various Bosch business fields, primarily in the field of artificial intelligence, energy technologies, internet technologies, circuit design, semiconductors and wireless, as well as advanced MEMS design.As a part of the global research, our AI research in Silicon Valley focuses on Foundation Models, Big Data Visual Analytics, Explainable AI (XAI), Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics, Data Science, AI System Engineering, Time-series Analysis. We develop scalable, intelligent, and trustworthy AIoT solutions for Bosch products and services in application areas such as automated driving, advanced driver assistance systems (ADAS), robotics, smart manufacturing, enterprise AI, health care, smart home and building solutions.Originating from the AI research in Silicon Valley, our Intelligent Autonomous Systems group is responsible for enabling future autonomous Bosch products by pushing the boundaries of automated driving, advanced driver assistance systems (ADAS), robotics and automation through key innovations that encompass system architecture and AI components. These include methods for motion planning, high level task planning and decision making as well as systems for making these technologies work on real products by building frameworks that take advantage of technologies in the field of reliable distributed computing. We work with internal partners of different Bosch business units to transfer our solutions into future products. We also actively collaborate with leading groups in academia and industry to promote research ideas and publish research findings in internationally renowned conferences and journals such as CVPR, ICRA, IROS, RSS, NeurIPS and CoRL.Job DescriptionAs a Research Scientist- Vision-Language-Action (VLA) Models, you contribute to research projects at the forefront of the ADAS/AD industry. Key responsibilities include:Conduct research and engineering in core AI and machine learning fields to enable Embodied AI (including computer vision, autonomous planning, open-world learning, and so on) for related business domains of ADAS/AD, industrial automation, robotics etc.Push the boundaries in (modular) end-to-end perception and planning for ADAS/AD, incorporating advancements in large vision-language-(action) models to aid reasoning capabilities and explainability.Collaborate cross-functionally with global research and engineering teams to ensure seamless technology transfer and system integration.Implement research results to solve real-world challenges, ensuring high-quality system integration within Bosch's existing platforms.Stay at the forefront of innovation by actively engaging with academic and industry communities through conferences, workshops, and technical events.Document and disseminate research findings through high-caliber publications and/or patent submissions.QualificationsBasic QualificationsPh.D. in Computer Science, Robotics or a related discipline or Master's degree with >= 2 years industry experience after graduation.A minimum of 3 years of R&D experience, or an equivalent graduate research background, primarily in AI technologies including Computer Vision and Robotic or Automotive Motion and Behavioral Planning.Proficiency in one or more programming languages commonly used in machine learning (e.g., Python, C++, Rust).Strong interpersonal, communication, and teamwork capabilities.Knowledge of major machine learning frameworks like TensorFlow or PyTorch.Hands-on experience in reinforcement learning for behavior or motion planning or other applicable contexts and familiarity with common RL techniques (e.g. PPO, DQN, DDPG).A strong portfolio of publications in premier machine learning, deep learning, robotics and computer vision journals and conferences.Preferred QualificationsExperience with real-world product development and deployment of autonomous systems.Hands-on experience building and applying multimodal transformer-based sequence-to-sequence models, especially multimodal vision-language-action models.Hands-on experience in computer vision and deep learning, with work in any of the following areas: multimodal transformers, multimodal language models, diffusion models, NeRF, gaussian splatting, object detection / segmentation, 3D scene understanding, sensor calibration, SfM, voxel/BEV grid-based feature representation.Additional InformationWe offer a competitive base salary for this position with a range in US-California of --$165,000 - $185,000 along with an annual corporate bonus, and a long-term incentive bonus designed to reward sustained impact and contribution over time. Within the salary range, the individual pay is determined based on several factors, including, but not limited to, work experience and job knowledge, complexity of the role, job location, etc.Your well-being matters at Bosch! We offer a a benefits package designed to empower you in every area of your life. This includes premium health coverage, a 401(k) with generous matching, resources for financial planning and goal setting, ample paid time off, parental leave, and comprehensive life and disability protection. Your Recruiter can share more details for this position during the interview process.Learn more about our full benefits offerings by visiting: Opportunity Employer, including disability / veterans.*Bosch adheres to Federal, State, and Local laws regarding drug-testing. Employment is contingent upon the successful completion of a drug screen and background check. Candidates who have been offered the position must pass both screenings before their start date.#LI-JM1 Legal Entity: Robert Bosch LLC

Apply

Vacancy posted 2 days ago

Similar jobs that could be interesting for youBased on the Research Scientist- Vision-Language-Action (VLA) Models in Sunnyvale, CA vacancy

Senior Research Scientist- Vision- Language- Action (VLA) Models
$185k - $215k
...Description The Bosch Research and Technology Center... ...focuses on Foundation Models, Big Data Visual Analytics... ...AI (XAI), Natural Language Processing, Computer Vision & Mixed Reality, Cloud... ...As a Senior Research Scientist- Vision- Language- Action (VLA) Models, you contribute...
Language
Full time
Work experience placement
Local area
Worldwide
Bosch Group
Sunnyvale, CA
2 days ago
Research Scientist: Vision-Language-Action for Embodied AI
$165k - $185k
Robert Bosch Group seeks a motivated Research Scientist specializing in Vision-Language-Action Models in Sunnyvale, California. The role emphasizes cutting-edge research in AI, focusing on autonomous systems and collaboration across global teams. The successful candidate...
Language
Robert Bosch Group
Sunnyvale, CA
4 days ago
Research Scientist - World-Action Foundation Model, Robotics
$126k - $423k
...looking for multiple passionate Research Scientists to join the Research Group... ...on pretraining world-action foundation model with various world modalities including vision and physics associated with... ...human data incorporation, language modality, and spatial reasoning...
Language
Full time
For contractors
For subcontractor
Casual work
Work at office
Immediate start
Remote work
Day shift
Applied Intuition
Sunnyvale, CA
12 hours ago
Machine Learning Research Scientist: Generative Modeling for Planning
$160.36k - $240.54k
...Machine Learning Research Scientist: Generative Modeling for Planning Mountain View, California (HQ)... ...foundation models. Leverage large language models and world foundation models... ...autonomous driving. Experiences in vision-language-action models, reinforcement learning...
Language
Nuro
Mountain View, CA
1 day ago
Senior Research Scientist, Multimodal Foundation Models and Robotics
$184k - $299k
We are now looking for a Senior Research Scientist focused on Multimodal Foundation Models and Robotics! NVIDIA is... ...following topics: LLMs; Large vision-language models; Video generative models and diffusion algorithms; or Action-based transformers. Outstanding...
Language
NVIDIA Corporation
Santa Clara, CA
1 day ago
Senior Vision Language Model Engineer
$184k - $287.5k
...built. We are seeking a senior vision language model engineer to design and build... ...doing: Partner with our researchers to develop and evaluate... ...g., video, sensor, language/action traces) tailored for end‑to‑... ...modeling, and multimodal VLM/VLA or foundation models. Excellent...
Language
NVIDIA
Santa Clara, CA
19 hours ago
Research Scientist, Robotics, Embodied AI, DeepMind
$147k - $211k
...Experience training embodied reasoning VLMs (Vision Language Models). Experience working with simulators... ..., you will develop Vision Language Action (VLA) models that combine Gemini's world... .... Write software to implement research ideas and iterate. Participate in research...
Language
Full time
Google Inc.
Mountain View, CA
19 hours ago
Research Scientist - Computer Vision
$150k
...Institute of Foundation Models We are a dedicated research lab for building, understanding... ...-class researchers, data scientists, and engineers, tackling... ...specializing in Computer Vision your role will be crucial... ...‑related concepts (e.g., language modeling, computer vision...
Language
Visa sponsorship
Institute of Foundation Models
Sunnyvale, CA
4 days ago
Embodied Robotics AI Scientist - Vision-Language Action
$147k - $211k
Google Inc. in Mountain View, CA is seeking a Roboticist to develop Vision Language Action models that enhance robot capabilities. This role involves designing algorithms, implementing research ideas, and collaborating with teams in a cutting-edge AI environment. A PhD...
Language
Google Inc.
Mountain View, CA
19 hours ago
Senior AI Research Scientist- Time-Series Foundational Models
$165k - $195k
Senior AI Research Scientist- Time-Series Foundational Models Full-time The Bosch Research and Technology Center North America with offices in... ...Valley focuses on Foundation Models, Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics, Big Data...
Language
Full time
Work experience placement
Local area
Worldwide
Robert Bosch Group
Sunnyvale, CA
19 hours ago
Research Engineer/Research Scientist- Video Generation Modeling
...foundation world models that control our robots... ...our cutting edge research and end-to-end... ...looking for Research Scientists and Research... ...training for our video action model. Our... ...architectures) or language model pretraining... ...than robot-data-only VLA models Your models...
Language
Rhoda ai
Palo Alto, CA
3 days ago
Principal Engineer, ML (VLA Automated Driving)
$235.52k - $323.04k
...Summary: The Principal Engineer, ML (VLA Automated Driving) is the technical anchor for Vision-Language-Action (VLA /VLAM) models for our Level 2++ to Level 4 Automated Driving... ...systems and helps turn promising research into robust in-vehicle capability. This...
Language
Permanent employment
Temporary work
Cariad, Inc.
Mountain View, CA
a month ago
Senior AI Research Scientist- World Model
$165k - $185k
Company Description The Bosch Research and Technology Center North America with offices... ...Silicon Valley focuses on Foundation Models, Big Data Visual Analytics, Explainable AI (XAI), Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics, Data Science...
Language
Full time
Work experience placement
Worldwide
Bosch Group
Sunnyvale, CA
2 days ago
AI Experience Researcher, Product Evaluation, Vision Products Group
$141.8k - $258.6k
AI Experience Researcher, Product Evaluation, Vision Products Group Sunnyvale... ...collaborating with ML and data scientists, software engineers... ...patterns in model behaviors and... ...translating results into actionable recommendations for... ..., preferably Large Language Models Familiarity...
Language
Relocation
Apple Inc.
Sunnyvale, CA
19 hours ago
Machine Learning Research Scientist, Mechanical Intuition in Multimodal Models
$176k - $253k
...At Toyota Research Institute (TRI), we're on a mission to improve... ...are looking for a Research Scientist to join us in building intelligent... ...to explore how large language models and agentic infrastructure can... ...with large language models, vision-language models, or agentic...
Language
Work experience placement
Internship
Local area
Remote work
Shift work
Toyota Research Institute
Los Altos, CA
2 days ago
Research Scientist - Agentic AI & Foundation Models
$150k
A leading AI research lab in Sunnyvale is seeking a professional to work on advanced agentic language models. The role involves developing algorithms, contributing to research, and collaborating on state-of-the-art models. Ideal candidates hold a degree in Computer Science...
Language
Visa sponsorship
Institute of Foundation Models
Sunnyvale, CA
1 day ago
Research Scientist: Data for Foundation Models
$150k
A leading AI research institution in Sunnyvale is seeking a Research Scientist to curate web-scale data crucial for developing foundation models. This role involves pioneering data collection methods... ...and have experience with large language models. Competitive salary...
Language
Institute of Foundation Models
Sunnyvale, CA
4 days ago
Research Scientist - Robotics
$158k - $304k
...role We are looking for a passionate Research Scientist to join the Research Team at Applied... ...and tools to develop cutting‑edge models at scale. In addition to your research... ...robotic foundation model and vision‑language‑action model, reinforcement learning and real...
Language
Full time
For contractors
For subcontractor
Casual work
Work at office
Immediate start
Remote work
Flexible hours
Decisive Point
Mountain View, CA
1 day ago
Research Scientist - Autonomy
$158k - $304k
...role We are looking for a passionate Research Scientist to join the Research Team at Applied... ...and tools to develop cutting‑edge models at scale. In addition to your research... ...including end‑to‑end autonomy, vision‑language‑action model, reinforcement learning Work...
Language
Full time
For contractors
For subcontractor
Casual work
Work at office
Immediate start
Remote work
Day shift
Decisive Point
Mountain View, CA
1 day ago
Research Scientist- End-to-End Planning in Autonomous Systems
$165k - $185k
Research Scientist- End-to-End Planning in Autonomous Systems Full-time... ...focuses on Foundation Models, Big Data Visual Analytics... ...Explainable AI (XAI), Natural Language Processing, Computer Vision & Mixed Reality, Cloud... ...large vision-language-(action) models to aid reasoning...
Language
Full time
Work experience placement
Local area
Worldwide
Robert Bosch Group
Sunnyvale, CA
4 days ago
Machine Learning Research Scientist, Mechanical Intuition in Multimodal Models
$176k - $253k
At Toyota Research Institute (TRI), we’re on a mission to improve... ...are looking for a Research Scientist to join us in building intelligent... ...to explore how large language models and agentic infrastructure can... ...with large language models, vision‑language models, or agentic...
Language
Work experience placement
Internship
Local area
Shift work
Toyota Research Institute
Los Altos, CA
4 days ago
Machine Learning Research Scientist, Mechanical Intuition in Multimodal Models
...Description Job Description At Toyota Research Institute (TRI), we’re on a mission to... ...Design and implement end-to-end modeling pipelines for machine assembly tasks,... ...Qualifications Familiarity with large language models, vision-language models, or agentic AI frameworks...
Language
Work experience placement
Internship
Local area
Shift work
Toyota Research Institute
Los Altos, CA
23 days ago
Senior Staff Machine Learning Engineer - Autonomous Driving Foundation Models
$244.14k - $413.16k
...Engineer – Autonomous Driving Foundation Models Santa Clara, CA XPENG is a leading... ..., we are developing large-scale Vision-Language-Action (VLA) models and World Models to handle the... ...The ability to balance cutting-edge research with the deterministic requirements of...
Language
Full time
Overseas
XPENG
Santa Clara, CA
19 hours ago
Localization QA Tester / Proofreader - Multiple Languages (Cupertino, CA)
$35 per hour
...services in translation, localization, and adaptation for over 250 languages with a growing network of over 400,000 in-country linguistic... ...: ▪️ Medical Insurance ▪️ Dental Insurance ▪️ Vision Insurance ▪️ FSA and HSA ▪️ VoluntaryLifeInsurance ▪️ 401(k)RetirementPlan...
Language
Hourly pay
Remote work
GrabJobs
Cupertino, CA
4 days ago
Senior Staff Research Scientist, Google Cloud AI Research
$248k - $349k
Senior Staff Research Scientist, Google Cloud AI Research corporate... ...data mining, natural language processing, hardware... ...our advanced AI models, delivers computing power... ...conferences and visioning activities. Deliver... ...opportunity and affirmative action employer. We are...
Language
Full time
Worldwide
Google Inc.
Sunnyvale, CA
19 hours ago
Staff Machine Learning Engineer - Autonomous Driving Model Quantization & Deployment
$215.28k - $364.32k
...Learning Engineer – Autonomous Driving Model Quantization & Deployment Santa... ...smart connectivity. The challenge of Vision-Language-Action (VLA) models and Foundation Models isn't... ...Engineer to bridge the gap between massive research models and production-ready L4...
Language
Full time
XPENG
Santa Clara, CA
19 hours ago
Founding Forward Engineer
...and turn Fish Audio's models into production voice systems... ...the bridge between our research and a customer's... ...regulated data, or seven languages in a single conversation... ...ownership and bias for action - you move without... ...Platinum medical, dental, and vision coverage. Daily lunch...
Language
Contract work
Work at office
Remote work
Visa sponsorship
39 Ai, Inc.
Mountain View, CA
19 hours ago
Machine Learning Researcher, Foundation Models [SWE Org]
$147.4k - $272.1k
...Machine Learning Researcher, Foundation Models [SWE Org] We build frontier foundation... ...tasks to precise, action-taking workflows. If you're... ...world-class engineers and scientists to tackle some of the most... ...focus on pretraining, large language model (LLM) architecture,...
Language
Relocation
Apple
Cupertino, CA
3 days ago
Staff Machine Learning Engineer - Foundation Model
$215.28k - $364.32k
...Machine Learning Engineer - Foundation Model Santa Clara, CA XPENG is a... ...time Machine Learning Engineer / Research Scientist to drive the modeling and algorithmic... ...development of XPENG's next-generation Vision-Language-Action (VLA) Foundation Model — the core brain...
Language
Full time
XPENG
Santa Clara, CA
1 day ago
World Model Research Scientist- Physical AI
$190k - $250k
...large-scale generative world models that learn to predict... ...trucks. We are looking for a research scientist to lead the design and development... ...scenarios conditioned on actions, 3D scene context, and text.... ...Excellent Medical, Dental, and Vision plans through Kaiser...
Temporary work
Work at office
Visa sponsorship
Flexible hours
Kodiak
Mountain View, CA
6 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Research Scientist- Vision-Language-Action (VLA) Models. Be the first to apply!