Senior Research Scientist - Vision-Language-Action (VLA) Models (Hiring Immediately)
$185k - $215k
Part-time
Bosch Group
Company Description
The Bosch Research and Technology Center North America, with offices in Sunnyvale, California; Pittsburgh, Pennsylvania; and Cambridge, Massachusetts, is part of the global Bosch Group, a company with over 70 billion euro in revenue, 400,000 employees worldwide, a very diverse product portfolio, and a history spanning over 125 years. The Research and Technology Center North America (RTC-NA) is dedicated to providing technologies and system solutions for various Bosch business fields, primarily in the fields of artificial intelligence, energy technologies, internet technologies, circuit design, semiconductors and wireless, as well as advanced MEMS design.
As part of Bosch's global research, our AI research in Silicon Valley focuses on Foundation Models, Big Data Visual Analytics, Explainable AI (XAI), Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics, Data Science, AI System Engineering, and Time-series Analysis. We develop scalable, intelligent, and trustworthy AIoT solutions for Bosch products and services in application areas such as automated driving, advanced driver assistance systems (ADAS), robotics, smart manufacturing, enterprise AI, health care, and smart home and building solutions.
Originating from the AI research in Silicon Valley, our Intelligent Autonomous Systems group is responsible for enabling future autonomous Bosch products by pushing the boundaries of automated driving, ADAS, robotics, and automation through key innovations that encompass system architecture and AI components. These include methods for motion planning, high-level task planning, and decision making, as well as frameworks that build on reliable distributed computing to make these technologies work on real products. We work with internal partners from different Bosch business units to transfer our solutions into future products. We also actively collaborate with leading groups in academia and industry to promote research ideas and publish research findings in internationally renowned conferences and journals such as CVPR, ICRA, IROS, RSS, NeurIPS, and CoRL.
Job Description
As a Senior Research Scientist - Vision-Language-Action (VLA) Models, you will contribute to research projects at the forefront of the ADAS/AD industry. Key responsibilities include:
- Conduct research and engineering in core AI and machine learning fields to enable Embodied AI (including computer vision, autonomous planning, and open-world learning) for related business domains such as ADAS/AD, industrial automation, and robotics.
- Push the boundaries in (modular) end-to-end perception and planning for ADAS/AD, incorporating advancements in large vision-language-(action) models to aid reasoning capabilities and explainability.
- Collaborate cross-functionally with global research and engineering teams to ensure seamless technology transfer and system integration.
- Implement research results to solve real-world challenges, ensuring high-quality system integration within Bosch's existing platforms.
- Stay at the forefront of innovation by actively engaging with academic and industry communities through conferences, workshops, and technical events.
- Document and disseminate research findings through high-caliber publications and/or patent submissions.
Qualifications
- Ph.D. in Computer Science, Robotics, or a related discipline, or a Master's degree with >= 2/4 years of industry experience after graduation.
- A minimum of 5 years of R&D experience, or an equivalent graduate research background, primarily in AI technologies including Computer Vision and Robotic or Automotive Motion and Behavioral Planning.
- Proficiency in one or more programming languages commonly used in machine learning (e.g., Python, C++, Rust).
- Strong interpersonal, communication, and teamwork capabilities.
- Knowledge of major machine learning frameworks like TensorFlow or PyTorch.
- Hands-on experience in reinforcement learning for behavior planning, motion planning, or other applicable contexts, and familiarity with common RL techniques (e.g., PPO, DQN, DDPG).
- A strong portfolio of publications in premier machine learning, deep learning, robotics and computer vision journals and conferences.
- Experience with real-world product development and deployment of autonomous systems.
- Hands-on experience building and applying multimodal transformer-based sequence-to-sequence models, especially multimodal vision-language-action models.
- Hands-on experience in computer vision and deep learning, with work in any of the following areas: multimodal transformers, multimodal language models, diffusion models, NeRF, Gaussian splatting, object detection/segmentation, 3D scene understanding, sensor calibration, SfM, voxel/BEV grid-based feature representation.
Vacancy posted 3 days ago



