Senior Research Scientist - Vision-Language-Action (VLA) Models

$185k - $215k
Full-time

Bosch Group

Company Description

The Bosch Research and Technology Center North America, with offices in Sunnyvale, California, Pittsburgh, Pennsylvania, and Cambridge, Massachusetts, is a part of the global Bosch Group, a company with over 70 billion euros in revenue, 400,000 employees worldwide, a very diverse product portfolio, and a history spanning over 125 years. The Research and Technology Center North America (RTC-NA) is dedicated to providing technologies and system solutions for various Bosch business fields, primarily in the fields of artificial intelligence, energy technologies, internet technologies, circuit design, semiconductors and wireless, as well as advanced MEMS design.

As a part of the global research organization, our AI research in Silicon Valley focuses on Foundation Models, Big Data Visual Analytics, Explainable AI (XAI), Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics, Data Science, AI System Engineering, and Time-series Analysis. We develop scalable, intelligent, and trustworthy AIoT solutions for Bosch products and services in application areas such as automated driving, advanced driver assistance systems (ADAS), robotics, smart manufacturing, enterprise AI, health care, smart home and building solutions.

Originating from the AI research in Silicon Valley, our Intelligent Autonomous Systems group is responsible for enabling future autonomous Bosch products by pushing the boundaries of automated driving, advanced driver assistance systems (ADAS), robotics and automation through key innovations that encompass system architecture and AI components. These include methods for motion planning, high-level task planning and decision making, as well as systems for making these technologies work on real products by building frameworks that take advantage of technologies in the field of reliable distributed computing. We work with internal partners in different Bosch business units to transfer our solutions into future products. We also actively collaborate with leading groups in academia and industry to promote research ideas and publish research findings in internationally renowned conferences and journals such as CVPR, ICRA, IROS, RSS, NeurIPS and CoRL.

Job Description

As a Senior Research Scientist - Vision-Language-Action (VLA) Models, you contribute to research projects at the forefront of the ADAS/AD industry. Key responsibilities include:
  • Conduct research and engineering in core AI and machine learning fields to enable Embodied AI (including computer vision, autonomous planning, open-world learning, and so on) for related business domains of ADAS/AD, industrial automation, robotics, etc.
  • Push the boundaries in (modular) end-to-end perception and planning for ADAS/AD, incorporating advancements in large vision-language-(action) models to aid reasoning capabilities and explainability.
  • Collaborate cross-functionally with global research and engineering teams to ensure seamless technology transfer and system integration.
  • Implement research results to solve real-world challenges, ensuring high-quality system integration within Bosch's existing platforms.
  • Stay at the forefront of innovation by actively engaging with academic and industry communities through conferences, workshops, and technical events.
  • Document and disseminate research findings through high-caliber publications and/or patent submissions.

Qualifications

Basic Qualifications
  • Ph.D. in Computer Science, Robotics or a related discipline or Master's degree with >= 2/4 years industry experience after graduation.
  • A minimum of 5 years of R&D experience, or an equivalent graduate research background, primarily in AI technologies including Computer Vision and Robotic or Automotive Motion and Behavioral Planning.
  • Proficiency in one or more programming languages commonly used in machine learning (e.g., Python, C++, Rust).
  • Strong interpersonal, communication, and teamwork capabilities.
  • Knowledge of major machine learning frameworks like TensorFlow or PyTorch.
  • Hands-on experience in reinforcement learning for behavior or motion planning or other applicable contexts and familiarity with common RL techniques (e.g., PPO, DQN, DDPG).
  • A strong portfolio of publications in premier machine learning, deep learning, robotics and computer vision journals and conferences.

Preferred Qualifications
  • Experience with real-world product development and deployment of autonomous systems.
  • Hands-on experience building and applying multimodal transformer-based sequence-to-sequence models, especially multimodal vision-language-action models.
  • Hands-on experience in computer vision and deep learning, with work in any of the following areas: multimodal transformers, multimodal language models, diffusion models, NeRF, Gaussian splatting, object detection / segmentation, 3D scene understanding, sensor calibration, SfM, voxel/BEV grid-based feature representation.

Additional Information

We offer a competitive base salary for this position with a range in US-California of $185,000 - $215,000, along with an annual corporate bonus and a long-term incentive bonus designed to reward sustained impact and contribution over time. Within the salary range, the individual pay is determined based on several factors, including, but not limited to, work experience and job knowledge, complexity of the role, job location, etc.

Your well-being matters at Bosch! We offer a benefits package designed to empower you in every area of your life. This includes premium health coverage, a 401(k) with generous matching, resources for financial planning and goal setting, ample paid time off, parental leave, and comprehensive life and disability protection. Your recruiter can share more details for this position during the interview process. Learn more about our full benefits offerings by visiting:

Equal Opportunity Employer, including disability / veterans.
*Bosch adheres to Federal, State, and Local laws regarding drug-testing. Employment is contingent upon the successful completion of a drug screen and background check. Candidates who have been offered the position must pass both screenings before their start date.

#LI-JM1

Legal Entity: Robert Bosch LLC

Vacancy posted 1 day ago