Research Scientist, Vision-Language Multimodal AI
Institute of Foundation Models
The Institute of Foundation Models in Sunnyvale, California is seeking a Research Scientist for their Vision Language Model team. This role involves advancing state-of-the-art multimodal foundation models and developing large-scale systems combining visual understanding and reasoning. Qualified candidates will have a PhD in a relevant field and strong Python/PyTorch skills. Responsibilities include research, architecture development, and mentoring junior researchers while contributing to impactful AI solutions. #J-18808-Ljbffr Institute of Foundation Models
$165k - $195k
AI Research Scientist- Multimodal Foundational Models Full-time The Bosch Research and Technology Center North America with offices... ...Silicon Valley focuses on Foundation Models,Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics, Big Data Visual...LanguageFull timeWork experience placementLocal areaWorldwide$165k - $185k
Robert Bosch Group seeks a motivated Research Scientist specializing in Vision-Language-Action Models in Sunnyvale, California. The role emphasizes cutting-edge research in AI, focusing on autonomous systems and collaboration across global teams. The successful candidate...Language$185k - $215k
...Description The Bosch Research and Technology... ...global research, our AI research in Silicon... ...Explainable AI (XAI), Natural Language Processing, Computer Vision & Mixed Reality,... ...a Senior Research Scientist- Vision- Language- Action... ...and applying multimodal transformer-based sequence...LanguageFull timeWork experience placementLocal areaWorldwide$50 per hour
...Lead Generative AI Analyst We are looking for highly detail... ...Chinese (zh-CN) multimedia and language data labeling and review work... ...You'll Do Lead a team of multimodal annotators and reviewers to deliver... ...data labeling, computer vision labeling, content QA, or a closely...LanguageFull time$184k - $299k
We are now looking for a Senior Research Scientist focused on Multimodal Foundation Models and Robotics! NVIDIA... ..., large-scale robot learning, game AI, and physical simulation. Our past... ...the following topics: LLMs; Large vision-language models; Video generative models and...Language- ...detail-oriented Lead Generative AI Analyst to lead day-to-day... ...Chinese (zh-CN) multimedia and language data labeling and review work... ...What You’ll Do Lead a team of multimodal annotators and reviewers to... ...multimodal data labeling, computer vision labeling, content QA, or a...LanguageFull time
$150k
...Models We are a dedicated research lab for building,... ...the next generation of AI builders, and drive transformative... ...researchers, data scientists, and engineers,... ...Scientist in the Vision Language Model (VLM) team, your... ...advancing state-of-the-art multimodal foundation models that...Language$165k - $180k
AI Research Scientist - GenAI at Bosch Group - Sunnyvale, CA, United States Company Description... ..., Explainable AI (XAI), Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics,... ...following areas: AI agent, RAG, multimodal transformers, multimodal language...LanguageWork experience placementWorldwide$181.1k - $318.4k
...States Machine Learning and AI We're seeking research scientists and engineers to create... ...innovations in language models, machine learning,... ...reinforcement learning, computer vision, robotics, and related areas... ...models (including large multimodal models) and agentic...LanguageRelocation$140k - $195k
...the Team : Our AI Research team, reporting directly... ...systems-integrating multimodal foundation models,... ...world-class team of scientists and engineers, and represent... ..., including computer vision, robotic manipulation... ...or more of systems languages (C++/Java)...LanguageWork at officeLocal areaWork from homeRelocation packageFlexible hours- ...technology company in Santa Clara seeks a Machine Learning Researcher for its AIML Multimodal Foundation Model Team. You will develop advanced... ...applied machine learning and proficiency in programming languages such as Python or C++. The role offers a competitive salary...Language
$147k - $211k
...Experience training embodied reasoning VLMs (Vision Language Models). Experience working with... ...Preferred qualifications: Experience with AI technology. Experience with Python... ...capabilities. Write software to implement research ideas and iterate. Participate in research...LanguageFull time$168k - $264.5k
NVIDIA Research’s \"AI-Mediated Reality and Interaction\" team is looking for a research scientist. Our mission is to create interactive physical... ...generative modeling, large language models, human behavior understanding... ...at top venues in computer vision, artificial intelligence...Language$117.2k - $313.7k
About the Role Salesforce AI Research is seeking outstanding AI Research Scientists / Research Engineers to build and deploy high‑impact AI... ...reinforcement learning, autonomous workflows Multimodal & Computer Vision - Vision‑language models, video understanding, visual...LanguageFull time$150k
...Models We are a dedicated research lab for building,... ...the next generation of AI builders, and drive transformative... ...researchers, data scientists, and engineers,... ...specializing in Computer Vision your role will be crucial... ...related concepts (e.g., language modeling, computer...LanguageVisa sponsorship$156k - $387.6k
...focus on cutting-edge research in content understanding... ...multilingual learning, multimodal pretraining, and more.... ...limited to): - General AI platform design and development... ...expertise in large language models (LLMs) and... ...to medical, dental, and vision insurance, a 401(k) savings...LanguageTemporary workLocal area- ...mission to make frontier AI truly open for all.... ...develops an open research stack and models in... ...Overview The Research Scientist will be an integral... ...of-the-art in large language models (LLMs), vision language models (... ...including text and multimodal models, and both open...LanguageWorldwideFlexible hours
- A leading technology company is seeking a Senior Research Scientist to focus on Multimodal Foundation Models and Robotics. This position involves designing AI algorithms for humanoid robots, developing training methods for foundation models, and working with a collaborative...
$174k - $252k
Google Inc. seeks a researcher in AI for healthcare in DeepMind's Frontier Health team. You will develop multimodal AI systems and novel methods, while publishing findings that contribute to patient care. The ideal candidate has a PhD in Computer Science, at least 2 years...Full time$150k
A leading AI research lab in Sunnyvale, California, is seeking a Research Scientist specializing in Computer Vision. The role involves advancing innovative computer vision technologies, requiring a Master’s degree and experience with Gen AI techniques. Responsibilities...$35 per hour
...serves localization teams through AI-enabled multilingual content... ...enterprises in more than 300 languages, enabling accurate, culturally... ...(from Canada); be able to research and resolve linguistic questions... ...Insurance • Dental Insurance • Vision Insurance • FSA and HSA •...LanguageHourly payWorldwideMonday to Friday- ...foundation models, LLMs, and multimodal LLMs? We are looking... ...a centralized applied research and engineering... ...time on-device Computer Vision and Machine Perception... ...foundation models and agentic AI systems that enable... ...systems (e.g. vision, language, video, etc.). ~ Proficiency...Language
$262k - $365k
Senior Staff Research Scientist, Google Cloud AI Research Google Sunnyvale, CA, USA Requirements PhD in... ...deep) learning, data mining, natural language processing, hardware and software... ...organizing research conferences and visioning activities. Deliver full projects...LanguageFull timeWorldwide$165k - $195k
The Robert Bosch Group is seeking an AI Research Scientist in Sunnyvale, California. This full-time role focuses on leading research in multimodal foundational models to enhance AI functionalities in various Bosch products. The ideal candidate will have a Ph.D. in a relevant...Full time- ...the full stack of a unified multimodal foundation model, from pretraining... .... This is foundational research with direct physical impact.... ...architectures where vision, language, and kinematics share a unified... ...consequential work in embodied AI right now, with the people and...Language
$48 - $63 per hour
Samsung AI Research Center (SAIC) located in Mountain View, California... ...research interns for the Vision Intelligence lab. Our goal is... ...pushing the state-of-the-art in multimodal understanding and generation... ...of experts, etc. Vision Language Action Models (VLA) for...LanguageHourly payInternshipWork at officeWorldwide$171.6k - $302.2k
...Machine Learning Engineer - Vision Products Software AI is transforming how people interact with... ...intersection of machine learning research and product engineering—rapidly iterating... ...architectures; ranging from large language and multimodal models to ranking and...LanguageRelocation$170.7k - $300.2k
...and craft upcoming research directions in the field of multimodal foundation models that... ...deeply technical scientists and engineers to... ...touching aspects of language and multimodal foundation... ...Craft technical vision for the team Work... ...development of cutting-edge AI technologies in the...LanguageRelocation$147.4k - $272.1k
...Applied Machine Learning Research Engineer - Multimodal for Human... ...foundation and large language models, and many applications... ...in the computer vision and machine learning... ...partnering with HW, SW and AI teams to shape Apple... ...of experts—research scientists, ML engineers,...LanguageWorldwideRelocation$181.1k - $318.4k
...AIML - Machine Learning Researcher - Multimodal Agent The AIML Multimodal Foundation... ...cutting-edge generative AI technologies based on multimodal large language models to enable innovative features... ...applied machine learning, computer vision, multimodal LLM, and agent...LanguageRelocation
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Research Scientist, Vision-Language Multimodal AI. Be the first to apply!
- scientist Sunnyvale, CA
- research scientist machine learning deep learning Sunnyvale, CA
- deep learning scientist Sunnyvale, CA
- senior principal scientist Sunnyvale, CA
- machine learning scientist Sunnyvale, CA
- scientist immunology Sunnyvale, CA
- safety scientist Sunnyvale, CA
- research scientist - biology Sunnyvale, CA
- research scientist Sunnyvale, CA
- lab scientist Sunnyvale, CA



