Research Scientist, Vision-Language Multimodal AI
Institute of Foundation Models
The Institute of Foundation Models in Sunnyvale, California is seeking a Research Scientist for their Vision Language Model team. This role involves advancing state-of-the-art multimodal foundation models and developing large-scale systems combining visual understanding and reasoning. Qualified candidates will have a PhD in a relevant field and strong Python/PyTorch skills. Responsibilities include research, architecture development, and mentoring junior researchers while contributing to impactful AI solutions.
$185k - $215k
...Description The Bosch Research and Technology... ...global research, our AI research in Silicon... ...Explainable AI (XAI), Natural Language Processing, Computer Vision & Mixed Reality,... ...a Senior Research Scientist- Vision-Language-... ...building and applying multimodal transformer-based...
Language, Part time, Work experience placement, Local area, Immediate start, Worldwide
- General Motors seeks a Staff Research Scientist specializing in Vision-Language Models (VLMs) to redefine mobility and shape autonomous transportation. Based remotely... ..., training, and optimization of state-of-the-art AI models for autonomous vehicles. A Ph.D. and 5+ years...
Language, Remote job
$184k - $299k
...We are now looking for a Senior Research Scientist focused on Multimodal Foundation Models and Robotics! NVIDIA... ..., large-scale robot learning, game AI, and physical simulation. Our past... ...the following topics: LLMs; Large vision-language models; Video generative models and...
Language
$164.4k - $193.55k
...Description: Title: Machine Learning/Computer Vision Research Scientist Job Location: 2535 Augustine Drive, 3... ...a focus on developing generalized AI solutions capable of few-shot learning... ...learning/deep learning and natural language processing techniques for addressing numerous...
Language, Full time, Work at office, Remote work
$150k
...Models We are a dedicated research lab for building,... ...the next generation of AI builders, and drive transformative... ...researchers, data scientists, and engineers,... ...Scientist in the Vision Language Model (VLM) team, your... ...advancing state-of-the-art multimodal foundation models that...
Language
$165k - $180k
AI Research Scientist - GenAI at Bosch Group - Sunnyvale, CA, United States Company Description... ..., Explainable AI (XAI), Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics,... ...following areas: AI agent, RAG, multimodal transformers, multimodal language...
Language, Work experience placement, Worldwide
$181.1k - $318.4k
...States Machine Learning and AI We're seeking research scientists and engineers to create... ...innovations in language models, machine learning,... ...reinforcement learning, computer vision, robotics, and related areas... ...models (including large multimodal models) and agentic...
Language, Relocation
$38 per hour
...We are hiring Creative Writing Generative AI Analysts in the United States to perform annotation efforts of multimedia and language data labeling and review work (e.g., video... ...datasets, including text, image, video, and multimodal content Create, review, and evaluate...
Language, Full time, Remote work
- NLP PEOPLE in Los Gatos, California is looking for a Machine Learning Scientist to lead the development of cutting-edge large language and vision-language models. Join a dynamic team focused on optimizing algorithms for real-time interaction in gaming. The ideal candidate...
Language
- ...technology company in Santa Clara seeks a Machine Learning Researcher for its AIML Multimodal Foundation Model Team. You will develop advanced... ...applied machine learning and proficiency in programming languages such as Python or C++. The role offers a competitive salary...
Language
$50 per hour
...Lead Generative AI Analyst We are looking for highly detail... ...Chinese (zh-CN) multimedia and language data labeling and review work... ...What You'll Do Lead a team of multimodal annotators and reviewers to... ...multimodal data labeling, computer vision labeling, content QA, or a...
Language, Full time
$192k - $304.75k
Responsibilities Conduct original research in the space of generative AI Implement and train large-scale generative AI models for various content... ...of theory and practice of deep learning, computer vision, natural language processing, or computer graphics Track record of...
Language
$168k - $264.5k
NVIDIA Research’s "AI-Mediated Reality and Interaction" team is looking for a research scientist. Our mission is to create interactive physical... ...generative modeling, large language models, human behavior understanding... ...at top venues in computer vision, artificial intelligence...
Language
$117.2k - $313.7k
About the Role Salesforce AI Research is seeking outstanding AI Research Scientists / Research Engineers to build and deploy high‑impact AI... ...reinforcement learning, autonomous workflows Multimodal & Computer Vision - Vision‑language models, video understanding, visual...
Language, Full time
- A leading technology company is seeking a Senior Research Scientist to focus on Multimodal Foundation Models and Robotics. This position involves designing AI algorithms for humanoid robots, developing training methods for foundation models, and working with a collaborative...
$248k - $349k
Senior Staff Research Scientist, Google Cloud AI Research, Google, Sunnyvale, CA, USA... ...deep) learning, data mining, natural language processing, hardware and software performance... ...research conferences and visioning activities. Deliver full projects...
Language, Full time, Worldwide
$165k - $195k
Company Description The Bosch Research and Technology Center North America with offices... ....As a part of the global research, our AI research in Silicon Valley focuses on Foundation Models, Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics, Big Data...
Language, Full time, Work experience placement, Local area, Worldwide
$48 - $63 per hour
Samsung AI Research Center (SAIC) located in Mountain View, California... ...research interns for the Vision Intelligence lab. Our goal is... ...pushing the state-of-the-art in multimodal understanding and generation... ...of experts, etc. Vision Language Action Models (VLA) for...
Language, Hourly pay, Internship, Work at office, Worldwide
$170.7k - $300.2k
...and craft upcoming research directions in the field of multimodal foundation models that... ...deeply technical scientists and engineers to... ...touching aspects of language and multimodal foundation... ...Craft technical vision for the team Work... ...development of cutting-edge AI technologies in the...
Language, Relocation
- ...innovative and driven Applied Researchers in deep learning. Ideal... ...NVIDIA is a global leader in AI‑focused high‑performance and... ...State Space Models. Develop multimodal learning systems that integrate... ...models that integrate vision, language, and structured/time‑series...
Language
$147.4k - $272.1k
...United States Machine Learning and AI We are actively seeking... ...is currently achievable with multimodal inputs and large language models. Our centralized applied research and engineering group is dedicated... ...cutting‑edge Computer Vision and Machine Perception technologies...
Language, Relocation
$184.7k - $324.8k
3D Computer Vision Research Engineer, Apple Maps Cupertino, California,... ...States — Machine Learning and AI Apple is where individual... ...working with extremely large multimodal datasets collected from aerial... .... Experience with vision‑language models, multimodal LLMs, reasoning...
Language, Relocation
- ...You’ll Do Lead hands‑on research at the intersection of... ...processing and modern AI methods. Invent and advance... ...processing, computer vision, graphics, and content... ...Applications in vision, audio, or multimodal domains (e.g., source... ...synthesis, captioning, language models). Experience...
Language, Local area, Worldwide, Flexible hours
$150k
A leading AI research lab in Sunnyvale is seeking a professional to work on advanced agentic language models. The role involves developing algorithms, contributing to research, and collaborating on state-of-the-art models. Ideal candidates hold a degree in Computer Science...
Language, Visa sponsorship
$192k - $356.5k
NVIDIA Gruppe is looking for a Senior Research Scientist specializing in Multi-Modal Language Models to enhance our open-source multi-modal models. You will collaborate... ...field and over 4 years of experience in computer vision, particularly with multi-modal LLMs. Competitive...
Language
$147.4k - $272.1k
Computer Vision & Machine Learning Engineer Sunnyvale, California... ...Machine Learning and AI Apple is where individual... ...is a centralized applied research and engineering organization... ...in at least one area of multimodal systems (e.g. vision, language, video, etc.) Proficiency...
Language, Relocation
$311.85k - $370k
...developer of Embodied AI technology. Our... ...systems. Our vision is to create autonomy... ...family of vision-language-action models, let... ...at either Applied Scientist or Machine Learning... ...LINGO line): push multimodal perception, reasoning... ...of taking ML from research into production...
Language, Full time, Work at office, Work from home, Shift work
$165k - $185k
Robert Bosch Group in Sunnyvale, California, seeks a Research Scientist in Robotics AI to contribute to advanced ADAS/AD technologies. You will conduct... ...&D experience and strong skills in relevant programming languages. A competitive salary range of $165,000 - $185,000,...
Language
$147k - $211k
...experience. Experience in ML/AI, backed by a publication... ...Google maintains a portfolio of research projects driven by fundamental... ...types of work. As a Research Scientist, you'll set up large-scale tests... ..., data mining, natural language processing, hardware and software...
Language, Full time
- ...team for 3-6 months. The role focuses on advancing research in embodied foundation models, applying state-of-the-art AI methodologies to solve complex problems. Ideal candidates will have experience in vision-language models, Python programming skills, and a keen interest...
Language, Full time, Internship