Research Scientist, Vision-Language Multimodal AI

Institute of Foundation Models

The Institute of Foundation Models in Sunnyvale, California is seeking a Research Scientist for its Vision Language Model team. This role involves advancing state-of-the-art multimodal foundation models and developing large-scale systems combining visual understanding and reasoning. Qualified candidates will have a PhD in a relevant field and strong Python/PyTorch skills. Responsibilities include research, architecture development, and mentoring junior researchers while contributing to impactful AI solutions.

Vacancy posted 20 hours ago
Similar jobs that could be interesting for you
Based on the Research Scientist, Vision-Language Multimodal AI in Sunnyvale, CA vacancy
  • $165k - $195k

     ...AI Research Scientist- Multimodal Foundational Models Full-time The Bosch Research and Technology Center North America with offices...  ...Silicon Valley focuses on Foundation Models, Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics, Big Data Visual... 
    Language
    Full time
    Work experience placement
    Local area
    Worldwide

    Robert Bosch Group

    Sunnyvale, CA
    19 hours ago
  • $165k - $185k

     ...Robert Bosch Group seeks a motivated Research Scientist specializing in Vision-Language-Action Models in Sunnyvale, California. The role emphasizes cutting-edge research in AI, focusing on autonomous systems and collaboration across global teams. The successful candidate... 
    Language

    Robert Bosch Group

    Sunnyvale, CA
    19 hours ago
  • $165k - $185k

     ...Description The Bosch Research and Technology...  ...global research, our AI research in Silicon...  ...Explainable AI (XAI), Natural Language Processing, Computer Vision & Mixed Reality,...  ...As a Research Scientist- Vision- Language- Action...  ...and applying multimodal transformer‑based sequence... 
    Language
    Full time
    Work experience placement
    Local area
    Worldwide

    Robert Bosch Group

    Sunnyvale, CA
    20 hours ago
  •  ...General Motors is seeking a Staff Research Scientist specializing in Vision-Language Models to redefine mobility. You will lead advancements in AI for autonomous driving at the Mountain View Technical Center. This remote position requires a Ph.D. and 5+ years of experience... 
    Language
    Remote work

    General Motors

    Mountain View, CA
    20 hours ago
  • $50 per hour

     ...Lead Generative AI Analyst We are looking for highly detail...  ...Chinese (zh-CN) multimedia and language data labeling and review work...  ...You'll Do Lead a team of multimodal annotators and reviewers to deliver...  ...data labeling, computer vision labeling, content QA, or a closely... 
    Language
    Full time

    Welocalize

    Cupertino, CA
    1 day ago
  • $192k - $304.75k

     ...We are now looking for a Senior Research Scientist focused on Multimodal Foundation Models and Robotics! NVIDIA...  ..., large-scale robot learning, game AI, and physical simulation. Our past...  ...the following topics: LLMs; Large vision‑language models; Video generative models and... 
    Language

    University of Georgia- FACS

    Santa Clara, CA
    19 hours ago
  •  ...detail-oriented Lead Generative AI Analyst to lead day-to-day...  ...Chinese (zh-CN) multimedia and language data labeling and review work...  ...What You’ll Do Lead a team of multimodal annotators and reviewers to...  ...multimodal data labeling, computer vision labeling, content QA, or a... 
    Language
    Full time

    Welo Global

    Cupertino, CA
    27 days ago
  • $150k

     ...Models We are a dedicated research lab for building,...  ...the next generation of AI builders, and drive transformative...  ...researchers, data scientists, and engineers,...  ...Scientist in the Vision Language Model (VLM) team, your...  ...advancing state-of-the-art multimodal foundation models that... 
    Language

    Institute of Foundation Models

    Sunnyvale, CA
    20 hours ago
  • $165k - $180k

    AI Research Scientist - GenAI at Bosch Group - Sunnyvale, CA, United States Company Description...  ..., Explainable AI (XAI), Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics,...  ...following areas: AI agent, RAG, multimodal transformers, multimodal language... 
    Language
    Work experience placement
    Worldwide

    Victrays

    Sunnyvale, CA
    1 day ago
  • $181.1k - $318.4k

     ...States Machine Learning and AI We're seeking research scientists and engineers to create...  ...innovations in language models, machine learning,...  ...reinforcement learning, computer vision, robotics, and related areas...  ...models (including large multimodal models) and agentic... 
    Language
    Relocation

    Apple

    Santa Clara, CA
    1 day ago
  • $192k - $304.75k

     ...Responsibilities Conduct original research in the space of generative AI Implement and train large-scale generative AI models for various content...  ...of theory and practice of deep learning, computer vision, natural language processing, or computer graphics Track record of... 
    Language

    University of Georgia- FACS

    Santa Clara, CA
    1 day ago
  •  ...technology company in Santa Clara seeks a Machine Learning Researcher for its AIML Multimodal Foundation Model Team. You will develop advanced...  ...applied machine learning and proficiency in programming languages such as Python or C++. The role offers a competitive salary... 
    Language

    Apple

    Santa Clara, CA
    1 day ago
  • $117.2k - $313.7k

     ...About the Role Salesforce AI Research is seeking outstanding AI Research Scientists / Research Engineers to build and deploy high‑impact AI...  ...reinforcement learning, autonomous workflows Multimodal & Computer Vision – Vision‑language models, video understanding, visual... 
    Language
    Full time

    Salesforce, Inc.

    Palo Alto, CA
    1 day ago
  • $150k

     ...Models We are a dedicated research lab for building,...  ...the next generation of AI builders, and drive transformative...  ...researchers, data scientists, and engineers,...  ...specializing in Computer Vision your role will be crucial...  ...related concepts (e.g., language modeling, computer vision... 
    Language
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    20 hours ago
  • $140k - $195k

     ...Center About the Team Our AI Research team, reporting...  ...systems—integrating multimodal foundation models, generative...  ...world-class team of scientists and engineers, and...  ..., including computer vision, robotic manipulation...  ...or more of systems languages (C++/Java) Demonstrated... 
    Language
    Work at office
    Relocation package
    Flexible hours

    General Motors

    Mountain View, CA
    19 hours ago
  • $147k - $211k

     ...Experience training embodied reasoning VLMs (Vision Language Models). Experience working with...  ...Preferred qualifications: Experience with AI technology. Experience with Python...  ...capabilities. Write software to implement research ideas and iterate. Participate in research... 
    Language
    Full time

    Google Inc.

    Mountain View, CA
    1 day ago
  • $168k - $264.5k

    NVIDIA Research’s "AI-Mediated Reality and Interaction" team is looking for a research scientist. Our mission is to create interactive physical...  ...generative modeling, large language models, human behavior understanding...  ...at top venues in computer vision, artificial intelligence... 
    Language

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  •  ...mission to make frontier AI truly open for all....  ...develops an open research stack and models in...  ...Overview The Research Scientist will be an integral...  ...of-the-art in large language models (LLMs), vision language models (...  ...including text and multimodal models, and both open... 
    Language
    Worldwide
    Flexible hours

    Oumi

    Palo Alto, CA
    20 hours ago
  •  ...A leading technology company is seeking a Senior Research Scientist to focus on Multimodal Foundation Models and Robotics. This position involves designing AI algorithms for humanoid robots, developing training methods for foundation models, and working with a collaborative... 

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $150k

     ...A leading AI research lab in Sunnyvale, California, is seeking a Research Scientist specializing in Computer Vision. The role involves advancing innovative computer vision technologies, requiring a Master’s degree and experience with Gen AI techniques. Responsibilities... 

    Institute of Foundation Models

    Sunnyvale, CA
    20 hours ago
  • $156k - $387.6k

     ...focus on cutting-edge research in content understanding...  ...multilingual learning, multimodal pretraining, and more....  ...limited to): - General AI platform design and development...  ...expertise in large language models (LLMs) and...  ...to medical, dental, and vision insurance, a 401(k) savings... 
    Language
    Temporary work
    Local area

    TikTok

    San Jose, CA
    2 days ago
  • $174k - $252k

    Google Inc. seeks a researcher in AI for healthcare in DeepMind's Frontier Health team. You will develop multimodal AI systems and novel methods, while publishing findings that contribute to patient care. The ideal candidate has a PhD in Computer Science, at least 2 years... 
    Full time

    Google Inc.

    Mountain View, CA
    1 day ago
  • $35 per hour

     ...serves localization teams through AI-enabled multilingual content...  ...enterprises in more than 300 languages, enabling accurate, culturally...  ...(from Canada); be able to research and resolve linguistic questions...  ...Insurance • Dental Insurance • Vision Insurance • FSA and HSA •... 
    Language
    Hourly pay
    Worldwide
    Monday to Friday

    Welocalize

    Cupertino, CA
    3 days ago
  • $165k - $195k

     ...The Robert Bosch Group is seeking an AI Research Scientist in Sunnyvale, California. This full-time role focuses on leading research in multimodal foundational models to enhance AI functionalities in various Bosch products. The ideal candidate will have a Ph.D. in a relevant... 
    Full time

    Robert Bosch Group

    Sunnyvale, CA
    19 hours ago
  •  ...A leading AI research institute in Sunnyvale, California, seeks a Research Scientist specializing in Natural Language Processing (NLP) to advance cutting-edge technologies and develop intelligent systems. The ideal candidate will hold a Master's or PhD in a relevant field... 
    Language
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    1 day ago
  • $35 per hour

     ...Medical Insurance ▪️ Dental Insurance ▪️ Vision Insurance ▪️ FSA and HSA ▪️ Voluntary...  ...Native-level fluency in their respective language (grammar, vocabulary, composition,...  ...ethic  We may use artificial intelligence (AI) tools to support parts of the hiring... 
    Language
    Hourly pay
    Remote work

    Welo Global

    Cupertino, CA
    a month ago
  • $150k

     ...A leading AI research lab in Sunnyvale is seeking a professional to work on advanced agentic language models. The role involves developing algorithms, contributing to research, and collaborating on state-of-the-art models. Ideal candidates hold a degree in Computer Science... 
    Language
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    20 hours ago
  •  ...the full stack of a unified multimodal foundation model, from pretraining...  .... This is foundational research with direct physical impact....  ...architectures where vision, language, and kinematics share a unified...  ...consequential work in embodied AI right now, with the people and... 
    Language

    Prime Recruitment Partners

    Sunnyvale, CA
    19 hours ago
  • $48 - $63 per hour

    Samsung AI Research Center (SAIC) located in Mountain View, California...  ...research interns for the Vision Intelligence lab. Our goal is...  ...pushing the state-of-the-art in multimodal understanding and generation...  ...of experts, etc. Vision Language Action Models (VLA) for... 
    Language
    Hourly pay
    Internship
    Work at office
    Worldwide

    Samsung-Research-America

    Mountain View, CA
    4 days ago
  • $171.6k - $302.2k

     ...Machine Learning Engineer - Vision Products Software AI is transforming how people interact with...  ...intersection of machine learning research and product engineering—rapidly iterating...  ...architectures; ranging from large language and multimodal models to ranking and... 
    Language
    Relocation

    Apple

    Sunnyvale, CA
    4 days ago
