Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Research Scientist, Vision-Language Multimodal AI

Institute of Foundation Models

The Institute of Foundation Models in Sunnyvale, California is seeking a Research Scientist for their Vision Language Model team. This role involves advancing state-of-the-art multimodal foundation models and developing large-scale systems combining visual understanding and reasoning. Qualified candidates will have a PhD in a relevant field and strong Python/PyTorch skills. Responsibilities include research, architecture development, and mentoring junior researchers while contributing to impactful AI solutions. #J-18808-Ljbffr Institute of Foundation Models

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Research Scientist, Vision-Language Multimodal AI in Sunnyvale, CA vacancy
  • $165k - $195k

    AI Research Scientist- Multimodal Foundational Models Full-time The Bosch Research and Technology Center North America with offices...  ...Silicon Valley focuses on Foundation Models,Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics, Big Data Visual... 
    Language
    Full time
    Work experience placement
    Local area
    Worldwide

    Robert Bosch Group

    Sunnyvale, CA
    1 day ago
  • $165k - $185k

    Robert Bosch Group seeks a motivated Research Scientist specializing in Vision-Language-Action Models in Sunnyvale, California. The role emphasizes cutting-edge research in AI, focusing on autonomous systems and collaboration across global teams. The successful candidate... 
    Language

    Robert Bosch Group

    Sunnyvale, CA
    2 days ago
  • $185k - $215k

     ...Description The Bosch Research and Technology...  ...global research, our AI research in Silicon...  ...Explainable AI (XAI), Natural Language Processing, Computer Vision & Mixed Reality,...  ...a Senior Research Scientist- Vision- Language- Action...  ...and applying multimodal transformer-based sequence... 
    Language
    Full time
    Work experience placement
    Local area
    Worldwide

    Robert Bosch Group

    Sunnyvale, CA
    2 days ago
  • $50 per hour

     ...Lead Generative AI Analyst We are looking for highly detail...  ...Chinese (zh-CN) multimedia and language data labeling and review work...  ...You'll Do Lead a team of multimodal annotators and reviewers to deliver...  ...data labeling, computer vision labeling, content QA, or a closely... 
    Language
    Full time

    Welocalize

    Sunnyvale, CA
    3 days ago
  • $184k - $299k

    We are now looking for a Senior Research Scientist focused on Multimodal Foundation Models and Robotics! NVIDIA...  ..., large-scale robot learning, game AI, and physical simulation. Our past...  ...the following topics: LLMs; Large vision-language models; Video generative models and... 
    Language

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  •  ...detail-oriented Lead Generative AI Analyst to lead day-to-day...  ...Chinese (zh-CN) multimedia and language data labeling and review work...  ...What You’ll Do Lead a team of multimodal annotators and reviewers to...  ...multimodal data labeling, computer vision labeling, content QA, or a... 
    Language
    Full time

    Welo Global

    Sunnyvale, CA
    24 days ago
  • $150k

     ...Models We are a dedicated research lab for building,...  ...the next generation of AI builders, and drive transformative...  ...researchers, data scientists, and engineers,...  ...Scientist in the Vision Language Model (VLM) team, your...  ...advancing state-of-the-art multimodal foundation models that... 
    Language

    Institute of Foundation Models

    Sunnyvale, CA
    3 days ago
  • $165k - $180k

    AI Research Scientist - GenAI at Bosch Group - Sunnyvale, CA, United States Company Description...  ..., Explainable AI (XAI), Natural Language Processing, Computer Vision & Mixed Reality, Cloud Robotics,...  ...following areas: AI agent, RAG, multimodal transformers, multimodal language... 
    Language
    Work experience placement
    Worldwide

    Victrays

    Sunnyvale, CA
    3 days ago
  • $181.1k - $318.4k

     ...States Machine Learning and AI We're seeking research scientists and engineers to create...  ...innovations in language models, machine learning,...  ...reinforcement learning, computer vision, robotics, and related areas...  ...models (including large multimodal models) and agentic... 
    Language
    Relocation

    Apple Inc.

    Santa Clara, CA
    1 day ago
  • $140k - $195k

     ...the Team : Our AI Research team, reporting directly...  ...systems-integrating multimodal foundation models,...  ...world-class team of scientists and engineers, and represent...  ..., including computer vision, robotic manipulation...  ...or more of systems languages (C++/Java)... 
    Language
    Work at office
    Local area
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Mountain View, CA
    2 days ago
  •  ...technology company in Santa Clara seeks a Machine Learning Researcher for its AIML Multimodal Foundation Model Team. You will develop advanced...  ...applied machine learning and proficiency in programming languages such as Python or C++. The role offers a competitive salary... 
    Language

    Apple Inc.

    Santa Clara, CA
    2 days ago
  • $147k - $211k

     ...Experience training embodied reasoning VLMs (Vision Language Models). Experience working with...  ...Preferred qualifications: Experience with AI technology. Experience with Python...  ...capabilities. Write software to implement research ideas and iterate. Participate in research... 
    Language
    Full time

    Google Inc.

    Mountain View, CA
    3 days ago
  • $168k - $264.5k

    NVIDIA Research’s \"AI-Mediated Reality and Interaction\" team is looking for a research scientist. Our mission is to create interactive physical...  ...generative modeling, large language models, human behavior understanding...  ...at top venues in computer vision, artificial intelligence... 
    Language

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $117.2k - $313.7k

    About the Role Salesforce AI Research is seeking outstanding AI Research Scientists / Research Engineers to build and deploy high‑impact AI...  ...reinforcement learning, autonomous workflows Multimodal & Computer Vision - Vision‑language models, video understanding, visual... 
    Language
    Full time

    100 Salesforce, Inc.

    Palo Alto, CA
    1 day ago
  • $150k

     ...Models We are a dedicated research lab for building,...  ...the next generation of AI builders, and drive transformative...  ...researchers, data scientists, and engineers,...  ...specializing in Computer Vision your role will be crucial...  ...related concepts (e.g., language modeling, computer... 
    Language
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    2 days ago
  • $156k - $387.6k

     ...focus on cutting-edge research in content understanding...  ...multilingual learning, multimodal pretraining, and more....  ...limited to): - General AI platform design and development...  ...expertise in large language models (LLMs) and...  ...to medical, dental, and vision insurance, a 401(k) savings... 
    Language
    Temporary work
    Local area

    Tik Tok

    San Jose, CA
    4 days ago
  •  ...mission to make frontier AI truly open for all....  ...develops an open research stack and models in...  ...Overview The Research Scientist will be an integral...  ...of-the-art in large language models (LLMs), vision language models (...  ...including text and multimodal models, and both open... 
    Language
    Worldwide
    Flexible hours

    Oumi

    Palo Alto, CA
    20 hours ago
  • A leading technology company is seeking a Senior Research Scientist to focus on Multimodal Foundation Models and Robotics. This position involves designing AI algorithms for humanoid robots, developing training methods for foundation models, and working with a collaborative... 

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $174k - $252k

    Google Inc. seeks a researcher in AI for healthcare in DeepMind's Frontier Health team. You will develop multimodal AI systems and novel methods, while publishing findings that contribute to patient care. The ideal candidate has a PhD in Computer Science, at least 2 years... 
    Full time

    Google Inc.

    Mountain View, CA
    3 days ago
  • $150k

    A leading AI research lab in Sunnyvale, California, is seeking a Research Scientist specializing in Computer Vision. The role involves advancing innovative computer vision technologies, requiring a Master’s degree and experience with Gen AI techniques. Responsibilities... 

    Institute of Foundation Models

    Sunnyvale, CA
    2 days ago
  • $35 per hour

     ...serves localization teams through AI-enabled multilingual content...  ...enterprises in more than 300 languages, enabling accurate, culturally...  ...(from Canada); be able to research and resolve linguistic questions...  ...Insurance • Dental Insurance • Vision Insurance • FSA and HSA •... 
    Language
    Hourly pay
    Worldwide
    Monday to Friday

    Welocalize

    Cupertino, CA
    20 hours ago
  •  ...foundation models, LLMs, and multimodal LLMs? We are looking...  ...a centralized applied research and engineering...  ...time on-device Computer Vision and Machine Perception...  ...foundation models and agentic AI systems that enable...  ...systems (e.g. vision, language, video, etc.). ~ Proficiency... 
    Language

    Apple

    Sunnyvale, CA
    2 days ago
  • $262k - $365k

    Senior Staff Research Scientist, Google Cloud AI Research Google Sunnyvale, CA, USA Requirements PhD in...  ...deep) learning, data mining, natural language processing, hardware and software...  ...organizing research conferences and visioning activities. Deliver full projects... 
    Language
    Full time
    Worldwide

    Google Inc.

    Sunnyvale, CA
    11 hours ago
  • $165k - $195k

    The Robert Bosch Group is seeking an AI Research Scientist in Sunnyvale, California. This full-time role focuses on leading research in multimodal foundational models to enhance AI functionalities in various Bosch products. The ideal candidate will have a Ph.D. in a relevant... 
    Full time

    Robert Bosch Group

    Sunnyvale, CA
    1 day ago
  •  ...the full stack of a unified multimodal foundation model, from pretraining...  .... This is foundational research with direct physical impact....  ...architectures where vision, language, and kinematics share a unified...  ...consequential work in embodied AI right now, with the people and... 
    Language

    Prime Recruitment Partners

    Sunnyvale, CA
    2 days ago
  • $48 - $63 per hour

    Samsung AI Research Center (SAIC) located in Mountain View, California...  ...research interns for the Vision Intelligence lab. Our goal is...  ...pushing the state-of-the-art in multimodal understanding and generation...  ...of experts, etc. Vision Language Action Models (VLA) for... 
    Language
    Hourly pay
    Internship
    Work at office
    Worldwide

    Samsung Research America (SRA)

    Mountain View, CA
    1 day ago
  • $171.6k - $302.2k

     ...Machine Learning Engineer - Vision Products Software AI is transforming how people interact with...  ...intersection of machine learning research and product engineering—rapidly iterating...  ...architectures; ranging from large language and multimodal models to ranking and... 
    Language
    Relocation

    Apple

    Sunnyvale, CA
    1 day ago
  • $170.7k - $300.2k

     ...and craft upcoming research directions in the field of multimodal foundation models that...  ...deeply technical scientists and engineers to...  ...touching aspects of language and multimodal foundation...  ...Craft technical vision for the team Work...  ...development of cutting-edge AI technologies in the... 
    Language
    Relocation

    Apple Inc.

    Sunnyvale, CA
    1 day ago
  • $147.4k - $272.1k

     ...Applied Machine Learning Research Engineer - Multimodal for Human...  ...foundation and large language models, and many applications...  ...in the computer vision and machine learning...  ...partnering with HW, SW and AI teams to shape Apple...  ...of experts—research scientists, ML engineers,... 
    Language
    Worldwide
    Relocation

    Apple

    Sunnyvale, CA
    1 day ago
  • $181.1k - $318.4k

     ...AIML - Machine Learning Researcher - Multimodal Agent The AIML Multimodal Foundation...  ...cutting-edge generative AI technologies based on multimodal large language models to enable innovative features...  ...applied machine learning, computer vision, multimodal LLM, and agent... 
    Language
    Relocation

    Apple

    Santa Clara, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Research Scientist, Vision-Language Multimodal AI. Be the first to apply!