
Multimodal LLM Researcher (MLLM)

Neura Market

About the Role

At Pika, we are pioneering next-generation creative infrastructure built around real-time, multimodal generation and intelligent, agentic platforms. We are seeking accomplished Multimodal LLM Researchers (LLM, VLM, and Audio LM) to drive forward our mission to make agentic real-time generative technology accessible, dynamic, and transformative for millions of creators.

As a core member of our research team, you will be integral to designing and building foundational technologies, developing novel approaches for large multimodal language models (LLMs/VLMs/Audio LMs), and orchestrating intelligent agentic systems that power scalable, interactive multimedia experiences. You will collaborate closely with engineering and product teams, shaping the future of real-time creative platforms.

What You'll Do
  • Lead and contribute to research efforts focused on real-time, multimodal generation (including text, image, video, and audio synthesis) as well as orchestration of agentic platform infrastructure
  • Design and prototype novel algorithms and architectures for high-fidelity, real-time multimodal synthesis and interactive experiences
  • Focus on real-time aspects of model inference and synthesis across modalities
  • Work on diffusion model distillation and/or develop diffusion-based world models for multimodal applications
  • Train and finetune autoregressive and diffusion models in LLM, VLM, or Audio LM contexts with a focus on real-time performance
  • Curate specific datasets, especially for video, audio, cross-modal, and sensory-rich data
  • Collaborate with cross-functional teams to bring research advancements into production-ready technologies
  • Publish work in top-tier conferences and journals; communicate research results internally and externally
  • Stay at the cutting edge of real-time multimodal generative AI and agentic orchestration

What We're Looking For
  • 5+ years of relevant experience, including research during graduate studies, in large language models, vision-language models, audio language models, deep learning, or related fields
  • Demonstrated impact as first author on major publications in top conferences or journals (e.g., NeurIPS, CVPR, ICML, ICCV, SIGGRAPH, Interspeech, etc.)
  • Deep expertise in at least one area: language modeling (LLM), vision-language modeling (VLM), or audio language modeling (Audio LM)
  • Strong experience with generative models, including autoregressive and diffusion models, and their real-time deployment
  • Hands-on experience curating, constructing, or augmenting large, high-quality multimodal datasets
  • Experience developing and deploying real-time systems and/or agentic orchestration infrastructure
  • Strong programming and prototyping skills (Python, PyTorch, TensorFlow, etc.)
  • Passion for building creative tools and platforms that empower users
  • Excellent communication and collaboration skills

What We Offer
  • Competitive salary and substantial equity in a high-growth startup
  • Full health benefits + 401k matching and more
  • Collaborative, mission-driven team environment with major growth opportunities
  • Flexible on-site/remote hybrid (HQ in Palo Alto, CA)

About Pika

Pika empowers creators by building state-of-the-art agentic and multimedia platforms. Our vision is to break down technical barriers to creativity, making real-time generative and intelligent orchestration accessible to all. Join us and shape the next evolution of creative technology!

If you are a leading researcher excited by real-time multimodal AI and agentic platforms, we want to hear from you.

Vacancy posted 1 day ago
