Multimodal LLM Researcher (MLLM)

Pika

About the Role

At Pika, we are pioneering next-generation creative infrastructure built around real-time, multimodal generation and intelligent, agentic platforms. We are seeking accomplished Multimodal LLM Researchers (LLM, VLM, and Audio LM) to drive forward our mission to make agentic real-time generative technology accessible, dynamic, and transformative for millions of creators.

As a core member of our research team, you will be integral to designing and building foundational technologies, developing novel approaches for large multimodal language models (LLMs/VLMs/Audio LMs), and orchestrating intelligent agentic systems that power scalable, interactive multimedia experiences. You will collaborate closely with engineering and product teams, shaping the future of real-time creative platforms.

What You'll Do

- Lead and contribute to research efforts focused on real-time, multimodal generation (including text, image, video, and audio synthesis) as well as orchestration of agentic platform infrastructure
- Design and prototype novel algorithms and architectures for high-fidelity, real-time multimodal synthesis and interactive experiences
- Focus on real-time aspects of model inference and synthesis across modalities
- Work on diffusion model distillation and/or develop diffusion-based world models for multimodal applications
- Train and finetune autoregressive and diffusion models in LLM, VLM, or Audio LM contexts with a focus on real-time performance
- Curate specific datasets, especially for video, audio, cross-modal, and sensory-rich data
- Collaborate with cross-functional teams to bring research advancements into production-ready technologies
- Publish work in top-tier conferences and journals; communicate research results internally and externally
- Stay at the cutting edge of real-time multimodal generative AI and agentic orchestration

What We're Looking For

- 5+ years of relevant experience, including research during graduate studies, in large language models, vision-language models, audio language models, deep learning, or related fields
- Demonstrated impact as first author on major publications in top conferences or journals (e.g., NeurIPS, CVPR, ICML, ICCV, SIGGRAPH, Interspeech)
- Deep expertise in at least one area: language modeling (LLM), vision-language modeling (VLM), or audio language modeling (Audio LM)
- Strong experience with generative models, including autoregressive and diffusion models, and their real-time deployment
- Hands-on experience curating, constructing, or augmenting large, high-quality multimodal datasets
- Experience developing and deploying real-time systems and/or agentic orchestration infrastructure
- Strong programming and prototyping skills (Python, PyTorch, TensorFlow, etc.)
- Passion for building creative tools and platforms that empower users
- Excellent communication and collaboration skills

What We Offer

- Competitive salary and substantial equity in a high-growth startup
- Full health benefits, 401(k) matching, and more
- Collaborative, mission-driven team environment with major growth opportunities
- Flexible on-site/remote hybrid (HQ in Palo Alto, CA)

About Pika

Pika empowers creators by building state-of-the-art agentic and multimedia platforms. Our vision is to break down technical barriers to creativity, making real-time generative and intelligent orchestration accessible to all. Join us and shape the next evolution of creative technology!

If you are a leading researcher excited by real-time multimodal AI and agentic platforms, we want to hear from you.

Vacancy posted 4 days ago

