AI Resident - Learning From Videos (LFV)

$45 - $60 per hour

Toyota Research Institute

At Toyota Research Institute (TRI), we're on a mission to improve the quality of human life. We're developing new tools and capabilities to amplify the human experience. To lead this transformative shift in mobility, we've built a world-class team advancing the state of the art in AI, robotics, driving, and material sciences.

The Team

The Learning From Videos (LFV) team in the Robotics division develops foundation models that leverage large-scale multi-modal data (RGB, depth, flow, semantics, bounding boxes, tactile, audio, etc.) from multiple domains (driving, robotics, indoors, outdoors, etc.) to improve downstream task performance.

Our approach emphasizes training scalability: by learning from multiple modalities, models can develop useful data-driven priors about 3D geometry, physics, and dynamics for world understanding.

Our research interests include, but are not limited to:

Video Generation
World Models
4D Reconstruction
Multi-Modal Models
Multi-View Geometry
Data Augmentation
Video-Language-Action Models

We focus primarily on embodied applications and aim to tackle some of the hardest scientific challenges in spatio-temporal reasoning, enabling autonomous agents to operate in real-world, unstructured environments.

The AI Resident

This year-long AI Residency is a research-focused position designed for early-career researchers and engineers who are excited to work on ambitious problems in embodied AI. The resident will be deeply integrated into the LFV team, contributing to both ongoing and new research efforts in areas including:

4D World Models
Physical and Embodied Intelligence
Multi-Modal Learning

As an AI Resident, you will collaborate closely with researchers and engineers at TRI on high-risk research, pushing forward our understanding of spatio-temporal reasoning and zero-shot generalization. The position targets the development of methods and techniques that can solve real-world problems.

We welcome you to join a positive, friendly, and enthusiastic team of researchers, where you will contribute to helping people gain and maintain independence, access, and mobility. We work closely with other Toyota affiliates, and actively collaborate towards research publications and the productization of our developed technologies.

Responsibilities

Develop, integrate, and deploy algorithms for Multi-Modal and 4D reasoning targeting physical applications.
Handle the ingestion of large-scale datasets for training, including streaming, online, and continual learning.
Contribute innovative solutions at the intersection of machine learning, computer vision, and robotics to improve real-world task performance.
Work closely with robotics and machine learning researchers and engineers to understand theoretical and practical needs.
Follow best practices for producing maintainable code, both for internal use and for open-sourcing to the scientific community.
Contribute to research publications and technical reports.

Qualifications

Bachelor's or Master's degree in Computer Science, Electrical Engineering, Robotics, or a related technical field.
Exceptional candidates with equivalent research experience (e.g., strong publication record, open-source contributions, or industry research experience) are encouraged to apply.
Strong background in computer vision and its applications to robotics and embodied systems.
Demonstrated research experience through publications, technical projects, or open-source contributions.
Strong communication skills and a collaborative mindset, with the ability to learn quickly and contribute to team research efforts.
Passionate about assisting and amplifying older adults and those in need through dexterous manipulation, human-robot collaboration, and physical assistance innovation.

Bonus Qualifications

Spatio-temporal (4D) computer vision, including multi-view geometry, 3D/4D reconstruction, video generation, self-supervised learning, occlusion reasoning, etc.
Large-scale training of multi-modal deep learning methods, in terms of both dataset size and model complexity, including context length extension, efficient attention, distributed computing, etc.
Application of machine learning and computer vision to embodied applications.

The pay range for this position at commencement of employment is expected to be between $45 and $60/hour for California-based roles. Base pay offered will depend on multiple individualized factors, including, but not limited to, a candidate's experience, skills, job-related knowledge, and market location. TRI offers a generous benefits package including medical, dental, and vision insurance, and paid time off benefits (including holiday pay and sick time). Additional details regarding these benefit plans will be provided if an employee receives an offer of employment.

Please reference this Candidate Privacy Notice to inform you of the categories of personal information that we collect from individuals who inquire about and/or apply to work for Toyota Research Institute, Inc. or its subsidiaries, including Toyota A.I. Ventures GP, L.P., and the purposes for which we use such personal information.

TRI is fueled by a diverse and inclusive community of people with unique backgrounds, education and life experiences. We are dedicated to fostering an innovative and collaborative environment by living the values that are an essential part of our culture. We believe diversity makes us stronger and are proud to provide Equal Employment Opportunity for all, without regard to an applicant's race, color, creed, gender, gender identity or expression, sexual orientation, national origin, age, physical or mental disability, medical condition, religion, marital status, genetic information, veteran status, or any other status protected under federal, state or local laws.

It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability. Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records for employment.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Vacancy posted 3 days ago
