Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Resident - Learning From Videos (LFV)

$45 - $60 per hour

Toyota Research Institute

Job Description

Job Description

At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life. We’re developing new tools and capabilities to amplify the human experience. To lead this transformative shift in mobility, we’ve built a world-class team advancing the state of the art in AI, robotics, driving, and material sciences.

The Team

The Learning From Videos (LFV) team in the Robotics division focuses on the development of foundation models capable of leveraging large-scale multi-modal (RGB, depth, flow, semantics, bounding boxes, tactile, audio, etc) data from multiple domains (driving, robotics, indoors, outdoors, etc) to improve downstream task performance. 

Our approach emphasizes training scalability: by learning from multiple modalities, models can develop useful data-driven priors about 3D geometry, physics, and dynamics for world understanding. 

Our research interests include, but are not limited to:

  • Video Generation
  • World Models
  • 4D Reconstruction
  • Multi-Modal Models
  • Multi-View Geometry
  • Data Augmentation
  • Video-Language-Action Models

We focus primarily on embodied applications and aim to tackle some of the hardest scientific challenges in spatio-temporal reasoning, enabling autonomous agents to operate in real-world, unstructured environments.

The AI Resident

This year-long AI Residency is a research-focused position designed for early-career researchers and engineers who are excited to work on ambitious problems in embodied AI. The resident will be deeply integrated into the LFV team, contributing to both ongoing and new research efforts in areas including:

  • 4D World Models
  • Physical and Embodied Intelligence
  • Multi-Modal Learning


As an AI Resident, you will collaborate closely with researchers and engineers at TRI on high-risk, pushing forward our understanding of spatio-temporal reasoning and zero-shot generalization. This is a research-focused position, targeting the development of methods and techniques that can solve real-world problems. 

We welcome you to join a positive, friendly, and enthusiastic team of researchers, where you will contribute to helping people gain and maintain independence, access, and mobility. We work closely with other Toyota affiliates, and actively collaborate towards research publications and the productization of our developed technologies.

Responsibilities
  • Develop, integrate, and deploy algorithms for Multi-Modal and 4D reasoning targeting physical applications.
  • Handle the ingestion of large-scale datasets for training, including streaming, online, and continual learning.
  • Contribute innovative solutions at the intersection of machine learning, computer vision, and robotics to improve real-world task performance.
  • Work closely with robotics and machine learning researchers and engineers to understand theoretical and practical needs.
  • Follow best practices producing maintainable code, both for internal use as well as for open-sourcing to the scientific community.
  • Contribute to research publications and technical reports.
Qualifications

  • Bachelor's or Master’s degree in Computer Science, Electrical Engineering, Robotics, or a related technical field.
  • Exceptional candidates with equivalent research experience (e.g., strong publication record, open-source contributions, or industry research experience) are encouraged to apply.
  • Strong background in computer vision and its applications to robotics and embodied systems.
  • Demonstrated research experience through publications, technical projects, or open-source contributions.
  • Strong communication skills and a collaborative mindset, with the ability to learn quickly and contribute to team research efforts.
  • Passionate about assisting and amplifying older adults and those in need through dexterous manipulation, human-robot collaboration, and physical assistance innovation.
Bonus Qualifications
  • Spatio-temporal (4D) computer vision, including multi-view geometry, 3D/4D reconstruction, video generation, self-supervised learning, occlusion reasoning, etc.
  • Large-scale training of multi-modal deep learning methods, both in terms of dataset sizes and model complexity, context length extension, and efficient attention, distributed computing, etc.
  • Application of machine learning and computer vision to embodied applications.

The pay range for this position at commencement of employment is expected to be between $45 and $60/hour for California-based roles. Base pay offered will depend on multiple individualized factors, including, but not limited to, a candidate's experience, skills, job-related knowledge, and market location. TRI offers a generous benefits package including medical, dental, and vision insurance, and paid time off benefits (including holiday pay and sick time). Additional details regarding these benefit plans will be provided if an employee receives an offer of employment.

Please reference this Candidate Privacy Notice to inform you of the categories of personal information that we collect from individuals who inquire about and/or apply to work for Toyota Research Institute, Inc. or its subsidiaries, including Toyota A.I. Ventures GP, L.P., and the purposes for which we use such personal information.

 

TRI is fueled by a diverse and inclusive community of people with unique backgrounds, education and life experiences. We are dedicated to fostering an innovative and collaborative environment by living the values that are an essential part of our culture. We believe diversity makes us stronger and are proud to provide Equal Employment Opportunity for all, without regard to an applicant’s race, color, creed, gender, gender identity or expression, sexual orientation, national origin, age, physical or mental disability, medical condition, religion, marital status, genetic information, veteran status, or any other status protected under federal, state or local laws.

 

It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability. Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records for employment.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Vacancy posted 14 days ago
Similar jobs that could be interesting for youBased on the AI Resident - Learning From Videos (LFV) in Los Altos, CA vacancy
  • $45 - $60 per hour

     ...class team advancing the state of the art in AI, robotics, driving, and material sciences. The Team The Learning From Videos (LFV) team in the Robotics division focuses on...  ...‑world, unstructured environments. The AI Resident This year‑long AI Residency is a research‑focused... 
    Resident
    Video
    Local area
    Shift work

    Toyota Research Institute

    Los Altos, CA
    3 days ago
  • $45 - $60 per hour

    Toyota Research Institute is seeking an AI Resident to join its Learning From Videos team in Los Altos, California. This year-long position is geared towards early-career researchers excited about tackling ambitious problems in embodied AI. Responsibilities include developing... 
    Resident
    Video
    Hourly pay

    Toyota Research Institute

    Los Altos, CA
    3 days ago
  • $204k - $259k

     ...Senior Machine Learning Engineer, Computer Vision/VLM Waymo is an autonomous driving technology...  ...vision, deep learning, and generative AI to automatically analyze driving logs,...  ...vision tasks (e.g., detection, segmentation, video understanding) using Python and frameworks... 
    Video
    Full time
    Remote work

    Waymo

    Mountain View, CA
    2 days ago
  • $140k - $390k

     ...Expect Tesla is building the future of AI systems that operate in the real world where...  ...are non-negotiable. As a Machine Learning Integration Engineer, you'll be at the forefront...  ...Combine multiple data sources, including video, audio, thermal, mechanical, weather, and... 
    Video
    Hourly pay
    Full time
    Temporary work
    Worldwide
    Flexible hours

    Tesla

    Palo Alto, CA
    2 days ago
  • $100k

     ...a competitive advantage visit the below videos exhibiting at Oracle Cloud World/Oracle Java...  ...analysts/ Data Scientists, and Machine Learning engineers for full-time positions with clients...  ...cycle Knowledge of Statistics, Gen AI, LLM, Python, Computer Vision, data... 
    Resident
    Video
    Full time
    H1b

    SynergisticIT

    Cupertino, CA
    1 day ago
  •  ...Machine Learning Engineer Creatify is building the world's first end-to-end AI advertising agent—a platform that automates the entire video ad lifecycle, from scripting and avatar-led generation to testing, optimization, and publishing across Meta, TikTok, YouTube,... 
    Video

    Creatify AI

    Mountain View, CA
    4 days ago
  • $204k - $259k

     ...behavior of the Waymo Driver.  The DUE Machine Learning team will build and operate scalable...  .... Build and scale production-ready video generation techniques (e.g., Diffusion, Flow...  ...research in 4D world modeling and generative AI into robust, production-ready solutions.... 
    Video
    Full time

    Waymo

    Mountain View, CA
    3 days ago
  • $276k - $414k

     ...to express themselves, live in the moment, learn about the world, and have fun together....  ...scale recommender systems for all of Snap’s video content products. What you’ll do Lead the...  ...sound judgment Experience contributing to AI publications If you have a disability or special... 
    Video
    Live in
    Work at office
    Local area

    Snap

    Palo Alto, CA
    5 hours ago
  •  ...At 42dot, our team of skilled Machine Learning Engineers is dedicated to pioneering cutting...  ..., ICRA, etc.) \n Please refer to the videos from KCCV 2022 and UMOS Day 2021 for insights...  ...Driving, our autonomous driving AI software. Please upload all submission... 
    Video
    Full time

    42dot

    Mountain View, CA
    22 hours ago
  • $147k - $211k

     ...building, training, and deploying machine learning models using TensorFlow, JAX, or Adbrain....  ...and algorithms. Experience with generative AI techniques (e.g., LLMs, natural language...  ...including search, display, shopping, travel and video advertising, as well as analytics. Our... 
    Video
    Full time
    Immediate start

    Google Inc.

    Mountain View, CA
    2 days ago
  • Rhoda ai in Palo Alto, California, is seeking Research Scientists and Research Engineers to work on adapting web-pretrained video models for robotic tasks. Ideal candidates will possess deep robotics...  ...and experience with reinforcement learning and robot systems. This role... 
    Video

    Rhoda ai

    Palo Alto, CA
    22 hours ago
  •  ...domain knowledge to adapt our web-pretrained video model to real robot tasks. Post-training...  ...policy performance beyond what imitation learning alone achieves — reward design, online data...  ..., self‑driving, or similar physical AI domains). Strong understanding of robot policy... 
    Video
    Shift work

    Rhoda ai

    Palo Alto, CA
    2 days ago
  • At Rhoda AI, we're building the full-stack foundation for the next generation of humanoid...  ...hardware to the foundational models and video world models that control it. Our robots are...  ...work at the intersection of large-scale learning, robotics, and systems, with a research team... 
    Video

    Rhoda AI

    Palo Alto, CA
    1 day ago
  • $175k - $275k

     ...About Abaka AI   Abaka AI is built on one mission: to be the world’s most trusted...  ...catalog of off-the-shelf datasets (image, video, multimodal, reasoning, 3D, and beyond) as...  ...Role   We’re hiring our first Machine Learning Engineer in the United States, a foundational... 
    Video
    Full time
    Immediate start
    Flexible hours

    Abaka Ai

    Palo Alto, CA
    22 hours ago
  • $140.7k - $223.4k

     ...Mountain View, CA, USA Senior Machine Learning Engineer, Advertiser Growth Location Mountain View, CA, USA Department AI & Machine Learning Requisition ID JOBREQ...  ...the creation of high-performing image and video assets tailored to specific campaign goals... 
    Video
    Work at office
    Worldwide
    Relocation package

    Unity Technologies

    Mountain View, CA
    5 days ago
  • $109k - $157k

     ...Rapidly develop prototypes of artificial intelligence, machine learning models applied to sustainability and finance problems Apply...  ...on project team needs and your availability Throughout your AI Residency you can expect: To be embedded into one of our confidential or... 
    Resident
    Internship
    Flexible hours

    X Development, LLC

    Mountain View, CA
    2 days ago
  • $166k - $244k

    Senior Software Developer, Machine Learning Mid Apply Note: By applying to this position you...  ...Experience in one or more of the following: video compression, video processing, video...  ...communication. Experience in Generative AI (Large Language Models, Multi-Modal, Large... 
    Video
    Full time

    Google Inc.

    Mountain View, CA
    22 hours ago
  • $109k - $157k

     ...generation of developer tools in the geospatial-AI domain. This project aims to...  ...in your academic program throughout the residency. Throughout your AI Residency you...  ...Consumer). Be willing and excited to learn new concepts and tactics across the full... 
    Resident
    Summer work
    Internship
    Flexible hours

    X: The Moonshot Factory

    Mountain View, CA
    2 days ago
  •  ...the best At 42dot, our Senior Machine Learning Engineers conduct research and development...  ...tailored for label generation from sensor and video data in autonomous driving contexts....  ...Autonomous Driving, our autonomous driving AI software. Please upload all submission... 
    Video
    Full time

    42dot

    Mountain View, CA
    22 hours ago
  •  ...Machine Learning System Hardware Architect Sunnyvale, CA Do you want to be part of the AI revolution? Do you want to think out of the box, thriving on challenges in the...  ...and power optimizations Familiarity with video, DSP, Ethernet, and PCIe MS or PhD in Electrical... 
    Video
    Work at office

    Baidu

    Sunnyvale, CA
    3 days ago
  • $181.1k - $318.4k

     ...Machine Learning - Data Scientist Lead Do you have a passion for computer vision and deep...  ...latest advances in multimodal models? The Video Engineering Data Analytics and Quality group...  ...quality of some of Apple's most exciting AI experiences. Responsibilities Lead... 
    Video
    Relocation

    Apple

    Sunnyvale, CA
    22 hours ago
  • $150k

     ...research, nurture the next generation of AI builders, and drive transformative contributions...  ...for high-performance computing in deep learning, driving impactful discoveries that...  ...environments. ~ Experience with large-scale video or multimodal data pipelines. ~... 
    Video
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    4 days ago
  • $181.1k - $318.4k

     ...Machine Learning Engineer, Video Engineering Help define the next generation of photo and video experiences at Apple. The Video Engineering group...  ...to shape upcoming products at the frontier of generative AI, computer vision, and 3D understanding, with a strong emphasis... 
    Video
    Relocation

    Apple

    Sunnyvale, CA
    3 days ago
  • $100k

     ...a competitive advantage visit the below videos exhibiting at Oracle Cloud World/Oracle Java...  ...analysts/ Data Scientists, and Machine Learning engineers for full-time positions with clients...  ...cycle Knowledge of Statistics, Gen AI, LLM, Python, Computer Vision, data... 
    Resident
    Video
    Full time
    H1b

    SynergisticIT

    Palo Alto, CA
    1 day ago
  •  ...applications in the computer vision and machine learning domain that previously appeared infeasible...  ...Engineer to join our team in the Video Computer Vision group and help us ship cutting...  ...stack, and partnering with HW, SW and AI teams to shape Apple's products and bring... 
    Video

    Apple

    Sunnyvale, CA
    1 day ago
  • $111.07k - $166.4k

     ...Across enterprise, cloud and AI, and carrier architectures, our...  ...Marvell is a place to thrive, learn, and lead. The ever-growing...  ...driven by AI, cloud services, video streaming, and 5G wireless megatrends...  ...S. citizens, lawful permanent residents, or protected individuals as... 
    Resident
    Video
    Permanent employment
    Internship
    Work from home

    Marvell

    Santa Clara, CA
    2 days ago
  • $147.4k - $272.1k

     ...Applied Machine Learning Research Engineer - Multimodal for Human Understanding We're starting...  ...Research Engineer to join our team in the Video Computer Vision group and help us push the...  ...stack, and partnering with HW, SW and AI teams to shape Apple's products and bring... 
    Video
    Worldwide
    Relocation

    Apple

    Sunnyvale, CA
    22 hours ago
  •  ...engineers, and operators building an AI platform to make flying safer,...  ...they build, ship quickly, and learn fast, pushing the boundaries...  ...systems. Experience with video streaming technologies and...  ...card holders, lawful permanent residents, or individuals granted asylum... 
    Resident
    Video
    Permanent employment
    Full time
    Local area
    Remote work
    3 days per week

    BeaconAI

    San Carlos, CA
    4 days ago
  • $184k - $287.5k

     ...Isaac team is looking for a robotics deep learning expert to help with this revolution. In this...  ...space of geometric computer vision and video/vision-language models. Deploy algorithms...  ...related fields. Publications in top‑tier AI conferences or contributions to open‑source... 
    Video

    NLP PEOPLE

    Santa Clara, CA
    3 days ago
  • $126.8k - $190.9k

     ...Computer Vision / Machine Learning Engineer - 3D Reconstruction Apple is where individual...  ...join something — you'll add something. The Video Computer Vision organization is working on...  ...products. Our team delivers cutting-edge AI, machine learning, computer vision and graphics... 
    Video
    Relocation

    Apple

    Sunnyvale, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Resident - Learning From Videos (LFV). Be the first to apply!