Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Resident - Learning From Videos (LFV)

$45 - $60 per hour

Toyota Research Institute

At Toyota Research Institute (TRI), we're on a mission to improve the quality of human life. We're developing new tools and capabilities to amplify the human experience. To lead this transformative shift in mobility, we've built a world-class team advancing the state of the art in AI, robotics, driving, and material sciences.

The Team

The Learning From Videos (LFV) team in the Robotics division focuses on the development of foundation models capable of leveraging large-scale multi-modal (RGB, depth, flow, semantics, bounding boxes, tactile, audio, etc) data from multiple domains (driving, robotics, indoors, outdoors, etc) to improve downstream task performance.


Our approach emphasizes training scalability: by learning from multiple modalities, models can develop useful data-driven priors about 3D geometry, physics, and dynamics for world understanding.


Our research interests include, but are not limited to:
  • Video Generation
  • World Models
  • 4D Reconstruction
  • Multi-Modal Models
  • Multi-View Geometry
  • Data Augmentation
  • Video-Language-Action Models
We focus primarily on embodied applications and aim to tackle some of the hardest scientific challenges in spatio-temporal reasoning, enabling autonomous agents to operate in real-world, unstructured environments.

The AI Resident

This year-long AI Residency is a research-focused position designed for early-career researchers and engineers who are excited to work on ambitious problems in embodied AI. The resident will be deeply integrated into the LFV team, contributing to both ongoing and new research efforts in areas including:
  • 4D World Models
  • Physical and Embodied Intelligence
  • Multi-Modal Learning
As an AI Resident, you will collaborate closely with researchers and engineers at TRI on high-risk, pushing forward our understanding of spatio-temporal reasoning and zero-shot generalization. This is a research-focused position, targeting the development of methods and techniques that can solve real-world problems.


We welcome you to join a positive, friendly, and enthusiastic team of researchers, where you will contribute to helping people gain and maintain independence, access, and mobility. We work closely with other Toyota affiliates, and actively collaborate towards research publications and the productization of our developed technologies.

Responsibilities
  • Develop, integrate, and deploy algorithms for Multi-Modal and 4D reasoning targeting physical applications.
  • Handle the ingestion of large-scale datasets for training, including streaming, online, and continual learning.
  • Contribute innovative solutions at the intersection of machine learning, computer vision, and robotics to improve real-world task performance.
  • Work closely with robotics and machine learning researchers and engineers to understand theoretical and practical needs.
  • Follow best practices producing maintainable code, both for internal use as well as for open-sourcing to the scientific community.
  • Contribute to research publications and technical reports.
Qualifications
  • Bachelor's or Master's degree in Computer Science, Electrical Engineering, Robotics, or a related technical field.
  • Exceptional candidates with equivalent research experience (e.g., strong publication record, open-source contributions, or industry research experience) are encouraged to apply.
  • Strong background in computer vision and its applications to robotics and embodied systems.
  • Demonstrated research experience through publications, technical projects, or open-source contributions.
  • Strong communication skills and a collaborative mindset, with the ability to learn quickly and contribute to team research efforts.
  • Passionate about assisting and amplifying older adults and those in need through dexterous manipulation, human-robot collaboration, and physical assistance innovation.
Bonus Qualifications
  • Spatio-temporal (4D) computer vision, including multi-view geometry, 3D/4D reconstruction, video generation, self-supervised learning, occlusion reasoning, etc.
  • Large-scale training of multi-modal deep learning methods, both in terms of dataset sizes and model complexity, context length extension, and efficient attention, distributed computing, etc.
  • Application of machine learning and computer vision to embodied applications.

The pay range for this position at commencement of employment is expected to be between $45 and $60/hour for California-based roles. Base pay offered will depend on multiple individualized factors, including, but not limited to, a candidate's experience, skills, job-related knowledge, and market location. TRI offers a generous benefits package including medical, dental, and vision insurance, and paid time off benefits (including holiday pay and sick time). Additional details regarding these benefit plans will be provided if an employee receives an offer of employment.

Please reference this Candidate Privacy Notice to inform you of the categories of personal information that we collect from individuals who inquire about and/or apply to work for Toyota Research Institute, Inc. or its subsidiaries, including Toyota A.I. Ventures GP, L.P., and the purposes for which we use such personal information.

TRI is fueled by a diverse and inclusive community of people with unique backgrounds, education and life experiences. We are dedicated to fostering an innovative and collaborative environment by living the values that are an essential part of our culture. We believe diversity makes us stronger and are proud to provide Equal Employment Opportunity for all, without regard to an applicant's race, color, creed, gender, gender identity or expression, sexual orientation, national origin, age, physical or mental disability, medical condition, religion, marital status, genetic information, veteran status, or any other status protected under federal, state or local laws.

It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability. Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records for employment.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses and identifying potential inconsistencies or verification signals in application materials based on available information. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the AI Resident - Learning From Videos (LFV) in Los Altos, CA vacancy
  • $100k

     ...a competitive advantage visit the below videos exhibiting at Oracle Cloud World/Oracle Java...  ...analysts/ Data Scientists, and Machine Learning engineers for full-time positions with clients...  ...cycle Knowledge of Statistics, Gen AI, LLM, Python, Computer Vision, data... 
    Resident
    Video
    Full time
    H1b

    SynergisticIT

    Cupertino, CA
    1 day ago
  •  ...Machine Learning Engineer Creatify is building the world's first end-to-end AI advertising agent—a platform that automates the entire video ad lifecycle, from scripting and avatar-led generation to testing, optimization, and publishing across Meta, TikTok, YouTube,... 
    Video

    Creatify AI

    Mountain View, CA
    4 days ago
  • $147k - $211k

     ...building, training, and deploying machine learning models using TensorFlow, JAX, or Adbrain....  ...and algorithms. Experience with generative AI techniques (e.g., LLMs, natural language...  ...including search, display, shopping, travel and video advertising, as well as analytics. Our... 
    Video
    Full time
    Immediate start

    Google Inc.

    Mountain View, CA
    2 days ago
  • $204k - $259k

     ...team that leverages state-of-the-art computer vision, deep learning, and generative AI to automatically analyze driving logs, generate rich scene...  ...for computer vision tasks (e.g., detection, segmentation, video understanding) using Python and frameworks like PyTorch, JAX... 
    Video
    Full time
    Remote work

    Waymo

    Mountain View, CA
    18 hours ago
  • At Rhoda AI, we’re building the next generation of generalist intelligent robots. We own...  ...knowledge to adapt our web-pretrained video model to real robot tasks. Post-training at...  ...policy performance beyond what imitation learning alone achieves — reward design, online data... 
    Video
    Shift work

    Rhoda AI

    Mountain View, CA
    4 days ago
  • $150k

     ...research, nurture the next generation of AI builders, and drive transformative contributions...  ...for high‑performance computing in deep learning, driving impactful discoveries that...  ...environments. Experience with large‑scale video or multimodal data pipelines. Experience building... 
    Video
    Visa sponsorship

    Institute of Foundation Models

    Sunnyvale, CA
    1 day ago
  • $100k

     ...competitive advantage-Please visit the below videos exhibiting at Oracle Cloud world /Oracle...  ...Data Scientists, Data Engineers, Machine Learning engineers for full time positions with...  ...For data Science/Data Analyst/AI/Machine learning Positions REQUIRED SKILLS... 
    Resident
    Video
    Full time

    SynergisticIT

    Palo Alto, CA
    3 days ago
  • $181.1k - $318.4k

    Sunnyvale, California, United States Machine Learning and AI Help define the next generation of photo and video experiences at Apple. The Video Engineering group develops key image and video technologies that contribute to experiences across many Apple products and services... 
    Video
    Relocation

    Apple Inc.

    Sunnyvale, CA
    4 days ago
  • $19 - $65 per hour

    PlusAI is a Physical AI company pioneering AI-based virtual driver software for factory...  ...: Design and train temporal models (e.g., Video Transformers, LSTMs, or 1D-CNNs) to recognize...  ...in classification and supervised learning. Preferred Skills Experience with distributed... 
    Video
    Hourly pay
    Internship

    Plus

    Santa Clara, CA
    4 days ago
  • $2,000 per month

     ...Machine Learning Research Engineer Cupertino, CA Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu)...  ...that would be impossible with GPUs, like real-time video generation models and extremely deep & parallel... 
    Video
    Work at office
    Relocation package

    ETCHED LLC

    Cupertino, CA
    2 days ago
  •  ...Role - Machine Learning Expert (This role is open to US Citizens, Green Card holders, GC-EAD only. We do not sponsor visas.)...  ...machine learning models, with a special emphasis on Generative AI. In this role, you will craft and refine AI-driven solutions, turning... 
    Resident
    Remote work
    Visa sponsorship
    Relocation package

    Adidev Technologies Inc

    Palo Alto, CA
    5 days ago
  • $184k - $287.5k

    We are seeking a Senior Machine Learning Engineer to join our end‑to‑end autonomous driving team...  ...tapping into the unlimited potential of AI to define the next era of computing. An...  ...maintaining high‑quality multimodal datasets (e.g., video, sensor, language/action traces) tailored... 
    Video

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $147.4k - $272.1k

     ...Computer Vision & Machine Learning Engineer Sunnyvale, California, United States Machine Learning and AI Apple is where individual imaginations gather together, committing...  ...of multimodal systems (e.g. vision, language, video, etc.) Proficiency in Python and in a modern deep... 
    Video
    Relocation

    Apple

    Sunnyvale, CA
    3 days ago
  • $181.1k - $318.4k

    Sunnyvale, California, United States Machine Learning and AI We’re starting to see the incredible potential of multimodal foundation and large...  ...Machine Learning Integration Engineer to join our team in the Video Computer Vision group and help us ship cutting edge computer... 
    Video
    Relocation

    Apple

    Sunnyvale, CA
    1 day ago
  •  ..., X-ray fluoroscopy, and real-time camera video streams—to improve surgical navigation and...  ...medical imaging, computer vision, and machine learning. Apply state-of-the-art techniques such as...  ...challenges. Support the development of AI/ML features in alignment with Good Machine... 
    Video

    Intuitive

    Sunnyvale, CA
    2 days ago
  •  ...Stamirowska, a complexity scientist who created a team consisting of AI pioneers, including CTO Jan Chorowski who was the first person...  ...1 or 2 R&D Engineers with a strong track record in machine learning models research . This is an extremely ambitious... 
    Permanent employment
    Full time
    Contract work
    Immediate start
    Remote work
    Flexible hours

    Pathway Genomics

    Palo Alto, CA
    2 days ago
  •  ...Senior AI Engineer Gauss Labs is looking for a passionate and talented AI Engineer to develop cutting-edge Industrial AI solutions...  ...will be responsible for translating cutting-edge AI and machine learning research into robust, scalable software solutions. Your work... 

    Gauss Labs

    Palo Alto, CA
    4 days ago
  • $126.8k - $190.9k

    Sunnyvale, California, United States Machine Learning and AI Apple is where individual imaginations gather together, committing to the values...  ...’ll do more than join something — you’ll add something. The Video Computer Vision organization is working on breakthrough... 
    Video
    Relocation

    Apple

    Sunnyvale, CA
    3 days ago
  • $147.4k - $272.1k

    visionOS, Machine Learning Engineer/Computer Vision Sunnyvale, California, United States Machine Learning and AI Join our machine learning team at Apple to develop computer vision...  ...(RNNs, LSTMs, Transformers for video/time‑series data) Proficiency with CMake... 
    Video
    Relocation

    Apple

    Sunnyvale, CA
    1 day ago
  •  ...Role Overview: As a Machine Learning Engineer, you will play a central role in translating cutting-edge machine learning research into...  ...a focus on building reliable, maintainable, and high-impact AI-driven features that align with Nace.AI's strategic objectives.... 

    Nace AI

    Palo Alto, CA
    5 days ago
  • $126.8k - $220.9k

    Machine Learning Engineer - Visual Agents - Special Projects Cupertino, California, United States Machine Learning and AI Description The Special Projects team at Apple is developing novel...  ...product use cases Experience with video understanding, temporal reasoning, or... 
    Video
    Internship
    Relocation

    Apple

    Cupertino, CA
    5 days ago
  • $198.3k - $342.8k

    Machine Learning Manager, Data for Foundation Models - SIML Cupertino, California, United States Machine Learning and AI Do you believe that generative models can fundamentally change how...  ...visual generative AI for images or videos Track record of strong industrial research... 
    Video
    Relocation

    Apple

    Cupertino, CA
    3 days ago
  • $139k - $155k

     ...for a Research Scientist with deep expertise in quantized deep learning models for hardware acceleration in autonomous systems. In this...  ...with a focus on ML compilers, embedded systems, or hardware‑aware AI. Hands‑on experience with quantized model deployment, ML design... 
    Full time
    Contract work
    Temporary work
    Part time
    Local area
    Shift work

    Rivian

    Palo Alto, CA
    1 day ago
  •  ...Machine Learning Engineer Location: Cupertino, CA ABOUT THIS FEATURED OPPORTUNITY...  ...leveraging LLMs/VLMs to analyze databases, images, video, and enterprise signals to identify...  ...Language Models (VLMs) and multimodal AI systems that process and reason across image... 
    Video
    Local area
    Flexible hours

    INSPYR Solutions

    Cupertino, CA
    4 days ago
  • $106.9k - $229.4k

     ...your wellbeing, and truly belong. What’s in it for you? Constant learning, skill growth, great benefits, and a team that wants you to grow...  .... What you’ll do: Summary: At SAP, we amplify the strength of AI technology, fusing it with our robust industry‑focused data and... 
    Worldwide
    Flexible hours

    SAP Belgium NV/SA

    Palo Alto, CA
    2 days ago
  • $207k - $300k

    Staff Software Engineer, YouTube Ads Machine Learning YouTube Mountain View, CA, USA Bachelor’s degree or equivalent practical experience...  ...products including search, display, shopping, travel and video advertising, as well as analytics. Our teams create trusted experiences... 
    Video
    Full time

    Google Inc.

    Mountain View, CA
    3 days ago
  •  ...Job Description The Role: General Motors is seeking a Staff AI/ML Engineer for the Vehicle Mechatronic Embedded Controls (VMEC)...  ....g., tree-based models, time-series and anomaly detection, deep learning where appropriate) with a focus on reliability, explainability,... 
    Local area
    Remote work
    Work from home
    Relocation
    Relocation package

    General Motors

    Mountain View, CA
    2 days ago
  • $139k - $155k

     ...for a Research Scientist with deep expertise in quantized deep learning models for hardware acceleration in autonomous systems. In this...  ...with a focus on ML compilers, embedded systems, or hardware-aware AI. ● Hands-on experience with quantized model deployment, ML design... 
    Full time
    Contract work
    Temporary work
    Part time
    Local area
    Shift work

    Rivian

    Palo Alto, CA
    2 days ago
  •  ...Deep Learning Field Engineer With the rapid growth of artificial intelligence, more and more...  ...knowledge is required to use cutting edge AI techniques to solve real-world problems....  ...execution systems, safety alert systems and video management systems. Perform... 
    Video
    Work experience placement
    Work at office
    Flexible hours

    Matroid

    Palo Alto, CA
    1 day ago
  • $291.5k - $369.1k

     ...platform designed for hybrid, multi‑cloud environments. Join the AI Models team at Splunk, where we advance the state of AI for high...  ...modeling for both structure and unstructured data, deep learning‑based time series modeling, advanced anomaly detection, and multi... 
    Full time
    Temporary work
    Local area
    Flexible hours

    CISCO, Inc.

    Los Altos, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Resident - Learning From Videos (LFV). Be the first to apply!