AI Resident - Learning From Videos (LFV)
$45 - $60 per hourToyota Research Institute
Job Description
Job Description
At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life. We’re developing new tools and capabilities to amplify the human experience. To lead this transformative shift in mobility, we’ve built a world-class team advancing the state of the art in AI, robotics, driving, and material sciences.
The Team
The Learning From Videos (LFV) team in the Robotics division focuses on the development of foundation models capable of leveraging large-scale multi-modal (RGB, depth, flow, semantics, bounding boxes, tactile, audio, etc) data from multiple domains (driving, robotics, indoors, outdoors, etc) to improve downstream task performance.
Our approach emphasizes training scalability: by learning from multiple modalities, models can develop useful data-driven priors about 3D geometry, physics, and dynamics for world understanding.
Our research interests include, but are not limited to:
- Video Generation
- World Models
- 4D Reconstruction
- Multi-Modal Models
- Multi-View Geometry
- Data Augmentation
- Video-Language-Action Models
We focus primarily on embodied applications and aim to tackle some of the hardest scientific challenges in spatio-temporal reasoning, enabling autonomous agents to operate in real-world, unstructured environments.
The AI Resident
This year-long AI Residency is a research-focused position designed for early-career researchers and engineers who are excited to work on ambitious problems in embodied AI. The resident will be deeply integrated into the LFV team, contributing to both ongoing and new research efforts in areas including:
- 4D World Models
- Physical and Embodied Intelligence
- Multi-Modal Learning
As an AI Resident, you will collaborate closely with researchers and engineers at TRI on high-risk, pushing forward our understanding of spatio-temporal reasoning and zero-shot generalization. This is a research-focused position, targeting the development of methods and techniques that can solve real-world problems.
We welcome you to join a positive, friendly, and enthusiastic team of researchers, where you will contribute to helping people gain and maintain independence, access, and mobility. We work closely with other Toyota affiliates, and actively collaborate towards research publications and the productization of our developed technologies.
Responsibilities- Develop, integrate, and deploy algorithms for Multi-Modal and 4D reasoning targeting physical applications.
- Handle the ingestion of large-scale datasets for training, including streaming, online, and continual learning.
- Contribute innovative solutions at the intersection of machine learning, computer vision, and robotics to improve real-world task performance.
- Work closely with robotics and machine learning researchers and engineers to understand theoretical and practical needs.
- Follow best practices producing maintainable code, both for internal use as well as for open-sourcing to the scientific community.
- Contribute to research publications and technical reports.
- Bachelor's or Master’s degree in Computer Science, Electrical Engineering, Robotics, or a related technical field.
- Exceptional candidates with equivalent research experience (e.g., strong publication record, open-source contributions, or industry research experience) are encouraged to apply.
- Strong background in computer vision and its applications to robotics and embodied systems.
- Demonstrated research experience through publications, technical projects, or open-source contributions.
- Strong communication skills and a collaborative mindset, with the ability to learn quickly and contribute to team research efforts.
- Passionate about assisting and amplifying older adults and those in need through dexterous manipulation, human-robot collaboration, and physical assistance innovation.
- Spatio-temporal (4D) computer vision, including multi-view geometry, 3D/4D reconstruction, video generation, self-supervised learning, occlusion reasoning, etc.
- Large-scale training of multi-modal deep learning methods, both in terms of dataset sizes and model complexity, context length extension, and efficient attention, distributed computing, etc.
- Application of machine learning and computer vision to embodied applications.
The pay range for this position at commencement of employment is expected to be between $45 and $60/hour for California-based roles. Base pay offered will depend on multiple individualized factors, including, but not limited to, a candidate's experience, skills, job-related knowledge, and market location. TRI offers a generous benefits package including medical, dental, and vision insurance, and paid time off benefits (including holiday pay and sick time). Additional details regarding these benefit plans will be provided if an employee receives an offer of employment.
Please reference this Candidate Privacy Notice to inform you of the categories of personal information that we collect from individuals who inquire about and/or apply to work for Toyota Research Institute, Inc. or its subsidiaries, including Toyota A.I. Ventures GP, L.P., and the purposes for which we use such personal information.
TRI is fueled by a diverse and inclusive community of people with unique backgrounds, education and life experiences. We are dedicated to fostering an innovative and collaborative environment by living the values that are an essential part of our culture. We believe diversity makes us stronger and are proud to provide Equal Employment Opportunity for all, without regard to an applicant’s race, color, creed, gender, gender identity or expression, sexual orientation, national origin, age, physical or mental disability, medical condition, religion, marital status, genetic information, veteran status, or any other status protected under federal, state or local laws.
It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability. Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records for employment.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
$45 - $60 per hour
...class team advancing the state of the art in AI, robotics, driving, and material sciences. The Team The Learning From Videos (LFV) team in the Robotics division focuses on... ...‑world, unstructured environments. The AI Resident This year‑long AI Residency is a research‑focused...ResidentVideoLocal areaShift work$45 - $60 per hour
Toyota Research Institute is seeking an AI Resident to join its Learning From Videos team in Los Altos, California. This year-long position is geared towards early-career researchers excited about tackling ambitious problems in embodied AI. Responsibilities include developing...ResidentVideoHourly pay$204k - $259k
...Senior Machine Learning Engineer, Computer Vision/VLM Waymo is an autonomous driving technology... ...vision, deep learning, and generative AI to automatically analyze driving logs,... ...vision tasks (e.g., detection, segmentation, video understanding) using Python and frameworks...VideoFull timeRemote work$140k - $390k
...Expect Tesla is building the future of AI systems that operate in the real world where... ...are non-negotiable. As a Machine Learning Integration Engineer, you'll be at the forefront... ...Combine multiple data sources, including video, audio, thermal, mechanical, weather, and...VideoHourly payFull timeTemporary workWorldwideFlexible hours$100k
...a competitive advantage visit the below videos exhibiting at Oracle Cloud World/Oracle Java... ...analysts/ Data Scientists, and Machine Learning engineers for full-time positions with clients... ...cycle Knowledge of Statistics, Gen AI, LLM, Python, Computer Vision, data...ResidentVideoFull timeH1b- ...Machine Learning Engineer Creatify is building the world's first end-to-end AI advertising agent—a platform that automates the entire video ad lifecycle, from scripting and avatar-led generation to testing, optimization, and publishing across Meta, TikTok, YouTube,...Video
$204k - $259k
...behavior of the Waymo Driver. The DUE Machine Learning team will build and operate scalable... .... Build and scale production-ready video generation techniques (e.g., Diffusion, Flow... ...research in 4D world modeling and generative AI into robust, production-ready solutions....VideoFull time$276k - $414k
...to express themselves, live in the moment, learn about the world, and have fun together.... ...scale recommender systems for all of Snap’s video content products. What you’ll do Lead the... ...sound judgment Experience contributing to AI publications If you have a disability or special...VideoLive inWork at officeLocal area- ...At 42dot, our team of skilled Machine Learning Engineers is dedicated to pioneering cutting... ..., ICRA, etc.) \n Please refer to the videos from KCCV 2022 and UMOS Day 2021 for insights... ...Driving, our autonomous driving AI software. Please upload all submission...VideoFull time
$147k - $211k
...building, training, and deploying machine learning models using TensorFlow, JAX, or Adbrain.... ...and algorithms. Experience with generative AI techniques (e.g., LLMs, natural language... ...including search, display, shopping, travel and video advertising, as well as analytics. Our...VideoFull timeImmediate start- Rhoda ai in Palo Alto, California, is seeking Research Scientists and Research Engineers to work on adapting web-pretrained video models for robotic tasks. Ideal candidates will possess deep robotics... ...and experience with reinforcement learning and robot systems. This role...Video
- ...domain knowledge to adapt our web-pretrained video model to real robot tasks. Post-training... ...policy performance beyond what imitation learning alone achieves — reward design, online data... ..., self‑driving, or similar physical AI domains). Strong understanding of robot policy...VideoShift work
- At Rhoda AI, we're building the full-stack foundation for the next generation of humanoid... ...hardware to the foundational models and video world models that control it. Our robots are... ...work at the intersection of large-scale learning, robotics, and systems, with a research team...Video
$175k - $275k
...About Abaka AI Abaka AI is built on one mission: to be the world’s most trusted... ...catalog of off-the-shelf datasets (image, video, multimodal, reasoning, 3D, and beyond) as... ...Role We’re hiring our first Machine Learning Engineer in the United States, a foundational...VideoFull timeImmediate startFlexible hours$140.7k - $223.4k
...Mountain View, CA, USA Senior Machine Learning Engineer, Advertiser Growth Location Mountain View, CA, USA Department AI & Machine Learning Requisition ID JOBREQ... ...the creation of high-performing image and video assets tailored to specific campaign goals...VideoWork at officeWorldwideRelocation package$109k - $157k
...Rapidly develop prototypes of artificial intelligence, machine learning models applied to sustainability and finance problems Apply... ...on project team needs and your availability Throughout your AI Residency you can expect: To be embedded into one of our confidential or...ResidentInternshipFlexible hours$166k - $244k
Senior Software Developer, Machine Learning Mid Apply Note: By applying to this position you... ...Experience in one or more of the following: video compression, video processing, video... ...communication. Experience in Generative AI (Large Language Models, Multi-Modal, Large...VideoFull time$109k - $157k
...generation of developer tools in the geospatial-AI domain. This project aims to... ...in your academic program throughout the residency. Throughout your AI Residency you... ...Consumer). Be willing and excited to learn new concepts and tactics across the full...ResidentSummer workInternshipFlexible hours- ...the best At 42dot, our Senior Machine Learning Engineers conduct research and development... ...tailored for label generation from sensor and video data in autonomous driving contexts.... ...Autonomous Driving, our autonomous driving AI software. Please upload all submission...VideoFull time
- ...Machine Learning System Hardware Architect Sunnyvale, CA Do you want to be part of the AI revolution? Do you want to think out of the box, thriving on challenges in the... ...and power optimizations Familiarity with video, DSP, Ethernet, and PCIe MS or PhD in Electrical...VideoWork at office
$181.1k - $318.4k
...Machine Learning - Data Scientist Lead Do you have a passion for computer vision and deep... ...latest advances in multimodal models? The Video Engineering Data Analytics and Quality group... ...quality of some of Apple's most exciting AI experiences. Responsibilities Lead...VideoRelocation$150k
...research, nurture the next generation of AI builders, and drive transformative contributions... ...for high-performance computing in deep learning, driving impactful discoveries that... ...environments. ~ Experience with large-scale video or multimodal data pipelines. ~...VideoVisa sponsorship$181.1k - $318.4k
...Machine Learning Engineer, Video Engineering Help define the next generation of photo and video experiences at Apple. The Video Engineering group... ...to shape upcoming products at the frontier of generative AI, computer vision, and 3D understanding, with a strong emphasis...VideoRelocation$100k
...a competitive advantage visit the below videos exhibiting at Oracle Cloud World/Oracle Java... ...analysts/ Data Scientists, and Machine Learning engineers for full-time positions with clients... ...cycle Knowledge of Statistics, Gen AI, LLM, Python, Computer Vision, data...ResidentVideoFull timeH1b- ...applications in the computer vision and machine learning domain that previously appeared infeasible... ...Engineer to join our team in the Video Computer Vision group and help us ship cutting... ...stack, and partnering with HW, SW and AI teams to shape Apple's products and bring...Video
$111.07k - $166.4k
...Across enterprise, cloud and AI, and carrier architectures, our... ...Marvell is a place to thrive, learn, and lead. The ever-growing... ...driven by AI, cloud services, video streaming, and 5G wireless megatrends... ...S. citizens, lawful permanent residents, or protected individuals as...ResidentVideoPermanent employmentInternshipWork from home$147.4k - $272.1k
...Applied Machine Learning Research Engineer - Multimodal for Human Understanding We're starting... ...Research Engineer to join our team in the Video Computer Vision group and help us push the... ...stack, and partnering with HW, SW and AI teams to shape Apple's products and bring...VideoWorldwideRelocation- ...engineers, and operators building an AI platform to make flying safer,... ...they build, ship quickly, and learn fast, pushing the boundaries... ...systems. Experience with video streaming technologies and... ...card holders, lawful permanent residents, or individuals granted asylum...ResidentVideoPermanent employmentFull timeLocal areaRemote work3 days per week
$184k - $287.5k
...Isaac team is looking for a robotics deep learning expert to help with this revolution. In this... ...space of geometric computer vision and video/vision-language models. Deploy algorithms... ...related fields. Publications in top‑tier AI conferences or contributions to open‑source...Video$126.8k - $190.9k
...Computer Vision / Machine Learning Engineer - 3D Reconstruction Apple is where individual... ...join something — you'll add something. The Video Computer Vision organization is working on... ...products. Our team delivers cutting-edge AI, machine learning, computer vision and graphics...VideoRelocation
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Resident - Learning From Videos (LFV). Be the first to apply!


