AI Resident - Learning From Videos (LFV)
$45 - $60 per hour
Toyota Research Institute
At Toyota Research Institute (TRI), we're on a mission to improve the quality of human life. We're developing new tools and capabilities to amplify the human experience. To lead this transformative shift in mobility, we've built a world-class team advancing the state of the art in AI, robotics, driving, and material sciences.
The Team
The Learning From Videos (LFV) team in the Robotics division focuses on the development of foundation models capable of leveraging large-scale multi-modal (RGB, depth, flow, semantics, bounding boxes, tactile, audio, etc.) data from multiple domains (driving, robotics, indoors, outdoors, etc.) to improve downstream task performance. Our approach emphasizes training scalability: by learning from multiple modalities, models can develop useful data-driven priors about 3D geometry, physics, and dynamics for world understanding.
Our research interests include, but are not limited to:
- Video Generation
- World Models
- 4D Reconstruction
- Multi-Modal Models
- Multi-View Geometry
- Data Augmentation
- Video-Language-Action Models
- 4D World Models
- Physical and Embodied Intelligence
- Multi-Modal Learning
We welcome you to join a positive, friendly, and enthusiastic team of researchers, where you will contribute to helping people gain and maintain independence, access, and mobility. We work closely with other Toyota affiliates, and actively collaborate towards research publications and the productization of our developed technologies.
Responsibilities
- Develop, integrate, and deploy algorithms for Multi-Modal and 4D reasoning targeting physical applications.
- Handle the ingestion of large-scale datasets for training, including streaming, online, and continual learning.
- Contribute innovative solutions at the intersection of machine learning, computer vision, and robotics to improve real-world task performance.
- Work closely with robotics and machine learning researchers and engineers to understand theoretical and practical needs.
- Follow best practices for producing maintainable code, both for internal use and for open-sourcing to the scientific community.
- Contribute to research publications and technical reports.
Qualifications
- Bachelor's or Master's degree in Computer Science, Electrical Engineering, Robotics, or a related technical field.
- Exceptional candidates with equivalent research experience (e.g., strong publication record, open-source contributions, or industry research experience) are encouraged to apply.
- Strong background in computer vision and its applications to robotics and embodied systems.
- Demonstrated research experience through publications, technical projects, or open-source contributions.
- Strong communication skills and a collaborative mindset, with the ability to learn quickly and contribute to team research efforts.
- Passionate about assisting and amplifying older adults and those in need through dexterous manipulation, human-robot collaboration, and physical assistance innovation.
- Spatio-temporal (4D) computer vision, including multi-view geometry, 3D/4D reconstruction, video generation, self-supervised learning, occlusion reasoning, etc.
- Large-scale training of multi-modal deep learning methods, in terms of both dataset size and model complexity, including context length extension, efficient attention, distributed computing, etc.
- Application of machine learning and computer vision to embodied applications.
Toyota Research Institute is seeking an AI Resident to join its Learning From Videos team in Los Altos, California. This year-long position is geared towards early-career researchers excited about tackling ambitious problems in embodied AI.