Machine Learning Infrastructure Engineer
Mind Robotics
Machine Learning Infrastructure Engineer
At Mind Robotics, we're building generalized physical AI—robotic systems capable of dexterous, adaptive, and reasoning-intensive work in real-world industrial environments. Our ability to iterate quickly on large-scale models depends on world-class ML infrastructure.
We're looking for a Machine Learning Infrastructure Engineer to build the core systems that enable fast, reliable, and scalable model training—powering everything from experimentation to production deployment.
Responsibilities
- Design and implement scalable systems for training large ML models
- Enable efficient workflows for data ingestion, training, and iteration
- Develop and optimize distributed training systems across hundreds of GPUs
- Implement strategies for parallelization, sharding, and efficient compute utilization
- Improve training efficiency through techniques such as attention optimizations, kernel fusion, and memory management
- Partner closely with modeling teams to accelerate iteration speed and reduce training costs
- Build internal tools for experiment tracking, monitoring, and debugging
- Implement systems for tracking training performance, failures, and resource utilization
- Debug and resolve bottlenecks across the training stack
- Provide lightweight infrastructure support for deploying and running models in production environments
- Optimize inference performance and reliability where needed
- Support core cloud infrastructure needs for training workloads (without heavy DevOps overhead)
- Manage compute resources efficiently across training jobs
Qualifications
- Strong experience building infrastructure for large-scale ML training
- Deep understanding of how modern LLM/VLM systems are trained and scaled
- Proven experience setting up and scaling distributed training across hundreds of GPUs
- Strong understanding of parallelization strategies (data, model, pipeline parallelism)
- Strong proficiency in Python programming
- Expert-level proficiency in PyTorch and/or JAX
- Strong understanding of techniques like attention optimization, kernel fusion, and efficient memory usage
Nice to Have
- Experience supporting inference systems in production
- Familiarity with robotics or embodied AI workloads
- Experience building tools for experiment management and researcher productivity
- ...industry experience (including 4+ years in the U.S.) ~ Strong foundation in machine learning, deep learning, and computer vision ~ Experience with distributed systems and scalable ML infrastructure ~ Proficient in Python and software development best practices ~...Suggested
$112.7k - $169.1k
...Mountain View, CA, USA Machine Learning Engineer, Offline Infrastructure (Entry-Level / New Grad) Location Mountain View, CA, USA Department AI & Machine Learning Requisition ID JOBREQ-2616004 Role description The opportunity Unity Vector builds...SuggestedWork at officeWorldwideRelocation package$183.7k - $248.6k
The opportunity Unity is looking for a Senior Machine Learning Infrastructure Engineer to join our Vector Ads team, where we build the real-time systems that power Unity's global advertising platform. This is a high-scale, low-latency environment — processing billions...SuggestedWork at officeRemote workWorldwideRelocation package$167.2k - $210k
...Machine Learning Engineer, Infrastructure Menlo Park, California, United States Moloco builds some of the most powerful AI advertising solutions in the world. Our name—short for "machine learning company"—reflects our core mission: democratizing access to the advanced...SuggestedWorldwide$209.7k - $283.8k
...Mountain View, CA, USA Staff Machine Learning Engineer, ML Infrastructure Location Mountain View, CA, USA Department AI & Machine Learning Requisition ID JOBREQ-2615904 Role description The opportunity Unity Vector builds an offline ML platform...SuggestedWork at officeWorldwideRelocation package$140k - $390k
...-fidelity virtual world where Optimus can safely learn, adapt, and improve. Our mission is to recreate the... ...and reality. We are looking for passionate engineers with expertise in distributed systems, ML infrastructure, and graphics/game development who are excited to...Hourly payFull timeTemporary workFlexible hours$124k - $210k
...energyconsumption, achieving substantial energy savings and operational agility. The Tesla R&D Infrastructure team is seeking a skilled Full-Stack Machine Learning Engineer to join our mission of accelerating the world's transition to sustainable energy. You will...Hourly payFull timeTemporary workFlexible hours- ...Job Description What You Will Do We are looking for Machine learning engineers to join our GenAI Search Platform team to improve our... ...platform team works closely with the ranking, product, design, infrastructure and data science teams to drive our agentic search...Work at officeRemote workFlexible hours
- As a machine learning platform engineer of the Samsung Ads Platform Intelligence (PI) team, you will have access to unique Samsung proprietary data to develop and deploy a wide spectrum of large-scale machine learning products with real-world impact.
- ...About the job Machine Learning Engineer (Agentic AI Platform) About the Role We're building the next generation of agentic AI... ...thrives in ambiguity and wants to shape foundational AI infrastructure from the ground up. You'll work at the intersection of...
- ...time trading, all backed by robust data infrastructure. The Role Arta is building the AI... ...# System Design Interview with VP of Engineering, 60m # Co-founder Interview with Head... ..., collaboration, and continuous learning are highly valued ~ The opportunity...Work at officeRemote workRelocation
$230k - $300k
...with the right team to fulfill our mission: building the infrastructure layer for content intelligence. If you're inspired to... ...information, visit About the Role We are seeking a Staff Machine Learning Engineer to provide technical leadership for our recommendation...Full timeLocal areaWork from home$160k - $200k
...fast-growing teams. As a Senior ML Infrastructure Engineer at Plus, you will design scalable architectures... ...integrated with state-of-the-art deep learning frameworks like PyTorch or TensorFlow.... ...the boundaries of what's possible in machine learning infrastructure and contribute...- ...tools by working on pioneering technologies to surprise and delight creative pros and enthusiasts alike. As a Machine Learning Infrastructure engineer, you will be working alongside world-class engineers and creatives to help innovate in the creative space in ways...
$150k
...researchers, data scientists, and engineers, tackling the most... ...performance computing in deep learning, driving impactful discoveries... ...looking for a distributed ML infrastructure engineer to help extend and... ...Experience with large-scale machine learning workloads (strong ML...Flexible hours- ...About the role We're looking for seasoned ML Infrastructure engineers with experience designing, building and maintaining training and serving... ...Character.AI Character.AI empowers people to connect, learn and tell stories through interactive entertainment. Over 20...
- ...Position: Client Infrastructure Engineer with GCP Location: Mountain View, CA [Needs to be onsite for 1 week once in a quarter on... ...2+ months Contract JOB DESCRIPTION: Experience in Machine Learning engineer or Infrastructure roles, with a focus on Machine...Contract work
$124k - $250k
...Life As a member of our software engineering infra team, you'll solve technical challenges... ...implementing state-of-the-art software infrastructure. The team builds a high-performance,... ...legally protected characteristics. Learn more about EEO rights as an applicant here...$181.1k - $318.4k
...Sr. Machine Learning Engineer, ASR Infrastructure and Tools, Siri Speech Want to join the team pushing the boundaries of AI and building an intelligent assistant that helps millions of people get things done? Join the Siri team at Apple! To build the best speech recognition...WorldwideRelocation$148.7k - $258.72k
...Mountain View, CA, USA Senior Machine Learning Engineer, Ads Experimentation & Measurements Location Mountain View, CA, USA Department AI & Machine Learning Requisition ID JOBREQ-2616032 Role description The opportunity Unity's Ads Experimentation...Temporary workWork at officeWorldwideRelocation package- ...any hybrid work will be at the manager's discretion) W2 Candidates only Position Summary Seeking an experienced Machine Learning Engineer to lead the development of prompt injection and prompt safety models that protect downstream agentic AI systems across...
$140.7k - $223.4k
...Mountain View, CA, USA Senior Machine Learning Engineer, Advertiser Growth Location Mountain View, CA, USA Department AI & Machine... ...expands, advertisers have the intelligent tools, robust infrastructure, and rigorous experimentation frameworks needed to scale...Work at officeWorldwideRelocation package$213k - $263k
...Senior Machine Learning Engineer, Runtime and Serving Waymo is an autonomous driving technology company with the mission to be the world... ...focused on building, scaling, or maintaining ML systems and infrastructure. ~5+ years production programming in C++. ~3+ years...Full timeRemote work- ...profile. Responsibilities: Develop Machine Learning (ML) and physics-based models for... ...Create and maintain Client’s Cloud infrastructure for software development and high-volume... ...Mandatory: Experience with earth engine and functional programming. Experience...Contract workShift work
- ...Machine Learning Engineer In this role you will: Build, deploy, and improve robust machine learning models for product features. Design... ...insights. Provide input and collaborate closely with the ML Infrastructure team towards the development of technical platforms that...
$213k - $263k
...Machine Learning Engineer, Runtime & Optimization Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver...Full timeRemote work$213k - $263k
...Senior Machine Learning Engineer, Multimodal Perception (LLM/VLM) Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building...Full timeRemote work$196k - $221k
...ML and work alongside industry-veteran scientists and engineers. As a Machine Learning Engineer, you'll bring your strong software engineering... ...-term maintenance. Partner deeply with product, and infrastructure teams to develop and translate cutting-edge research into...Permanent employment- ...Machine Learning Engineer Location: Warren, MI / Mountain View, CA Duration: Fulltime Job Description: Must Have Technical/Functional Skills: • Assist in fine-tuning VLA (Vision-Language Alignment) models for robotic perception and interaction tasks • Curate...Full time
- ...About Voltai Voltai is developing world models, and agents to learn, evaluate, plan, experiment, and interact with the physical... ...four US presidents. What We're Looking For Strong AI/ML engineering skills from top tier CS, EECS, Math and Physics programs. Proven...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Machine Learning Infrastructure Engineer. Be the first to apply!
- machine learning ai engineer Palo Alto, CA
- machine learning engineer Palo Alto, CA
- machine learning software engineer Palo Alto, CA
- ai ml engineer Palo Alto, CA
- senior ml engineer Palo Alto, CA
- computer vision machine learning engineer Palo Alto, CA
- data infrastructure engineer Palo Alto, CA
- infrastructure engineering manager Palo Alto, CA
- remote infrastructure engineer Palo Alto, CA
- principal infrastructure engineer Palo Alto, CA

