Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Machine Learning Infrastructure Engineer

Mind Robotics

Machine Learning Infrastructure Engineer

At Mind Robotics, we're building generalized physical AI—robotic systems capable of dexterous, adaptive, and reasoning-intensive work in real-world industrial environments. Our ability to iterate quickly on large-scale models depends on world-class ML infrastructure.

We're looking for a Machine Learning Infrastructure Engineer to build the core systems that enable fast, reliable, and scalable model training—powering everything from experimentation to production deployment.

Responsibilities
  • Design and implement scalable systems for training large ML models
  • Enable efficient workflows for data ingestion, training, and iteration
  • Develop and optimize distributed training systems across hundreds of GPUs
  • Implement strategies for parallelization, sharding, and efficient compute utilization
  • Improve training efficiency through techniques such as attention optimizations, kernel fusion, and memory management
  • Partner closely with modeling teams to accelerate iteration speed and reduce training costs
  • Build internal tools for experiment tracking, monitoring, and debugging
  • Implement systems for tracking training performance, failures, and resource utilization
  • Debug and resolve bottlenecks across the training stack
  • Provide lightweight infrastructure support for deploying and running models in production environments
  • Optimize inference performance and reliability where needed
  • Support core cloud infrastructure needs for training workloads (without heavy DevOps overhead)
  • Manage compute resources efficiently across training jobs
Qualifications
  • Strong experience building infrastructure for large-scale ML training
  • Deep understanding of how modern LLM/VLM systems are trained and scaled
  • Proven experience setting up and scaling distributed training across hundreds of GPUs
  • Strong understanding of parallelization strategies (data, model, pipeline parallelism)
  • Strong proficiency in Python programming
  • Expert-level proficiency in PyTorch and/or JAX
  • Strong understanding of techniques like attention optimization, kernel fusion, and efficient memory usage
Nice to Have
  • Experience supporting inference systems in production
  • Familiarity with robotics or embodied AI workloads
  • Experience building tools for experiment management and researcher productivity
Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Machine Learning Infrastructure Engineer in Palo Alto, CA vacancy
  •  ...industry experience (including 4+ years in the U.S.) ~ Strong foundation in machine learning, deep learning, and computer vision ~ Experience with distributed systems and scalable ML infrastructure ~ Proficient in Python and software development best practices ~... 
    Suggested

    Saxon Global

    Atherton, CA
    4 days ago
  • $112.7k - $169.1k

     ...Mountain View, CA, USA Machine Learning Engineer, Offline Infrastructure (Entry-Level / New Grad) Location Mountain View, CA, USA Department AI & Machine Learning Requisition ID JOBREQ-2616004 Role description The opportunity Unity Vector builds... 
    Suggested
    Work at office
    Worldwide
    Relocation package

    Unity Technologies

    Mountain View, CA
    2 days ago
  • $183.7k - $248.6k

    The opportunity Unity is looking for a Senior Machine Learning Infrastructure Engineer to join our Vector Ads team, where we build the real-time systems that power Unity's global advertising platform. This is a high-scale, low-latency environment — processing billions... 
    Suggested
    Work at office
    Remote work
    Worldwide
    Relocation package

    Unity

    Mountain View, CA
    3 days ago
  • $167.2k - $210k

     ...Machine Learning Engineer, Infrastructure Menlo Park, California, United States Moloco builds some of the most powerful AI advertising solutions in the world. Our name—short for "machine learning company"—reflects our core mission: democratizing access to the advanced... 
    Suggested
    Worldwide

    Moloco

    Menlo Park, CA
    12 hours ago
  • $209.7k - $283.8k

     ...Mountain View, CA, USA Staff Machine Learning Engineer, ML Infrastructure Location Mountain View, CA, USA Department AI & Machine Learning Requisition ID JOBREQ-2615904 Role description The opportunity Unity Vector builds an offline ML platform... 
    Suggested
    Work at office
    Worldwide
    Relocation package

    Unity Technologies

    Mountain View, CA
    4 days ago
  • $140k - $390k

     ...-fidelity virtual world where Optimus can safely learn, adapt, and improve. Our mission is to recreate the...  ...and reality. We are looking for passionate engineers with expertise in distributed systems, ML infrastructure, and graphics/game development who are excited to... 
    Hourly pay
    Full time
    Temporary work
    Flexible hours

    Tesla

    Palo Alto, CA
    4 days ago
  • $124k - $210k

     ...energyconsumption, achieving substantial energy savings and operational agility. The Tesla R&D Infrastructure team is seeking a skilled Full-Stack Machine Learning Engineer to join our mission of accelerating the world's transition to sustainable energy. You will... 
    Hourly pay
    Full time
    Temporary work
    Flexible hours

    Tesla

    Palo Alto, CA
    1 day ago
  •  ...Job Description What You Will Do We are looking for Machine learning engineers to join our GenAI Search Platform team to improve our...  ...platform team works closely with the ranking, product, design, infrastructure and data science teams to drive our agentic search... 
    Work at office
    Remote work
    Flexible hours

    ServiceNow

    Mountain View, CA
    4 days ago
  • As a machine learning platform engineer of the Samsung Ads Platform Intelligence (PI) team, you will have access to unique Samsung proprietary data to develop and deploy a wide spectrum of large-scale machine learning products with real-world impact.

    Samsung Electronics

    Mountain View, CA
    1 day ago
  •  ...About the job Machine Learning Engineer (Agentic AI Platform) About the Role We're building the next generation of agentic AI...  ...thrives in ambiguity and wants to shape foundational AI infrastructure from the ground up. You'll work at the intersection of... 

    Barker Staffing Solutions, LLC

    Mountain View, CA
    3 days ago
  •  ...time trading, all backed by robust data infrastructure. The Role Arta is building the AI...  ...# System Design Interview with VP of Engineering, 60m # Co-founder Interview with Head...  ..., collaboration, and continuous learning are highly valued ~ The opportunity... 
    Work at office
    Remote work
    Relocation

    Arta Finance

    Mountain View, CA
    1 day ago
  • $230k - $300k

     ...with the right team to fulfill our mission: building the infrastructure layer for content intelligence. If you're inspired to...  ...information, visit About the Role We are seeking a Staff Machine Learning Engineer to provide technical leadership for our recommendation... 
    Full time
    Local area
    Work from home

    NewsBreak

    Mountain View, CA
    5 days ago
  • $160k - $200k

     ...fast-growing teams. As a Senior ML Infrastructure Engineer at Plus, you will design scalable architectures...  ...integrated with state-of-the-art deep learning frameworks like PyTorch or TensorFlow....  ...the boundaries of what's possible in machine learning infrastructure and contribute... 

    PlusAI, Inc.

    Santa Clara, CA
    2 days ago
  •  ...tools by working on pioneering technologies to surprise and delight creative pros and enthusiasts alike. As a Machine Learning Infrastructure engineer, you will be working alongside world-class engineers and creatives to help innovate in the creative space in ways... 

    Apple

    Cupertino, CA
    1 day ago
  • $150k

     ...researchers, data scientists, and engineers, tackling the most...  ...performance computing in deep learning, driving impactful discoveries...  ...looking for a distributed ML infrastructure engineer to help extend and...  ...Experience with large-scale machine learning workloads (strong ML... 
    Flexible hours

    Institute of Foundation Models

    Sunnyvale, CA
    1 day ago
  •  ...About the role We're looking for seasoned ML Infrastructure engineers with experience designing, building and maintaining training and serving...  ...Character.AI Character.AI empowers people to connect, learn and tell stories through interactive entertainment. Over 20... 

    Character

    Redwood City, CA
    2 days ago
  •  ...Position: Client Infrastructure Engineer with GCP Location: Mountain View, CA [Needs to be onsite for 1 week once in a quarter on...  ...2+ months Contract JOB DESCRIPTION: Experience in Machine Learning engineer or Infrastructure roles, with a focus on Machine... 
    Contract work

    Syntricate Technologies

    Mountain View, CA
    3 days ago
  • $124k - $250k

     ...Life As a member of our software engineering infra team, you'll solve technical challenges...  ...implementing state-of-the-art software infrastructure. The team builds a high-performance,...  ...legally protected characteristics. Learn more about EEO rights as an applicant here... 

    AppLovin

    Palo Alto, CA
    1 day ago
  • $181.1k - $318.4k

     ...Sr. Machine Learning Engineer, ASR Infrastructure and Tools, Siri Speech Want to join the team pushing the boundaries of AI and building an intelligent assistant that helps millions of people get things done? Join the Siri team at Apple! To build the best speech recognition... 
    Worldwide
    Relocation

    Apple

    Cupertino, CA
    5 days ago
  • $148.7k - $258.72k

     ...Mountain View, CA, USA Senior Machine Learning Engineer, Ads Experimentation & Measurements Location Mountain View, CA, USA Department AI & Machine Learning Requisition ID JOBREQ-2616032 Role description The opportunity Unity's Ads Experimentation... 
    Temporary work
    Work at office
    Worldwide
    Relocation package

    Unity Technologies

    Mountain View, CA
    5 days ago
  •  ...any hybrid work will be at the manager's discretion) W2 Candidates only Position Summary Seeking an experienced Machine Learning Engineer to lead the development of prompt injection and prompt safety models that protect downstream agentic AI systems across... 

    The Fountain Group

    Mountain View, CA
    3 days ago
  • $140.7k - $223.4k

     ...Mountain View, CA, USA Senior Machine Learning Engineer, Advertiser Growth Location Mountain View, CA, USA Department AI & Machine...  ...expands, advertisers have the intelligent tools, robust infrastructure, and rigorous experimentation frameworks needed to scale... 
    Work at office
    Worldwide
    Relocation package

    Unity Technologies

    Mountain View, CA
    2 days ago
  • $213k - $263k

     ...Senior Machine Learning Engineer, Runtime and Serving Waymo is an autonomous driving technology company with the mission to be the world...  ...focused on building, scaling, or maintaining ML systems and infrastructure. ~5+ years production programming in C++. ~3+ years... 
    Full time
    Remote work

    Waymo

    Mountain View, CA
    3 days ago
  •  ...profile. Responsibilities: Develop Machine Learning (ML) and physics-based models for...  ...Create and maintain Client’s Cloud infrastructure for software development and high-volume...  ...Mandatory: Experience with earth engine and functional programming. Experience... 
    Contract work
    Shift work

    US Tech Solutions

    Mountain View, CA
    4 days ago
  •  ...Machine Learning Engineer In this role you will: Build, deploy, and improve robust machine learning models for product features. Design...  ...insights. Provide input and collaborate closely with the ML Infrastructure team towards the development of technical platforms that... 

    LABINE AND ASSOCIATES, INC.

    Palo Alto, CA
    1 day ago
  • $213k - $263k

     ...Machine Learning Engineer, Runtime & Optimization Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver... 
    Full time
    Remote work

    Waymo

    Mountain View, CA
    5 days ago
  • $213k - $263k

     ...Senior Machine Learning Engineer, Multimodal Perception (LLM/VLM) Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building... 
    Full time
    Remote work

    Waymo

    Mountain View, CA
    1 day ago
  • $196k - $221k

     ...ML and work alongside industry-veteran scientists and engineers. As a Machine Learning Engineer, you'll bring your strong software engineering...  ...-term maintenance. Partner deeply with product, and infrastructure teams to develop and translate cutting-edge research into... 
    Permanent employment

    Otter.ai

    Mountain View, CA
    1 day ago
  •  ...Machine Learning Engineer Location: Warren, MI / Mountain View, CA Duration: Fulltime Job Description: Must Have Technical/Functional Skills: • Assist in fine-tuning VLA (Vision-Language Alignment) models for robotic perception and interaction tasks • Curate... 
    Full time

    JConnect Infotech

    Mountain View, CA
    1 day ago
  •  ...About Voltai Voltai is developing world models, and agents to learn, evaluate, plan, experiment, and interact with the physical...  ...four US presidents. What We're Looking For Strong AI/ML engineering skills from top tier CS, EECS, Math and Physics programs. Proven... 

    Voltai

    Palo Alto, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Machine Learning Infrastructure Engineer. Be the first to apply!