Machine Learning Infrastructure Engineer

Mind Robotics

At Mind Robotics, we're building generalized physical AI—robotic systems capable of dexterous, adaptive, and reasoning-intensive work in real-world industrial environments. Our ability to iterate quickly on large-scale models depends on world-class ML infrastructure.

We're looking for a Machine Learning Infrastructure Engineer to build the core systems that enable fast, reliable, and scalable model training—powering everything from experimentation to production deployment.

Responsibilities
  • Design and implement scalable systems for training large ML models
  • Enable efficient workflows for data ingestion, training, and iteration
  • Develop and optimize distributed training systems across hundreds of GPUs
  • Implement strategies for parallelization, sharding, and efficient compute utilization
  • Improve training efficiency through techniques such as attention optimizations, kernel fusion, and memory management
  • Partner closely with modeling teams to accelerate iteration speed and reduce training costs
  • Build internal tools for experiment tracking, monitoring, and debugging
  • Implement systems for tracking training performance, failures, and resource utilization
  • Debug and resolve bottlenecks across the training stack
  • Provide lightweight infrastructure support for deploying and running models in production environments
  • Optimize inference performance and reliability where needed
  • Support core cloud infrastructure needs for training workloads (without heavy DevOps overhead)
  • Manage compute resources efficiently across training jobs

Qualifications
  • Strong experience building infrastructure for large-scale ML training
  • Deep understanding of how modern LLM/VLM systems are trained and scaled
  • Proven experience setting up and scaling distributed training across hundreds of GPUs
  • Strong understanding of parallelization strategies (data, model, pipeline parallelism)
  • Strong proficiency in Python programming
  • Expert-level proficiency in PyTorch and/or JAX
  • Strong understanding of techniques like attention optimization, kernel fusion, and efficient memory usage

Nice to Have
  • Experience supporting inference systems in production
  • Familiarity with robotics or embodied AI workloads
  • Experience building tools for experiment management and researcher productivity
Vacancy posted 3 days ago
Location: Palo Alto, CA