Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Machine Learning Infrastructure Engineer

DYNA Robotics Inc

Join Us To Shape The Next Frontier Of Ai-Driven Robotics!

Dyna Robotics makes general-purpose robots powered by a proprietary embodied AI foundation model that generalizes and self-improves across varied environments with commercial-grade performance. Dyna's robots have been deployed at customers across multiple industries. Its frontier model has the top generalization and performance in the industry.

Dyna Robotics was founded by repeat founders Lindon Gao and York Yang, who sold Caper AI for $350 million, and former DeepMind research scientist Jason Ma. The company has raised over $140M, backed by top investors, including CRV and First Round. We're positioned to redefine the landscape of robotic automation.

Position Overview

As a Lead ML Infrastructure Engineer, you are the architect of our "Training Engine." You will bridge the gap between raw hardware and cutting-edge research, ensuring that our ML team can iterate at lightning speed without friction. Your goal is simple: maximize the "intelligence-per-watt" by optimizing every millisecond of the training and inference pipeline.

What You'll Do
  • Scale Distributed Training: Architect and own the infrastructure for large-scale GPU clusters. You'll implement sharding, activation checkpointing, and memory optimization (ZeRO, FSDP) to enable the training of massive multimodal models.

  • Optimize Researcher Ergonomics: Build a research codebase and job scheduling system (Kubernetes/SLURM) that prioritizes fast iteration, automated retries, and seamless failure recovery.

  • High-Performance Data Handling: Design high-throughput pipelines to ingest and transform terabytes of multimodal robot data (video, proprioception, 3D signals), ensuring dataloaders never starve the GPUs.

  • Production Inference: Build low-latency inference pipelines for real-time robot control. You'll apply quantization, distillation, and model compilation (TensorRT, Triton) to move models from the lab to the physical world.

  • Deep Systems Profiling: Dive into the weeds of GPU utilization, I/O bottlenecks, and memory fragmentation to squeeze every bit of performance out of our expanding compute fleet.

What You'll Bring
  • 7+ Years of Engineering: With a track record of leading technical projects in high-performance computing (HPC) or ML infrastructure.

  • ML Systems Mastery: Deep experience with PyTorch and distributed training frameworks (DeepSpeed, Accelerate). You understand the nuances of mixed precision and gradient accumulation.

  • Infrastructure Expertise: Hands-on experience managing cloud GPU environments (GCP/AWS) and container orchestration (Kubernetes).

  • Low-Level Intuition: A fundamental understanding of distributed systems, including race conditions, memory management, and NCCL/inter-node communication.

  • Ownership Mindset: You don't just "deploy" code; you design, build, and operate systems end-to-end to unblock fast-moving research.

Bonus Points For
  • Experience with Robotics Data Formats (MCAP, Protobuf) or multimodal models (VLAs).

  • Deep ML systems experience: custom kernels (Triton), compilers, or runtime optimization.

  • Experience as a founding or early-stage infrastructure hire.

At Dyna Robotics, we build technology for the real world, which requires a team as diverse as the environments our robots inhabit. We are an equal opportunity employer committed to technical rigor and mutual respect.

Don't let a checklist stop you. Data shows that underrepresented groups often only apply if they meet 100% of the criteria. We value problem-solving and grit over keyword matching. If you're passionate about the intersection of geometry and robotics, we want to hear from you—even if you don't check every box.

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Staff Machine Learning Infrastructure Engineer in Redwood City, CA vacancy
  •  ...industry experience (including 4+ years in the U.S.) ~ Strong foundation in machine learning, deep learning, and computer vision ~ Experience with distributed systems and scalable ML infrastructure ~ Proficient in Python and software development best practices ~... 
    Suggested

    Saxon Global

    Atherton, CA
    4 days ago
  •  ...About the role We're looking for seasoned ML Infrastructure engineers with experience designing, building and maintaining training and serving...  ...Character.AI Character.AI empowers people to connect, learn and tell stories through interactive entertainment. Over 20... 
    Suggested

    Character

    Redwood City, CA
    1 day ago
  •  ...robotic automation. Join us to shape the next frontier of AI-driven robotics! Learn more at dyna.co Position Overview: We are seeking an experienced Machine Learning Infrastructure Engineer to join our team and help scale our ML training platform. In this role, you... 
    Suggested
    Local area

    Dyna Robotics

    Redwood City, CA
    4 days ago
  • A leading AI research company in California is seeking an experienced Machine Learning Engineer to develop Vision-Language-Action models for robotics. The ideal candidate has over 5 years of expertise in machine learning, a commitment to data quality, and strong production... 
    Suggested

    GenesisAI

    San Carlos, CA
    1 day ago
  • $242.1k - $293.8k

     ...to Roblox to explore, create, play, learn, and connect with friends in 3D immersive...  ..., we are building large scale ads machine learning infrastructure to deliver effective performance ads...  ...Senior Machine Learning Infrastructure Engineer in our Ads ML Infra team, you'll... 
    Suggested
    Full time
    Work experience placement
    H1b
    Work at office
    Local area
    Visa sponsorship
    Monday to Friday

    Roblox

    San Mateo, CA
    1 day ago
  •  ...Machine Learning Infrastructure Engineer At Mind Robotics, we're building generalized physical AI—robotic systems capable of dexterous, adaptive, and reasoning-intensive work in real-world industrial environments. Our ability to iterate quickly on large-scale models... 

    Mind Robotics

    Palo Alto, CA
    3 days ago
  •  ...area of expertise: data pipelines, training infrastructure or inference. You'll build systems across the full robot learning pipeline: ingesting and processing multimodal...  ...We're Looking For Strong software engineering and systems fundamentals Experience building... 

    Sunday

    Redwood City, CA
    3 days ago
  • $183.7k - $248.6k

    The opportunity Unity is looking for a Senior Machine Learning Infrastructure Engineer to join our Vector Ads team, where we build the real-time systems that power Unity's global advertising platform. This is a high-scale, low-latency environment — processing billions... 
    Work at office
    Remote work
    Worldwide
    Relocation package

    Unity

    Mountain View, CA
    3 days ago
  • $167.2k - $210k

     ...Machine Learning Engineer, Infrastructure Menlo Park, California, United States Moloco builds some of the most powerful AI advertising solutions in the world. Our name—short for "machine learning company"—reflects our core mission: democratizing access to the advanced... 
    Worldwide

    Moloco

    Menlo Park, CA
    10 hours ago
  • $112.7k - $169.1k

     ...streaming data, supporting analytics, product intelligence, machine learning pipelines, and business operations. As data volume...  ...ML systems. We’re looking for a Machine Learning Engineer to join our Offline Infrastructure team. This is an ideal role for a recent university... 
    Work at office
    Worldwide
    Relocation package

    Unity

    Mountain View, CA
    3 days ago
  • $170k - $256k

     ...Senior Machine Learning Engineer Meltwater's Consumer Intelligence AI Team is looking for a Senior Machine Learning Engineer who will be...  ...in areas such as early ideation & prototyping, research, infrastructure, scalability, monitoring, and evaluation of AI systems.... 
    Flexible hours

    Meltwater Social (formerly Sysomos)

    Redwood City, CA
    22 hours ago
  • $211k - $263.5k

     ...The Shield team is looking for ML engineers with a passion for building out enterprise...  ...~5+ years of experience in applied machine learning ~ Lead design and implementation efforts...  ...AI, BigQuery, TensorFlow/PyTorch Infrastructure: GCP, Kubernetes Domains: User Behavior... 
    Live in
    Work at office
    Shift work
    3 days per week

    Box

    Redwood City, CA
    5 days ago
  • A leading resale marketplace firm in Redwood City is seeking a Machine Learning expert to drive data science initiatives, manage the entire machine learning lifecycle from data collection to deployment, and collaborate with various teams to enhance ML models. The ideal... 

    Poshmark, Inc.

    Redwood City, CA
    4 days ago
  • $186k - $280k

     ...Who are we? Equinix is the world's digital infrastructure company®, shortening the path to connectivity to enable the innovations...  ...to do your best work. Job Summary The Senior Machine Learning Engineer designs and implements advanced ML solutions leveraging Generative... 
    Full time
    Work at office

    Equinix

    Redwood City, CA
    1 day ago
  • $214k - $335k

     ...systems to collect, curate, and learn from some of the richest biological...  ...Join our Team! As an ML Engineer, you'll join some of the strongest infrastructure engineers in AI, building the systems...  ...building and deploying machine learning infrastructure at scale... 
    Remote work
    Relocation package

    Biohub

    Redwood City, CA
    3 days ago
  • $225k - $325k

     ...hands-on, high-ownership role for ML engineers who want to build production models that...  ...constraints. As a Founding Senior Machine Learning Engineer at Retell, you'll work across...  ...inform model iterations. Level Up Infrastructure – Design and maintain the ML infrastructure... 
    H1b
    Work at office

    Retell AI

    Redwood City, CA
    1 day ago
  • $209.7k - $283.8k

     ...Mountain View, CA, USA Staff Machine Learning Engineer, ML Infrastructure Location Mountain View, CA, USA Department AI & Machine Learning Requisition ID JOBREQ-2615904 Role description The opportunity Unity Vector builds an offline ML platform... 
    Work at office
    Worldwide
    Relocation package

    Unity Technologies

    Mountain View, CA
    4 days ago
  • $125k - $150k

     ...results. We are looking for a dynamic Data Scientist/ML Engineer to join our team. The Data Scientist/ML Engineer will work directly...  ...limited to: Research, design, implement, and deploy Machine Learning algorithms for enterprise applications. Assist and enable... 
    Temporary work
    Work experience placement

    Cathexis

    Redwood City, CA
    5 days ago
  • A cutting-edge robotics company based in California is looking for an experienced Machine Learning Infrastructure Engineer. This role involves designing scalable ML training platforms, optimizing high-performance computing systems, and ensuring robust job scheduling and... 

    Dyna Robotics

    Redwood City, CA
    3 days ago
  • $140k - $390k

     ...-fidelity virtual world where Optimus can safely learn, adapt, and improve. Our mission is to recreate the...  ...and reality. We are looking for passionate engineers with expertise in distributed systems, ML infrastructure, and graphics/game development who are excited to... 
    Hourly pay
    Full time
    Temporary work
    Flexible hours

    Tesla

    Palo Alto, CA
    4 days ago
  • $250k - $325k

     ...back into the multi-screen and multi-format content that consumers demand. About the Role: We are looking for a Lead Machine Learning Engineer with deep expertise in Performance advertising and a proven track record of architecting high-performance solutions that... 
    Work at office
    Remote work

    Pubmatic

    Redwood City, CA
    3 days ago
  • $170k - $240k

     ...Applied Machine Learning Engineer New York, NY; San Mateo, CA About Us: At Fireworks, we're building the future of generative AI infrastructure. Our platform delivers the highest-quality models with the fastest and most scalable inference in the industry. We'... 

    Fireworks AI

    Redwood City, CA
    2 days ago
  • $225k - $250k

     ...Suki Platform , our proprietary AI infrastructure. Clinicians using Suki spend over 7...  ...What will you do everyday? The Machine Learning team forms the core of Suki's product,...  ...gets it: We're former Googlers, Apple engineers, Stanford docs, and healthcare veterans... 

    Suki

    Redwood City, CA
    5 days ago
  • $170k - $225k

     ...with human-like skill, awareness, and learning capabilities. Our systems are...  ...About the Role As a Senior/Staff Machine Learning Engineer, you will be working on a myriad of...  ...deployment Experience with cloud-based infrastructure (AWS, GCP, Azure) and containerized... 
    Worldwide

    Dexterity

    Redwood City, CA
    5 days ago
  • $108k - $170k

     ...customer interaction. Why Join Us We’re looking for an AI Agent Engineer to lead the charge in building and deploying enterprise‑grade...  ...Spending Account: Flexible quarterly stipend to support wellness, learning and professional development, and personal growth Monthly... 
    Full time
    Work at office
    Local area
    Remote work
    Flexible hours

    AI Chopping Block, Inc.

    Redwood City, CA
    5 days ago
  •  ...Prove, follow us on LinkedIn. About the role We are looking for a seasoned software engineer to join our A+ technical team in designing, building and deploying machine learning solutions into production. UnifyID is building a platform to solve one the world’s... 

    UnifyID (acquired by Prove)

    Redwood City, CA
    14 days ago
  • $124k - $210k

     ...energyconsumption, achieving substantial energy savings and operational agility. The Tesla R&D Infrastructure team is seeking a skilled Full-Stack Machine Learning Engineer to join our mission of accelerating the world's transition to sustainable energy. You will... 
    Hourly pay
    Full time
    Temporary work
    Flexible hours

    Tesla

    Palo Alto, CA
    1 day ago
  •  ...soon giving countless hours back to our customers so they can spend more time on the things they value most. As a Machine Learning Research Engineer, you will work on the software and algorithms that enable our robots to complete dexterous manipulation tasks in home... 

    Sunday

    Redwood City, CA
    5 days ago
  •  ...Job Description What You Will Do We are looking for Machine learning engineers to join our GenAI Search Platform team to improve our...  ...platform team works closely with the ranking, product, design, infrastructure and data science teams to drive our agentic search... 
    Work at office
    Remote work
    Flexible hours

    ServiceNow

    Mountain View, CA
    4 days ago
  • $195.78k - $242.1k

     ...Roblox to explore, create, play, learn, and connect with friends in...  ...across Discovery, Safety, Engine, and more. We are seeking exceptional...  ...You Will As a Senior Machine Learning Engineer on the AI...  ...distributed systems, ML Infrastructure, LLM fine tuning/RL, Information... 
    Full time
    Work experience placement
    H1b
    Work at office
    Local area
    Visa sponsorship
    Monday to Friday

    Roblox

    San Mateo, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Machine Learning Infrastructure Engineer. Be the first to apply!