Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff ML Systems Engineer, Distributed Systems

FieldAI

Job Description

Job Description

FieldAI’s Irvine team is where embodied AI meets real robots, real sensors, and real field deployments. Based in the heart of Southern California’s robotics ecosystem, we build risk-aware, reliable, field-ready AI systems that solve the hardest problems in robotics and unlock the full potential of embodied intelligence. If you want your work to ship, get tested on hardware, and improve through real deployments, Irvine is the place. We go beyond typical data-driven approaches or pure transformer-only architectures, combining rigorous engineering with learning systems proven in globally deployed solutions that deliver results today and get better every time our robots run in the field.

We are seeking a Staff ML Systems Engineer to architect and build the distributed infrastructure that powers large-scale machine learning workflows across the organization.

This role sits at the intersection of machine learning, distributed systems, and platform engineering. You will be responsible for designing scalable systems that support data processing, model training, evaluation, and post-processing pipelines while enabling ML teams to efficiently develop, operate, and scale production-grade workflows.

You will play a critical role in defining the architectural patterns, tooling, and infrastructure that underpin our machine learning platform.

What You'll Get To Do
  • Design and build scalable distributed machine learning pipelines across data processing, model training, evaluation, and post-processing workflows.
  • Architect distributed execution systems, including parallelization strategies, workload scheduling, resource allocation, and fault tolerance mechanisms.
  • Develop reusable abstractions, frameworks, and libraries that simplify distributed pipeline development.
  • Optimize performance across distributed CPU and GPU environments, improving throughput, utilization, and reliability.
  • Design systems that effectively manage data partitioning, memory utilization, serialization overhead, and compute efficiency.
  • Partner closely with ML engineers, data engineers, and infrastructure teams to productionize research workflows and enable large-scale model development.
  • Establish best practices and engineering standards for distributed machine learning infrastructure.
  • Evaluate and guide decisions around distributed computing frameworks, infrastructure technologies, and system design trade-offs.
  • Improve observability, debugging, monitoring, and operational tooling for distributed systems at scale.
What You Have
  • 5+ years of experience building distributed systems, backend infrastructure, machine learning platforms, or large-scale data processing systems.
  • Strong Python programming skills, including experience with concurrency, performance optimization, and systems development.
  • Experience with distributed computing frameworks such as Ray, Spark, Dask, Flink, or similar technologies.
  • Experience designing and scaling data pipelines or machine learning workflows.
  • Strong system design skills with demonstrated expertise in scalability, reliability, and performance optimization.
  • Experience diagnosing and resolving bottlenecks in distributed environments.
  • Ability to work cross-functionally and drive technical decisions across multiple teams.
The Extras That Set You Apart
  • Experience building infrastructure for machine learning training and inference systems.
  • Familiarity with modern ML frameworks such as PyTorch or TensorFlow.
  • Experience with multi-node or multi-GPU training architectures, including DDP, FSDP, DeepSpeed, or similar technologies.
  • Experience operating Kubernetes-based infrastructure and large-scale cloud systems.
  • Deep understanding of distributed systems concepts including data locality, serialization costs, scheduling, and resource management.
  • Experience with distributed debugging, observability, and workflow orchestration platforms.
  • Proven ability to establish technical direction and influence architecture across organizations.

Our salary range is highly competitive with the market, but we take into consideration an individual's background and experience in determining final salary. Base pay offered may vary depending on geographic location, job-related knowledge, skills, and experience.

In addition to competitive compensation, FieldAI offers comprehensive benefits, equity participation, and the opportunity to contribute to cutting-edge advancements in AI and robotics.

Our salary range is generous and we consider each individual’s background and experience when determining final compensation. Base pay may vary based on role scope, job-related knowledge, skills, experience, and the Irvine, California market.

Why Join FieldAI in Irvine?

In Irvine, you will work where the robots are. Our local team builds and tests systems on real hardware with real sensors, then ships them to operate in unstructured, previously unknown environments around the world. We are solving one of robotics’ hardest challenges: reliable deployment outside the lab. Our Field Foundational Models™ raise the bar for perception, planning, localization, and manipulation, with an emphasis on explainability and safety for real-world use.

You will collaborate with a world-class team that thrives on creativity, resilience, and bold thinking. We bring deep experience from organizations such as DeepMind, NASA JPL, Boston Dynamics, NVIDIA, Amazon, Tesla Autopilot, Cruise, Zoox, Toyota Research Institute, and SpaceX, along with a track record of field deployments and strong performance in DARPA challenge segments.

Be Part of the Next Robotics Revolution

We are looking for builders who want their work to leave the whiteboard and show up on robots. If you enjoy tackling tough, uncharted questions and working across disciplines, you will find your people here. Our teams span AI, software, robotics engineering, product, field deployment, and technical communication, all focused on shipping systems that perform in the real world.

Our headquarters is in Irvine, and we partner closely with teams there as well as colleagues across the US and around the world. Join us in Southern California and help define what dependable, field-ready autonomy looks like.

We value diverse perspectives and are committed to fostering an inclusive workplace. We evaluate candidates and employees based on merit, qualifications, and performance, and we do not discriminate on the basis of race, color, gender, national origin, ethnicity, veteran status, disability status, age, sexual orientation, gender identity, marital status, or any other legally protected statu

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Vacancy posted 12 days ago
Similar jobs that could be interesting for youBased on the Staff ML Systems Engineer, Distributed Systems in Seattle, WA vacancy
  • Join a leading company as a Distributed Systems Software Engineer focused on building resilient cloud applications using Python or Go. The role emphasizes...  ..., with opportunities for innovation in testing and AI/ML pipelines. Enjoy a supportive distributed workplace with... 
    Suggested
    Remote work

    Canonical

    Seattle, WA
    5 days ago
  • $142.6k - $261.5k

     ...scientists, designers, and software engineers enable our clients to solve...  ...practices. Knowledgeable in system development lifecycle and...  ...strong communication skills with staff at all levels. You are a self...  ...and interest in cloud and distributed systems architectures... 
    Suggested
    Summer holiday
    Flexible hours

    Ernst & Young Oman

    Seattle, WA
    3 days ago
  • $157.7k - $213.8k

    A leading data and AI company is seeking a Software Engineer to join their Runtime team in Bellevue, United States. This role involves building next-generation distributed data storage and processing systems, requiring 5+ years of experience in Java, Scala, or C++. Candidates... 
    Suggested

    Menlo Ventures

    Bellevue, WA
    18 hours ago
  • A leading data and AI company is seeking a Senior Software Engineer to join their team in Bellevue, Washington. You will work on building next-generation distributed data storage and processing systems that exceed traditional SQL performance. The ideal candidate will have... 
    Suggested

    Databricks Inc.

    Bellevue, WA
    18 hours ago
  • $182.4k - $247k

    A leading data and AI company in Bellevue is seeking a Software Engineer to join their Runtime team. You will build the next generation distributed data storage and processing systems that outperform traditional SQL query engines. Ideal candidates will have 8+ years of... 
    Suggested

    Menlo Ventures

    Bellevue, WA
    4 days ago
  • $250k

     ...ML Infra/Systems Engineer Title of Role: ML Infra/Systems Engineer Location: Seattle, onsite Company Stage of Funding: Seed — Software Development, AI Office Type: Onsite Salary: $250K–$450K Company Description We're representing a dynamic startup... 
    Work at office

    Recruiting from Scratch

    Seattle, WA
    9 days ago
  • $320k - $405k

     ...interpretable, and steerable AI systems. We want AI to be safe...  ...committed researchers, engineers, policy experts, and...  ..., data pipelines, or ML infrastructure Are...  ...enables scientific progress Distributed systems and parallel...  ...Currently, we expect all staff to be in one of our... 
    Work at office
    Visa sponsorship
    Flexible hours

    Menlo Ventures

    Seattle, WA
    1 day ago
  •  ...technology company in Seattle seeks experienced engineers for the Online Storage team focused on building scalable, high-performance database systems. Candidates should have over 4 years of experience in building distributed systems and expertise in systems programming... 

    Slope

    Seattle, WA
    1 day ago
  •  ...A leading technology company in Seattle is seeking a Software Engineer to develop and maintain innovative storage solutions. You will work on large-scale distributed systems with a focus on reliability and performance. The ideal candidate has 3+ years of experience in... 

    Apple

    Seattle, WA
    2 days ago
  •  ...interpretable, and steerable AI systems. We want AI to be safe...  ...committed researchers, engineers, policy experts, and...  ...steerable AI. As an ML Systems Engineer on our...  ..., large scale distributed systems Large scale LLM...  ...Currently, we expect all staff to be in one of our offices... 
    Work at office
    Visa sponsorship
    Flexible hours

    Menlo Ventures

    Seattle, WA
    1 day ago
  • $233.4k - $339.65k

     ...We are seeking a highly skilled and experienced Principal ML Systems Engineer to join our Autonomous Vehicles team. In this role, you will...  ...What You’ll Do Design & develop the next generation distributed ML data platform (Ingestion, Processing, Serving) using GCP... 
    H1b
    Local area
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Bellevue, WA
    3 days ago
  • $320k - $405k

     ...A technology company in Seattle is seeking an experienced Machine Learning Systems Engineer to join their Encodings and Tokenization team. The role involves developing and optimizing tokenization systems, collaborating with research teams, and building critical infrastructure... 

    Menlo Ventures

    Seattle, WA
    1 day ago
  • Canonical seeks a Distributed Systems Software Engineer skilled in Python or Go to enhance the quality and resilience of their cloud systems. This globally...  ...will drive initiatives spanning CI pipelines and AI/ML technology integration, fostering development excellence... 
    Remote work

    Canonical

    Seattle, WA
    3 days ago
  • $171.6k - $302.2k

     ...company in Seattle is seeking a Machine Learning Engineer to join a high-impact team. You'll build and operate a system that extracts structured knowledge from...  ...programming background, experience with scalable distributed systems, and familiarity with async programming... 

    Apple

    Seattle, WA
    1 day ago
  • A fast-growing Seattle startup is seeking a systems-focused engineer to enhance its advanced evidence reasoning platform. The ideal candidate...  ...and operating large-scale production systems, focusing on distributed systems where latency and reliability are key. You will thrive... 

    CaseGuild Inc.

    Seattle, WA
    18 hours ago
  • $164k - $313.3k

     ...Photoshop ART is seeking a Senior Machine Learning (ML) Systems & Efficiency Engineer to join our R&D team focused on delivering practical, production...  ...will be given to candidates with experience in distributed inference, multimodal model profiling, and performance optimization... 
    Temporary work
    Local area
    Worldwide

    Adobe

    Seattle, WA
    3 days ago
  • $189.6k - $237k

     ...Scale's ML platform (RLXF) team builds our internal distributed framework for large language model training and inference...  ...technologies to optimize our ML system Ideally you'd have:...  ...distributed ML systems Strong software engineering skills, proficient in frameworks... 
    Full time

    Scale AI

    Seattle, WA
    4 days ago
  •  ...data-driven intelligence. Pillar 1: Data Engineering & Observability Build and own large-scale data pipelines and observability systems that power metrics, logging, and real-...  ...at scale. Ideal candidates have strong distributed systems fundamentals, backend development... 

    B Capital

    Bellevue, WA
    1 day ago
  • $171.6k - $302.2k

     ...Apple Inc. in Seattle seeks a Senior/Staff Engineer for its Foundation Model Compute Infrastructure...  ...-scale scheduling and orchestration systems for TPU workloads. Candidates should have over 7 years of experience with distributed systems, strong skills in Python and Kubernetes... 

    Apple

    Seattle, WA
    1 day ago
  • Salesforce, Inc. in Bellevue, WA is searching for a Software Engineer to join their data strategy teams. This role involves...  ...core. The ideal candidate will have strong experience with distributed systems, big data technologies like Spark, and a commitment to operational... 

    Salesforce

    Bellevue, WA
    2 days ago
  • $147k - $211k

    Google Inc. is seeking a Software Engineer specializing in Parallel File Systems and AI/ML Storage in Seattle, WA. In this role, you will develop the next-generation cloud storage tailored for extreme-scale, data-intensive workloads. The ideal candidate will possess a... 

    Google Inc.

    Seattle, WA
    4 days ago
  •  ...Capital Group is looking for a Machine Learning Engineer in Seattle to design, build, and operate Generative AI systems. The role involves collaborating with investment professionals to enhance Capital's investment processes. The ideal candidate will have over 8 years... 

    Capital-Group-1

    Seattle, WA
    1 day ago
  • $171.6k - $302.2k

     ...Staff/Sr. Machine Learning Engineer, Foundation Models - AI, Search & Knowledge Platforms Seattle, Washington...  ...CUDA. Familiar with one of the popular ML Frameworks like Pytorch, Tensorflow....  ...Proficient in building and maintaining systems written in modern languages (eg:... 
    Relocation

    Apple

    Seattle, WA
    1 day ago
  •  ...Axon is seeking an experienced Distributed Systems Engineer based in the United States, preferably near Seattle. In this hybrid role, you will design and oversee the operation of robust, self-governing systems while mentoring teammates to produce high-quality software... 

    Out in Science, Technology, Engineering, and Mathematics

    Seattle, WA
    1 day ago
  •  ...leading technology company in Seattle is seeking a Senior ML Infrastructure Engineer for groundbreaking generative modeling technologies. You...  ...infrastructure, advanced knowledge of PyTorch, and experience with distributed training. This role offers a competitive salary package,... 

    Apple

    Seattle, WA
    1 day ago
  • $139.5k - $258.1k

     ...ML Engineer - Evaluation Analysis, Metric and Data Strategy Seattle, Washington, United States...  ...that datasets reflect actual user distributions Assess alignment across different evaluation...  ...function‑calling reliability within AI systems Experience with evaluation methodology... 
    Relocation

    Apple

    Seattle, WA
    1 day ago
  • $153.2k - $234.1k

     ...breakthrough hardware and battery systems to intuitive design, intelligent software...  ...powers every machine learning engineer working on our cutting-edge...  ...of experience building large-scale distributed systems/applications or advanced ML Applications. Proven track record... 
    Work at office
    Local area
    Remote work
    Work from home
    Flexible hours

    Israelvcforum

    Seattle, WA
    1 day ago
  • $171.6k - $302.2k

     ...scale. This team also focuses on ML-driven forecasting, capacity planning...  .... As a Sr. ML Optimization Engineer, you will work at the intersection of systems engineering, infrastructure strategy...  ..., optimization, or large‑scale distributed systems At Apple, base pay is one... 
    Relocation

    Apple

    Seattle, WA
    1 day ago
  • $171.6k - $258.1k

     ...Apple Inc. is looking for a Senior Machine Learning Engineer in Seattle, WA to contribute to innovative search technologies. In this role, you'll analyze search retrieval and ranking needs, design machine learning models, and collaborate with cross-functional teams. The... 

    Apple

    Seattle, WA
    1 day ago
  •  ...Unchain Data is looking for a Machine Learning Engineer to join their innovative team in Bellevue, WA. This role focuses on implementing and evaluating cutting-edge machine learning algorithms to solve complex financial problems. The ideal candidate should have a Bachelor... 
    Flexible hours

    Unchain Data

    Bellevue, WA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff ML Systems Engineer, Distributed Systems. Be the first to apply!