Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Machine Learning Engineer - Distributed ML Systems

Pluralis Research

OverviewPluralis Research carries out foundational research on Protocol Learning: multi-participant training of foundation models where no single participant has, or can ever obtain, a full copy of the model. The purpose of Protocol Learning is to facilitate the creation of community-trained and community-owned frontier models with self-sustaining economics.We're looking for Senior/Staff engineers with 5+ years of experience in distributed systems and ML large-scale training. You'll be implementing a novel substrate for training distributed ML models that work under consumer-grade internet connection.ResponsibilitiesDistributed Training Architecture & OptimizationDesign and implement large-scale distributed training systems optimized for heterogeneous hardware operating under low-bandwidth, high-latency conditions.Develop and optimise model-parallel training strategies (data, tensor, pipeline parallelism) with custom sharding techniques that minimise communication overhead.Optimise GPU utilisation, memory efficiency, and compute performance across distributed nodes.Implement robust checkpointing, state synchronisation, and recovery mechanisms for long-running, fault-prone training jobs.Build monitoring and metrics systems to track training progress, model quality, and system bottlenecks.Decentralised Networking & ResilienceArchitect resilient training systems where nodes can fail, networks can partition, and participants can dynamically join or leave.Design and optimise peer-to-peer topologies for decentralised coordination across non-co-located nodes.Implement NAT traversal, peer discovery, dynamic routing, and connection lifecycle management.Profile and optimise communication patterns to reduce latency and bandwidth overhead in multi-participant environments.What You'll BringStrong experience building and operating distributed systems in production.Hands-on expertise with distributed training frameworks (FSDP, DeepSpeed, Megatron, or similar).Deep understanding of model parallelism (data, tensor, pipeline parallelism).Expert-level Python with production experience (concurrency, error handling, retry logic, clean architecture).Strong networking fundamentals: P2P systems, gRPC, routing, NAT traversal, distributed coordination.Experience optimising GPU workloads, memory management, and large-scale compute efficiency.What we offerEquity-heavy compensation with meaningful ownership in a mission-driven companyCompetitive base salary for senior engineering roles in AustraliaVisa sponsorship available for exceptional candidatesRemote-first with optional access to our Melbourne hubWorld-class team — team mates were previously at Google, Amazon, Microsoft, and leading startupsBacked by Union Square Ventures and other tier-1 investors, we're a world-class, deeply technical team of ML researchers and engineers. Pluralis is unapologetically ideological. We view the world as a better place if we are able to implement what we are attempting, and Protocol Learning as the only plausible approach to preventing a handful of massive corporations monopolising model development, access and release, and achieving massive economic capture. If this resonates, please apply.J-18808-Ljbffr Pluralis Research

Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Machine Learning Engineer - Distributed ML Systems in California, MO vacancy
  • A mission-driven technology company in California is seeking experienced Senior/Staff Engineers proficient in building distributed ML systems. Applicants should possess strong experience in optimizing large-scale training under low-bandwidth conditions, with expertise in... 
    Suggested
    Remote work

    Pluralis Research

    California, MO
    3 days ago
  •  ...We are looking for seasoned engineers that have strong problem...  ...understanding to build and manage systems with high performance,...  ...of Ads algorithms by using Machine Learning Work on NLP and CV related...  ...data mining, data analysis, distribution system Good understanding in... 
    Suggested
    Work experience placement

    6AM City, LLC

    California, MO
    3 days ago
  •  ...RoleWe are seeking an experienced Senior ML Inference Engineer to join our team, focusing on...  ...and building production-grade inference systems. You will work on critical challenges...  ...AI/ML in medical devicesBackground in distributed inference systems and model parallelism... 
    Suggested
    Remote work
    Worldwide

    PICTOR LABS INC

    California, MO
    4 days ago
  • $148.7k - $199.4k

     ...personalization and live sports experiences. As a Machine Learning Engineer, you will focus on building and operating distributed data and ML infrastructure that supports high-...  ...streaming data pipelines, feature computation systems, and ML-adjacent services that operate... 
    Suggested

    Disney Cruise Line - The Walt Disney Company

    California, MO
    4 days ago
  •  ...develop, and implement machine learning models to predict...  ...restrictions Work with engineering and operations teams...  ...expert in probability distributions, statistical modeling...  ...deploying effective ML models to production...  ...our Applicant Tracking System by approved Liftoff vendors... 
    Suggested
    Local area
    Remote work

    TechBrains

    California, MO
    3 days ago
  • A leading talent agency in AI is looking for an ML Engineer to design, build, and deploy production-grade ML systems. This role offers full ownership of the model lifecycle, collaborating with teams to deliver impactful solutions. Candidates should hold a related degree... 
    Full time

    Catalyst Labs

    California, MO
    1 day ago
  •  ...to build a safer financial system for billions of people around...  ...threat intelligence with machine learning, enables institutions and governments...  .... As a Senior Software Engineer, ML Infrastructure at TRM Labs,...  ...at the intersection of distributed systems, cloud... 
    Worldwide

    TRM Labs

    California, MO
    3 days ago
  • Autodesk, Inc. is seeking a Senior Machine Learning Engineer to develop and evolve machine learning systems for their customer platforms. This role involves designing ML capabilities, improving models, and working across the full ML lifecycle using modern techniques. With... 
    Remote job

    Autodesk, Inc.

    California, MO
    1 day ago
  • The Walt Disney Company is hiring a Machine Learning Engineer to join their team in California. This...  ...role emphasizes building and operating distributed data and machine learning...  ...streaming data pipelines and ensure systems operate reliably at scale. The ideal... 

    Disney Cruise Line - The Walt Disney Company

    California, MO
    4 days ago
  •  ...physics, mathematics, medicine, engineering, and other specialties. The...  ...seeking a highly accomplished Machine Learning Engineer to take ownership of the end-to-end ML lifecycle, from initial data exploration...  ...standards for code quality, system efficiency, and security in a... 
    Seasonal work
    Flexible hours

    SandboxAQ

    California, MO
    3 days ago
  •  ...analytics firm is seeking a Senior Software Engineer for ML Infrastructure to collaborate with...  ...operating GPU-backed infrastructure for AI systems. This role involves optimizing...  ...degree and over 5 years of experience in distributed systems, familiar with cloud environments... 

    TRM Labs

    California, MO
    17 hours ago
  • $208k - $300k

    Machine Learning Engineer - Model Evaluations, Public Sector The Public Sector ML team at Scale deploys advanced AI systems—including LLMs, agentic models, and multimodal pipelines—into mission‑critical government environments. We build evaluation frameworks that ensure... 
    Full time

    Scale AI

    California, MO
    3 days ago
  • About the role As our Backend & Infrastructure Engineer, you will architect and build the core systems that power everything our AI/ML team delivers—the APIs, infrastructure, and distributed systems that make intelligent capabilities possible at scale. This is a foundational... 
    Remote work
    Work from home

    Albert Invent Corp

    California, MO
    2 days ago
  • Job Description Job Description Role: Machine Learning Product Engineer Location: San Jose, CA Contract /...  ...Keep a keen eye on State of the art ML improvements, and think of how that can...  ...dive into technical details of ML systems with engineers, and metrics with the... 
    Contract work

    6AM City, LLC

    California, MO
    5 days ago
  • $108.91k - $112.17k

    A technology firm specializing in advanced analytics is seeking a Software Engineer focusing on transforming research prototypes into reliable software. The ideal candidate will have over 5 years of experience in software engineering, proficient in Python and Rust, and... 
    Remote work

    AIMdyn, Inc.

    California, MO
    3 days ago
  • $120k - $155k

     ...in building and evolving ML/AI applications and services...  ..., architecture, and engineering teams to deliver reliable...  ...Build and deliver scalable systems that incorporate machine learning and Generative AI capabilities...  .... Experience designing distributed systems or working within... 
    Full time

    Medium

    California, MO
    3 days ago
  • $40 - $60 per hour

     ...looking for passionate and talented Machine Learning interns to join our Text Content...  ...product managers, marketing teams, and engineers to bring innovative ideas to life....  ...with building and evolving ML training and inferencing systems at significant scale with a purpose... 
    Hourly pay
    Summer work
    Internship
    Work at office
    Flexible hours
    3 days per week

    SupportFinity™

    California, MO
    5 days ago
  • Software Engineer, Lab45 Wipro is a leading global information technology, consulting and...  ...passionate about solving complex problems and learning new modern stacks. Review code and...  ..., secure and complaint architecture and systems. Strong communication skills and bias for... 
    Remote work

    Wipro Limited

    California, MO
    3 days ago
  • $235k - $275k

     ...diverse, global presence. About The Revenue Engine Team The Revenue Engine team works to...  ...effects of competition. The team of machine learning engineers, software engineers, and data...  ...uses the learnings to build production systems that improve outcomes for Liftoff and... 
    Full time
    Remote work

    Liftoff Mobile

    California, MO
    4 days ago
  •  ...redefining how businesses learn from and optimize their...  ...in Applied AI, Machine Learning, and Data Science...  ...evolving world of intelligent systems. Location New York, NY...  ...production-grade ML systems with end-to-end...  ...professional experience in ML engineering. Strong programming... 
    Full time

    Catalyst Labs

    California, MO
    1 day ago
  • A leading performance marketing platform seeks a Machine Learning Engineer to build statistical models and production systems that balance advertiser performance. The ideal candidate will have a PhD in a relevant field and industry experience applying machine learning... 
    Remote job
    Full time

    Liftoff Mobile

    California, MO
    4 days ago
  •  ...leading AI solutions company is seeking a Machine Learning Engineer for its AI Generation Engine (SAIGE)...  ...role requires ownership of the entire ML lifecycle, focusing on designing and...  ...expertise and experience with large-scale ML systems. The company offers a competitive... 

    SandboxAQ

    California, MO
    3 days ago
  • $175k - $190k

    Senior Machine Learning Engineer Hybrid, New York • Hybrid, Boston Data Science • Hybrid • Full-time...  ...experiences. This role bridges between ML platform work and building on top of...  ...network orchestration, and live pricing systems Ensure data quality and data... 
    Full time
    Temporary work

    Nimblemind.ai

    California, MO
    4 days ago
  • Netflix, Inc. is seeking a Data Engineer with expertise in Machine Learning to own data systems and support high‑visibility data products. You will engineer scalable data pipelines and collaborate closely with Machine Learning Engineers to extract insights from the Netflix... 

    Netflix, Inc.

    California, MO
    3 days ago
  • $196.75k

     ...Roblox is looking for a Senior Engineer who will play a key role in enhancing text safety AI systems. The successful candidate will design, develop, and implement...  ...should have 5+ years of experience in machine learning system development, collaboration skills, and... 

    Roblox

    California, MO
    4 days ago
  •  ...AI solutions provider is looking for a Machine Learning Engineer focused on model evaluations in the...  ...evaluation pipelines for advanced AI systems, ensuring they function reliably in critical...  ...skills in Python, experience with ML models, and familiarity with tools like... 

    Scale AI

    California, MO
    17 hours ago
  •  ...Job Overview We are seeking an experienced MLOps Engineer to design, build, and maintain scalable machine learning operations pipelines that support the full model...  ...This role focuses on enabling production-grade ML systems using modern cloud platforms, CI/CD practices, and... 
    Long term contract
    Local area

    Codinix Consulting Services

    California, MO
    2 days ago
  • Job Description: We are seeking a versatile and pragmatic Applied ML Engineer to contribute across a broad range of machine learning and perception tasks that power our edge‑intelligent maritime systems. This role requires someone comfortable wearing many hats—from working... 
    Remote work
    Flexible hours
    Shift work

    Quartermaster

    California, MO
    5 days ago
  •  ...technology company, Thumbtack, is hiring a Staff ML Infrastructure Engineer to drive the architectural vision for their machine learning infrastructure. This role requires 8+...  ...in engineering and a strong focus on distributed systems. You'll architect solutions that... 
    Remote job

    Work180

    California, MO
    5 days ago
  • $170.5k - $228.6k

     ...leading entertainment company is looking for an experienced Data Engineer to optimize data pipelines for AI/ML research in Nicasio, CA. This hybrid role involves designing scalable data processing systems and collaborating with AI/ML researchers. Candidates should have... 

    The Walt Disney Company (Germany) GmbH

    California, MO
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Machine Learning Engineer - Distributed ML Systems. Be the first to apply!