Machine Learning Engineer - Distributed ML Systems
Pluralis Research
Overview

Pluralis Research carries out foundational research on Protocol Learning: multi-participant training of foundation models where no single participant has, or can ever obtain, a full copy of the model. The purpose of Protocol Learning is to facilitate the creation of community-trained and community-owned frontier models with self-sustaining economics.

We're looking for Senior/Staff engineers with 5+ years of experience in distributed systems and large-scale ML training. You'll be implementing a novel substrate for training distributed ML models over consumer-grade internet connections.

Responsibilities

Distributed Training Architecture & Optimization
- Design and implement large-scale distributed training systems optimized for heterogeneous hardware operating under low-bandwidth, high-latency conditions.
- Develop and optimise model-parallel training strategies (data, tensor, pipeline parallelism) with custom sharding techniques that minimise communication overhead.
- Optimise GPU utilisation, memory efficiency, and compute performance across distributed nodes.
- Implement robust checkpointing, state synchronisation, and recovery mechanisms for long-running, fault-prone training jobs.
- Build monitoring and metrics systems to track training progress, model quality, and system bottlenecks.

Decentralised Networking & Resilience
- Architect resilient training systems where nodes can fail, networks can partition, and participants can dynamically join or leave.
- Design and optimise peer-to-peer topologies for decentralised coordination across non-co-located nodes.
- Implement NAT traversal, peer discovery, dynamic routing, and connection lifecycle management.
- Profile and optimise communication patterns to reduce latency and bandwidth overhead in multi-participant environments.

What You'll Bring
- Strong experience building and operating distributed systems in production.
- Hands-on expertise with distributed training frameworks (FSDP, DeepSpeed, Megatron, or similar).
- Deep understanding of model parallelism (data, tensor, pipeline parallelism).
- Expert-level Python with production experience (concurrency, error handling, retry logic, clean architecture).
- Strong networking fundamentals: P2P systems, gRPC, routing, NAT traversal, distributed coordination.
- Experience optimising GPU workloads, memory management, and large-scale compute efficiency.

What we offer
- Equity-heavy compensation with meaningful ownership in a mission-driven company
- Competitive base salary for senior engineering roles in Australia
- Visa sponsorship available for exceptional candidates
- Remote-first with optional access to our Melbourne hub
- World-class team: teammates were previously at Google, Amazon, Microsoft, and leading startups

Backed by Union Square Ventures and other tier-1 investors, we're a world-class, deeply technical team of ML researchers and engineers. Pluralis is unapologetically ideological. We believe the world will be a better place if we succeed in what we are attempting, and we see Protocol Learning as the only plausible way to prevent a handful of massive corporations from monopolising model development, access and release, and from achieving massive economic capture. If this resonates, please apply.

