Machine Learning Engineer - Distributed ML Systems
Pluralis Research
Overview
Pluralis Research carries out foundational research on Protocol Learning: multi-participant training of foundation models where no single participant has, or can ever obtain, a full copy of the model. The purpose of Protocol Learning is to facilitate the creation of community-trained and community-owned frontier models with self-sustaining economics.
We’re looking for Senior/Staff engineers with 5+ years of experience in distributed systems and large‑scale ML training. You’ll be implementing a novel substrate for training distributed ML models over consumer‑grade internet connections.
Responsibilities
Distributed Training Architecture & Optimization
- Design and implement large‑scale distributed training systems optimized for heterogeneous hardware operating under low‑bandwidth, high‑latency conditions.
- Develop and optimise parallel training strategies (data, tensor, and pipeline parallelism) with custom sharding techniques that minimise communication overhead.
- Optimise GPU utilisation, memory efficiency, and compute performance across distributed nodes.
- Implement robust checkpointing, state synchronisation, and recovery mechanisms for long‑running, fault‑prone training jobs.
- Build monitoring and metrics systems to track training progress, model quality, and system bottlenecks.
Decentralised Networking & Resilience
- Architect resilient training systems where nodes can fail, networks can partition, and participants can dynamically join or leave.
- Design and optimise peer‑to‑peer topologies for decentralised coordination across non‑co‑located nodes.
- Implement NAT traversal, peer discovery, dynamic routing, and connection lifecycle management.
- Profile and optimise communication patterns to reduce latency and bandwidth overhead in multi‑participant environments.
What You’ll Bring
- Strong experience building and operating distributed systems in production.
- Hands‑on expertise with distributed training frameworks (FSDP, DeepSpeed, Megatron, or similar).
- Deep understanding of parallelism strategies (data, tensor, and pipeline parallelism).
- Expert‑level Python with production experience (concurrency, error handling, retry logic, clean architecture).
- Strong networking fundamentals: P2P systems, gRPC, routing, NAT traversal, distributed coordination.
- Experience optimising GPU workloads, memory management, and large‑scale compute efficiency.
What We Offer
- Equity‑heavy compensation with meaningful ownership in a mission‑driven company
- Competitive base salary for senior engineering roles in Australia
- Visa sponsorship available for exceptional candidates
- Remote‑first with optional access to our Melbourne hub
- World‑class team: teammates were previously at Google, Amazon, Microsoft, and leading startups
Backed by Union Square Ventures and other tier‑1 investors, we’re a deeply technical team of ML researchers and engineers. Pluralis is unapologetically ideological. We believe the world will be a better place if we succeed in what we are attempting, and we see Protocol Learning as the only plausible way to prevent a handful of massive corporations from monopolising model development, access, and release, and from achieving massive economic capture. If this resonates, please apply.

