Senior ML Engineer - Distributed Training Platform
Jobgether
We are currently looking for a Senior Machine Learning Engineer - Training Platform in Australia. You will join a high-impact AI Platform group focused on building the foundational systems that power large-scale model training across a global product ecosystem. In this role, you will design and evolve the infrastructure that enables distributed AI training workloads to run reliably, efficiently, and at scale. You will work on a Kubernetes-based training platform and contribute to the full training lifecycle, including orchestration, experiment management, and artifact handling. Your work will directly support research scientists, ML engineers, and product teams in deploying advanced AI capabilities. You will collaborate across infrastructure, cloud, and applied AI teams to solve complex distributed systems challenges. This is a highly cross‑functional environment where platform engineering meets cutting‑edge generative AI innovation. Accountabilities Design, build, and scale the core training platform infrastructure supporting distributed AI workloads across multiple teams and use cases. Improve reliability, observability, debugging, and operational performance of large‑scale training systems. Develop and enhance scheduling capabilities, including resource allocation, workload prioritization, and quota management for AI training jobs. Collaborate with research scientists, ML engineers, and infrastructure teams to optimize training workflows and system performance. Contribute to architecture and system design decisions for scalable AI infrastructure. Identify user pain points and translate them into platform improvements and roadmap priorities. Mentor engineers and promote best practices in distributed systems and AI infrastructure development. Requirements Strong experience in machine learning infrastructure, distributed systems, or large‑scale AI training pipelines. Hands‑on expertise with containerized environments and orchestration using Kubernetes. Familiarity with distributed training frameworks such as Ray or PyTorch distributed training. Experience working with cloud infrastructure supporting high‑performance workloads (e.g., storage systems, networking, HPC environments). Strong systems design skills with the ability to build scalable, reliable, and maintainable platforms. Excellent collaboration skills, with experience working alongside ML engineers, researchers, and infrastructure teams. Strong ownership mindset and ability to solve complex cross‑functional engineering problems. Passion for improving developer experience and enabling AI at scale. Benefits Equity packages to share in long‑term company success. Inclusive parental leave supporting all parents and carers. Annual wellbeing and lifestyle allowance to support personal and professional needs. Flexible leave options to encourage rest, recharge, and meaningful time away. Remote‑friendly working model within Australia with flexible work arrangements. Opportunities to work on cutting‑edge AI infrastructure at global scale. Collaboration with world‑class engineers, researchers, and infrastructure experts. #J-18808-Ljbffr
$180k - $200k
...Senior ML Engineer Full Remote $180K-$200K base A fast-growing AI product company... ...environment. Mission Architect, train, and maintain distributed machine learning systems powering... ...data using Spark and modern data platforms Own the full ML lifecycle:...PlatformSeniorTrainingRemote work- EPITEC is looking for a Research Engineer to join their team focused on AI at Meta. The role involves developing deep learning libraries to support large-scale distributed training and contributing to open-source projects. Candidates should have 5+ years of experience in...SeniorTrainingRemote work
$175k - $250k
...Senior Machine Learning Engineer (ML Infrastructure & Data Systems) Our client is an... ...industrial environments. Their platform focuses on automating... ...deployment and model training. They are building toward... ..., including distributed training, experiment tracking...PlatformSeniorTraining$216.7k - $303.4k
...Senior Machine Learning Systems Engineer Remote - United States Reddit is a community... ...Machine Learning Platform team at Reddit is a... ...ll Do: As a Senior ML Infrastructure... ...including improving model training time, efficiency,... ...costs in a large, distributed ML training...PlatformSeniorTrainingFor contractorsWork experience placementRemote work- ...now launching publicly. Our platform already drives 8-figure... ...Design, build, and deploy ML models for demand forecasting... ...data preprocessing, feature engineering, model training, evaluation, and production... ...intersection of agentic AI, distributed systems, and enterprise data...PlatformSeniorTrainingImmediate startHome officeVisa sponsorshipFlexible hours3 days per week
$180k - $225k
...About the job Senior ML/AI Engineer Senior ML/AI Engineer Location... ...feature engineering, model training, validation, deployment,... ...with large-scale datasets and distributed computing environments. ~... ..., and production ML platforms. Open-source contributions...PlatformSeniorTrainingFull timeHome officeFlexible hours$118k - $169k
...Sr. ML Ops Engineer At Early Warning, we've powered and protected... ...position is responsible for the platforms, tools, and processes that... ...and pipelines for model training, deployment, and monitoring... ...understanding of data management, distributed computing, and software...PlatformSeniorTrainingHourly payWork experience placementWork at officeImmediate startFlexible hours$180k - $225k
...decision-making. Their platform unifies an... ...ground-floor opportunity - engineers joining at this stage... ...About the Role As a Senior ML/AI Engineer, you will... ...feature engineering, model training, evaluation, and production... ...-scale datasets and distributed computing environments...PlatformSeniorTrainingWork at officeRemote workHome officeFlexible hoursNight shift- ...DESCRIPTION Transflo is seeking a Senior AI/ML Engineer to lead the design,... ...Document Processing (IDP) platform. This is a high-impact AI-first... ...& MLOps Design, train, and deploy scalable ML models... ...management, hyperparameter tuning, distributed training, and endpoint...PlatformSeniorTrainingRemote work
- ...Applied AI ML Engineer-Senior Associate The Corporate & Investment Bank... ...in deploying models on AWS platforms such as SageMaker or... ...engineering. Familiarity with distributed computing systems, frameworks... ...like data sharding and DDP training Experience with...PlatformSeniorTraining
- ...Department: Engineering & Technology Function... ...first-of-its-kind platform that connects... ...ship, and own AI and ML systems that perform... .... The Senior AI/ML & Data Engineer... ...feature stores, model training, evaluation, deployment... ...pipelines and distributed compute environments...PlatformSeniorTrainingPrice workFull timeCasual workWork at officeRemote workDay shift
$160k - $240k
...Senior ML Platform Engineer - Artificial Intelligence Location New York Business Area Engineering... ...workflows for continuous model training, inference, and monitoring Work... ...Experience designing cloud-native, distributed platforms ~ Strong knowledge of...PlatformSeniorTrainingTemporary workFor contractorsWork experience placement- ...A leading automotive company is seeking a Senior ML Engineer to design and build scalable AI/ML platform infrastructure. In this role, you will collaborate with machine learning engineers and research scientists to create advanced AI solutions for intelligent driving...PlatformSeniorTraining
- ...multiple database platforms using custom development... ...to work on data engineering pipelines using... ...implementation to training to deployment of... ...discussions as a senior member of the team... ...machine learning tools ML Flow, Databricks,... ...trade, manage and distribute capital for...PlatformSeniorTrainingWorldwide
$128.25k - $195k
As an Applied AI ML Engineer Senior Associate, you will be responsible for... ...in deploying models on AWS platforms such as SageMaker or... ...engineering. Familiarity with distributed computing systems, frameworks... ...like data sharding and DDP training. Familiarity with cloud platforms...PlatformSeniorTraining- ...the world. As an Applied AI ML Engineer‑Senior Associate in the Corporate... ...in deploying models on AWS platforms such as SageMaker or... ...engineering. Familiarity with distributed computing systems, frameworks... ...like data sharding and DDP training. Experience with Diffusion...PlatformSeniorTraining
$153k - $198k
...time doing it. As a Senior Machine Learning Engineer, you will own the end to end ML lifecycle at Button,... ...feed models, through training and evaluation workflows... ...integrate with our platform and power real product... ...streaming systems, or distributed data processing...PlatformSeniorTrainingLocal area$220k
...Senior Machine Learning Engineer Location: Remote (with optional hybrid... ...across digital platforms. THE ROLE As... ...in building scalable ML infrastructure and deploying... ...to work on distributed systems, cutting-edge... ...will architect and train neural network models...PlatformSeniorTrainingRemote workFlexible hours$150k - $200k
...Senior Machine Learning Engineer (Fraud) Remote Canada Affirm is reinventing... ...interest. On the ML Fraud team, you’ll build... ...ML engineers, platform partners, and cross‑... ...feature pipelines and training datasets from proprietary... ...working with distributed data processing or parallel...PlatformSeniorTrainingRemote workFlexible hours- ...a Machine Learning Engineer . As a Senior Machine Learning Engineer... ...and deploying ML models. What you’... ...efficient data pipelines for training machine learning... ...Proficiency in cloud computing platforms (e.g., AWS, Azure,... ...1,000 employees in a distributed workforce environment...PlatformSeniorTrainingLive in
$170k - $240k
...delivering-driven expert in ML Training Infrastructure with a... ...and high-performance AI/ML platform infrastructure to support advanced... ...initiatives. As a Senior ML Engineer, you will collaborate closely... ...optimization solutions to scale distributed training workflows and maximize...PlatformSeniorTrainingRelocationRelocation packageFlexible hours$189.51k - $274.6k
...intelligence. We have two platforms: Quora, a global... ...Team and Role Our small engineering team works on challenging... ...to join our growing distribution team, working on... ...core coding skills and ML knowledge Identify new... ...feature engineering, model training, as well as...PlatformSeniorTrainingRemote workFlexible hours$148.7k - $199.4k
...Senior Machine Learning Engineer Technology is at the heart of Disney's past, present... ...direction of the News ML Platform. You will drive... ...feature engineering, batch training and low-latency online serving... ...microservices for large-scale distributed systems using REST ~...PlatformSeniorTrainingWork experience placement$204k - $259k
...Senior Machine Learning Engineer – VLM/LLM Evaluation Waymo is an autonomous... ...a range of vehicle platforms and product use... ...life-cycle from pre-training and supervised fine-... ...experience Experience in ML engineering and... ...with large scale distributed system Proficient...PlatformSeniorTrainingFull timeTemporary workRemote work$244k - $320k
...the AI marketing platform for 1:1 personalization... ...personalization engine delivers bespoke... ...customers. With a distributed global workforce... ...of brands. As a Senior Machine Learning Engineer... ...production-grade ML systems that drive... ...processing, model training, validation, and...PlatformSeniorTrainingFull time- ...Senior Machine Learning Engineer - News Technology is at the heart of... ...building the products and platforms that will power our... ..., advertising, and distribution businesses for years... .... The News ML team is responsible... ...engineering, batch training and low-latency online...PlatformSeniorTrainingWork experience placementLocal areaDay shift
$165k - $195k
...Senior Machine Learning Engineer, Recommendations (Product) New York... ...is an artist-first platform empowering artists to... ...focusing on building ML-powered features that... ...including data, features, training, and serving Make... ...Navigate distributed systems (BigQuery, BigTable...PlatformSeniorTrainingWork at officeWork from homeFlexible hours$170k - $240k
...beyond legacy search engines. Today, our... ...across next-generation platforms where discovery... ...an experienced Senior Machine Learning Engineer... ...established AI/ML and Search... ...collection, model training pipelines, model deployments... ...in building distributed, low-latency, high...PlatformSeniorTrainingSummer workWork at office- ...based Data Learning platform for automating and accelerating... ...Opportunity As a Senior Machine Learning Engineer on our Algorithms... ...work closely with ML engineers and data... ...effectively in a distributed environment with people... ..., compensation, and training. We strive to create...PlatformSeniorTrainingFull timeLocal areaRemote workWorldwideHome office
$165k - $260k
...Senior Machine Learning Engineer - Search AI, BLAW/BTAX/BGOV Location... ...analytics solutions. Our platform combines... ...teams, software and ML engineering teams, and... ...retrieval workloads across distributed systems Operate... ...conditions, education/training and skill level....PlatformSeniorTrainingTemporary workFor contractorsWork experience placement
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior ML Engineer - Distributed Training Platform. Be the first to apply!
- machine learning ai engineer New York, NY
- machine learning engineer New York, NY
- entry level machine learning engineer New York, NY
- junior machine learning research engineer New York, NY
- machine learning software engineer New York, NY
- ai ml engineer New York, NY
- senior ml engineer New York, NY
- graduate machine learning engineer New York, NY
- computer vision machine learning engineer New York, NY
- data scientist machine learning engineer New York, NY

