ML Systems Engineer: Distributed LLM Training & Inference

$200.8k - $251k

Scale AI

A leading AI technology company in San Francisco seeks a team member to build and optimize a machine learning framework for large language models. Candidates should have system optimization experience and solid software engineering skills, particularly with tools like CUDA and PyTorch. This full-time position offers a competitive salary range of $200,800 - $251,000, along with comprehensive benefits.

Vacancy posted 5 days ago
Similar jobs that could be interesting for you
Based on the ML Systems Engineer: Distributed LLM Training & Inference in Seattle, WA vacancy
  •  ...reliable, field-ready AI systems that solve the...  ...rigorous engineering with learning systems...  ...are seeking a Staff ML Systems Engineer to...  ...architect and build the distributed infrastructure...  ...processing, model training, evaluation, and...  ...learning training and inference systems.... 
    Training
    Local area

    FieldAI

    Seattle, WA
    6 days ago
  •  ...seeking a Senior or Staff Software Engineer for the ML Infrastructure team. The role...  ...designing and operating systems for large-scale model training and inference, focusing on reliability and performance...  ...extensive experience with distributed systems, Kubernetes, and... 
    Training

    Salesforce

    Seattle, WA
    2 days ago
  • $148.2k - $300.96k

    Machine Learning Engineer Graduate (TikTok Trust and Safety-CV/NLP/Multimodal LLM) - 2026 Start (PhD) Location: Seattle Employment...  ...for optimizing our distributed model training framework continuously. We are...  ...Models, Recommendation Systems, and related areas. 3. Strong... 
    Training
    Temporary work
    Local area

    Ellis Technologies, Inc.

    Seattle, WA
    2 days ago
  • $148.2k - $300.96k

    CV/NLP/Multimodal LLM Machine Learning Engineer Graduate (TikTok-Trust and Safety) - 2026 Start (PhD...  ...language models. Optimize our distributed model training framework continuously. Qualifications...  ...language models, recommendation systems, and related areas. Strong coding... 
    Training
    Temporary work
    Local area

    TikTok

    Seattle, WA
    3 days ago
  • $184.5k

     ...Senior Machine Learning Engineer role is part of the Distribution & Supply team which...  ...learning-driven systems that power how our...  ...into clear ML‑driven solutions, selecting...  ...including model training, evaluation, and deployment...  ...large‑scale batch inference, including clear,... 
    Training
    Local area
    Flexible hours

    Expedia Group

    Seattle, WA
    2 days ago
  • $213.3k - $272k

     ...We are seeking a Staff ML/LLM Ops Engineer to own the model lifecycle...  ...monitoring. API Design: Strong systems and API design judgment...  ...Computer Vision / video model inference at scale (GPU serving,...  ...experience, and education/training. Your total earning potential... 
    Training
    Full time
    Contract work
    Work at office
    Flexible hours

    LVT

    Seattle, WA
    16 days ago
  • $201.3k - $302.2k

    Sr. Applied ML Engineer, Apple Services Localization...  ...the server-side systems, and the APIs for...  ...translation and LLM-based systems...  ...own the serving, inference, and data pipelines...  ...track record training or deploying large...  ...operating large‑scale, distributed production... 
    Training
    Relocation

    Apple Inc.

    Seattle, WA
    1 day ago
  • Menlo Ventures is seeking an ML Systems Engineer in Seattle to enhance and improve the algorithms and infrastructure used for model training. You will work to boost the performance and reliability of these systems, focusing on rapid research progress. The ideal candidate... 
    Training
    Work at office

    Menlo Ventures

    Seattle, WA
    2 days ago
  • $320k - $405k

     ...company in Seattle is seeking an experienced Machine Learning Systems Engineer to join their Encodings and Tokenization team. The role...  ...and building critical infrastructure for efficient AI model training. The ideal candidate will have strong software engineering experience... 
    Training

    Menlo Ventures

    Seattle, WA
    1 day ago
  • $320k - $405k

     ...interpretable, and steerable AI systems. We want AI to be safe...  ...researchers, engineers, policy experts, and...  ...efficient and effective training of our AI systems...  ...systems, data pipelines, or ML infrastructure Are...  ...scientific progress Distributed systems and parallel computing... 
    Training
    Work at office
    Visa sponsorship
    Flexible hours

    Menlo Ventures

    Seattle, WA
    1 day ago
  • $184k - $287.5k

     ...are looking for outstanding ML/DL compiler engineers to join the team and...  ...learning compilers and AI systems. We build innovative AI compiler...  ...state of art kernels in LLM inference workloads Develop AI‑driven...  ...for LLM inference and training (e.g. FlashInfer, Flash Attention... 
    Training

    SwiftCruit

    Seattle, WA
    5 days ago
  • $99.6k - $234.6k

    Principal AI Agent / ML Software Engineer (OCI) Job...  ...next‑generation AI systems on Oracle Cloud Infrastructure...  ..., scalable inference infrastructure,...  ...combines deep distributed systems...  ...understanding of LLM application patterns...  ...GPU inference or training workloads for latency... 
    Training
    Temporary work
    Flexible hours

    Ll Oefentherapie

    Seattle, WA
    2 days ago
  • $170k - $240k

     ...the Product and Engineering team at...  ...MLE) on the AI & ML (Insights) team...  ...architecture, training, deployment, and...  ...scalable data systems. You will be expected...  ...models that can infer meaning and...  ...grade GenAI or LLM‑based systems with...  ...pipelines and distributed systems using technologies... 
    Training
    Work at office
    Remote work
    Visa sponsorship

    PitchBook Data

    Seattle, WA
    3 days ago
  • Snapchat seeks a Software Engineer, ML Infrastructure to design and optimize infrastructure for machine learning workloads. This role involves building scalable ML model training and serving systems, enhancing feature generation pipelines, and collaborating with machine... 
    Training
    Work at office

    Snapchat

    Bellevue, WA
    3 days ago
  • $171.6k - $302.2k

    Senior ML Infrastructure Engineer - Training Algorithms, SIML Seattle, Washington, United...  ...? We are the Intelligence System Experience (ISE) team within...  ...in training / adapting LLM and Diffusion models Advanced...  ...projects Experience with distributed training of large models... 
    Training
    Relocation

    Apple Inc.

    Seattle, WA
    5 days ago
  • $175k - $280k

     ...consisting of a variety of LLM, speech, and vision models. Partner with ML infrastructure and training engineers to build a fast, cost‑effective...  ...custom kernels to speed up inference. Find ways to reduce model...  ...low latency. Significant systems programming experience; ex.... 
    Training
    Contract work
    Flexible hours

    Sesame

    Bellevue, WA
    11 days ago
  •  ...seeking a Senior or Staff Software Engineer to join their ML Infrastructure team. You will...  ...and operating core systems for large scale model training and inference in a fast-paced environment....  ...This role requires expertise in distributed systems and Kubernetes, with... 
    Training

    Slack Enterprise

    Seattle, WA
    4 days ago
  •  ...Systems Engineer About Us We are building next-generation...  ...-built for large-scale AI training and inference. As a startup, we operate...  ...system performance across distributed environments Troubleshoot...  ...Experience in AI/ML infrastructure or HPC environments... 
    Training
    Work at office

    Nscale

    Seattle, WA
    1 day ago
  •  ...VAST Data is looking for a Senior Systems Engineer to join our growing team! This is...  ...for real-time data analysis and AI training and inference. Designed from the ground up to make...  ...matter expertise on storage products, distributed storage architectures, file systems,... 
    Training
    Traineeship

    VAST Data

    Seattle, WA
    2 days ago
  • $156.75k - $250.8k

     ...Machine Learning Engineer to join a new...  ...multimodal reasoning systems. As a senior...  ...pipelines, the training and evaluation...  ..., the inference and serving stack...  ...maintaining large-scale distributed platforms in...  ...across the ML lifecycle: data...  ...agentic systems and LLM tool-use... 
    Training
    Full time
    Work experience placement
    Work at office
    Remote work

    Axon

    Seattle, WA
    1 day ago
  • $171.6k - $302.2k

     ...Description As a Senior/Staff Engineer on the Foundation...  ...and orchestration systems for large-scale TPU...  ...clusters. You will work on distributed systems that manage...  ...of large-scale training and inference jobs. This role spans...  ...systems for distributed ML workloads running on... 
    Training
    Relocation

    Apple Inc.

    Seattle, WA
    4 days ago
  • VAST Data is looking for a Senior Systems Engineer to join our growing team! This is a great...  ...for real‑time data analysis and AI training and inference. Designed from the ground up to make...  ...expertise on storage products, distributed storage architectures, file systems,... 
    Training
    Traineeship

    VAST Data

    Seattle, WA
    3 days ago
  • $184.5k - $258k

     ...Senior ML/Gen AI Engineer Introduction to the team: Expedia Technology...  ...will work with a geo‑distributed, cross‑functional team of...  ...to robust production systems. Design, build, and own...  ...machine learning systems for training, deployment, inference, and monitoring. Lead... 
    Training
    Flexible hours

    11105 Expedia, Inc.

    Seattle, WA
    8 hours ago
  •  ...seeking an experienced RDMA software engineer to join their team. In this hands-...  ...infrastructure for large-scale AI training and inference workloads. The ideal candidate...  ...in high-performance networking and distributed communication systems, with a passion for solving complex... 
    Training

    Ll Oefentherapie

    Seattle, WA
    5 days ago
  •  ...company in Seattle is seeking a Senior ML Infrastructure Engineer for groundbreaking generative...  ...functional team to build infrastructure for training large-scale generative models. Ideal...  ...knowledge of PyTorch, and experience with distributed training. This role offers a... 
    Training

    Apple Inc.

    Seattle, WA
    5 days ago
  •  ...Anomaly Detection, and LLM fine-tuning —...  ...As one of our AI ML Engineers, you'll be a...  ...performance multi-agent systems that perceive,...  ...Build real-time inference pipelines for...  ...architecting large-scale distributed systems on cloud...  ..., Paternity) Training & Development... 
    Training
    Shift work

    C-Serv Global Ltd

    Bellevue, WA
    2 days ago
  • $175k - $280k

     ...in Bellevue, Washington, is seeking a talented engineer to join our team focused on revolutionizing the...  ...product category, working with state-of-the-art LLM and vision models. Candidates should have significant systems programming and performance engineering experience... 

    Sesame

    Bellevue, WA
    10 days ago
  • $185k - $235k

     ...every downstream ML model performs: better...  ...Machine Learning Engineer to design and build the data and ML systems that transform raw...  ...engineering, training, offline/online evaluation...  ...latency, high‑QPS inference for real‑time...  ...with large‑scale / distributed model training and... 
    Training

    Francisco Partners

    Bellevue, WA
    1 day ago
  • $201.3k - $302.2k

    Apple Inc. is looking for a Sr. Applied ML Engineer in Seattle to develop core language and machine translation models that power Localization across Services. This role involves integrating advanced machine learning into various Apple platforms, focusing on large-scale... 

    Apple

    Seattle, WA
    1 day ago
  • $184.5k

     ...Senior Machine Learning Engineer Expedia Technology teams...  ...scale batch and real-time ML systems that power pricing,...  ...intersection of machine learning, distributed systems, and MLOps,...  ...pipelines, model training and validation, scalable inference, monitoring, drift detection... 
    Training
    Local area

    Expedia, Inc.

    Seattle, WA
    11 days ago
