Senior GPU ML Infra Engineer — Mid-Training & Inference

Reflection AI

A cutting-edge AI technology company based in San Francisco is seeking a specialist to design and operate large-scale GPU infrastructure. This role requires expertise in deploying GPU systems for high-throughput inference and model performance optimization. The ideal candidate will have hands-on experience with modern inference frameworks and a solid understanding of reinforcement learning technologies. Comprehensive healthcare benefits, parental leave, and daily meals are provided, along with competitive salary and equity packages.

Vacancy posted 1 day ago
Similar jobs that could be interesting for you
Based on the Senior GPU ML Infra Engineer — Mid-Training & Inference vacancy in San Francisco, CA
  • Reducto, a fast-growing AI company in San Francisco, is hiring a Machine Learning Infra Engineer. This role involves building and maintaining the training and inference frameworks necessary for optimal performance. Ideal candidates should possess strong Python skills,... 
    Training

    Reducto

    San Francisco, CA
    5 days ago
  •  ...San Francisco is looking for a Senior Software Engineer to build scalable infrastructure for large‑scale training and fine-tuning of foundation...  ...training systems and optimize GPU utilization while collaborating...  ...over 5 years of experience in ML infrastructure and a strong... 
    Senior
    Training

    Baseten

    San Francisco, CA
    6 days ago
  •  ...the physical world. Training our models...  ...heterogeneous fleet of GPU and TPU clusters —...  ...seamless. The Team The ML Infrastructure...  ...closely with ML Infra (training systems)...  ...accelerators. Support Inference and Robot...  ...Strong software engineering fundamentals Experience... 
    Training

    Physical Intelligence

    San Francisco, CA
    4 days ago
  • $250k

    Hamilton Barnes Associates Limited in San Francisco is seeking an experienced engineer to design and maintain large-scale GPU clusters for training and inference. The candidate should have over 7 years in SRE or DevOps, with strong skills in Kubernetes and Linux systems... 
    Senior
    Training

    Hamilton Barnes Associates Limited

    San Francisco, CA
    2 days ago
  •  ...Member of Technical Staff focused on building and optimizing ML inference systems in San Francisco. The role involves designing end-to-...  ...real-world workloads. Candidates should have strong software engineering skills, experience with ML inference systems, and proficiency... 
    Suggested

    Acceler8 Talent

    San Francisco, CA
    5 days ago
  • A leading AI infrastructure company is seeking a Senior ML Performance Engineer to design a comprehensive performance testing platform...  ...performance engineering and strong experience with GPU programming and ML inference workloads. Candidates should have expertise in... 
    Senior

    Amadeus Search

    San Francisco, CA
    3 days ago
  • $295k - $380k

     ...OpenAI is searching for a Senior Software Engineer to join their Robotics team in San Francisco. The role focuses on maintaining and improving the training framework while actively reviewing and debugging code within ML systems. The ideal candidate should thrive in hands... 
    Senior
    Training

    OpenAI

    San Francisco, CA
    5 days ago
  •  ...Francisco is seeking an experienced Software Engineer to develop machine learning...  ...involves building data pipelines, creating training platforms, and collaborating with various...  ...particularly in distributed systems and ML workflows. Join us in shaping the future... 
    Senior
    Training

    AI Chopping Block, Inc.

    San Francisco, CA
    3 days ago
  •  ...PDFs and spreadsheets. We train vision models to read...  ...a Machine Learning Engineer to help us train and deploy...  ...The Opportunity: As an ML Infra Engineer, you’ll play...  ...key role in building the inference and training frameworks...  ...multi-node, multi-GPU environments with strong... 
    Training
    Work at office
    Local area

    Reducto

    San Francisco, CA
    5 days ago
  • MakerMaker.AI is looking for a Senior Machine Learning Systems Engineer in San Francisco. In this role, you will build and operate production inference systems, optimizing for performance and reliability...  ..., and have strong knowledge in GPU-accelerated inference. Excellent... 
    Senior

    MakerMaker.AI

    San Francisco, CA
    3 days ago
  • $96.8k - $306.4k

     ...Job Description The Senior Principal AI Agent / ML Software Engineer is a Senior Staff-level,...  ...workflows, scalable inference infrastructure, and enterprise...  ..., high throughput, GPU efficiency, reliability,...  ...-scale GPU inference or training workloads for latency, throughput... 
    Senior
    Training
    Temporary work
    Flexible hours

    Oracle

    San Francisco, CA
    3 days ago
  •  ...startup building production‑grade ML infrastructure used by...  ...customers. They are looking for a Senior AI/ML Engineer to own model training pipelines, evaluation systems, and inference serving at scale. Full‑time,...  ...with distributed training, GPU optimization, or inference serving... 
    Senior
    Training
    Full time

    Clera

    San Francisco, CA
    1 day ago
  • $200k - $350k

     ...company in San Francisco seeks candidates for a role specializing in robotic control systems. You will train whole-body policies, build simulation environments, and run GPU training experiments. Ideal candidates should have strong coding skills in Python, C++, or Rust, and... 
    Senior
    Training

    Pantera Capital

    San Francisco, CA
    4 days ago
  • $200k - $260k

     ...Senior Machine Learning Engineer, Voice AI, San Francisco. About the Role...  ...is building the best inference infrastructure for...  ...looking for a Senior ML Engineer to drive the...  ...frontier. You'll profile GPU utilization, design...  ...plus. Experience training or fine-tuning speech... 
    Senior
    Training
    Full time

    Together AI

    San Francisco, CA
    2 days ago
  • Comfy is seeking a skilled engineer to optimize model inference as part of the core ComfyUI team. This role focuses on enhancing AI model performance, memory management, and collaborating on innovative features. Ideal candidates have a strong background in PyTorch and... 
    Senior

    Comfy

    San Francisco, CA
    4 days ago
  • $204k - $259k

     ...generative modeling, Bayesian inference, hierarchical...  ...you will report to a Senior Staff Software Engineer. You will:...  ...life-cycle from pre-training and supervised fine-tuning...  ...experience Experience in ML engineering and...  ...We prefer: ML infra experience: training,... 
    Senior
    Training
    Full time
    Temporary work
    Remote work

    Waymo

    San Francisco, CA
    3 days ago
  • Define the ML strategy, raise the technical bar...  ...between research and engineering reality. You will have...  ...platform: feature store, training infrastructure, model...  ...stack. Mentor senior and mid-level engineers, conduct...  ...retrieval augmentation, and inference optimization. Expert‑... 
    Senior
    Training

    Sierracorp

    San Francisco, CA
    1 day ago
  •  ...is seeking a skilled professional to build scalable infrastructure for AI model training and inference. You will lead architectural decisions and work with core systems that power their GPU optimization platform. Candidates should have expertise in GPU fundamentals, deep... 
    Training

    Wafer

    San Francisco, CA
    4 days ago
  • $100k - $200k

    Voiceflow is seeking a skilled ML-Infrastructure Engineer in San Francisco to architect and operate auto-scaling systems for our voice AI simulation platform. The role includes optimizing GPU and compute infrastructure, ensuring high performance and reliability. Ideal... 
    Work at office

    Voiceflow

    San Francisco, CA
    1 day ago
  •  ...Mach9 ML Engineer Role At Mach9, ML Engineers build the...  ...allows us to develop and train cutting edge 3D scene...  ...is ideal for early-to-mid-career ML engineers who...  ...to scale training and inference of your models and with...  ...Familiarity with multi-GPU training and experiment... 
    Training

    Mach9

    San Francisco, CA
    3 days ago
  • $200k

     ...deploying high-throughput, ultra-low-latency inference engines for large language models or...  ...AI. Possess a deep understanding of GPU architectures (NVIDIA Ampere/Hopper) and...  ...critical intersection between the core ML training team and the backend infrastructure team... 
    Training
    Full time
    Work at office
    Worldwide

    Plaud

    San Francisco, CA
    1 day ago
  • MakerMaker, based in San Francisco, is seeking a highly skilled kernel engineer to write and optimize GPU kernels that enhance performance for training and inference. This role involves deep, low-level work to close the significant performance gap that exists in modern... 
    Senior
    Training

    MakerMaker

    San Francisco, CA
    4 days ago
  •  ...AI company in San Francisco is seeking a skilled ML Infrastructure Engineer to manage and optimize large-scale training systems. In this role, you will design and...  ...infrastructure for model training, ensuring efficient GPU/TPU utilization while working closely with... 
    Training

    Physical Intelligence

    San Francisco, CA
    3 days ago
  • A leading AI technology company in San Francisco is seeking an engineering professional to develop and manage intelligent job scheduling systems...  ...role focuses on ensuring efficient resource allocation across GPU and TPU clusters while enhancing overall system reliability.... 
    Training

    Physical Intelligence

    San Francisco, CA
    1 day ago
  • ML Systems Engineer - Robotics & AI We are building the full-stack foundation for the next generation...  ...and handling scenarios unseen in training. We work at the intersection of large-scale...  ...bottleneck identification at different GPU counts. Drive measurable gains in... 
    Training

    Maxwell Bond

    San Francisco, CA
    3 days ago
  •  ...don't believe culture can be engineered - but when it falls into place...  ...Overview We're looking for an ML infrastructure engineer to help...  ...supports every stage of the ML training flywheel and be an important...  ...distributed ML training on our GPU clusters Take ownership of performance... 
    Training
    Local area

    Humble Robotics

    San Francisco, CA
    1 day ago
  • About the Role ML Ops Engineer — Agentic AI Lab (Founding Team...  ...automating the model training, deployment,...  ...compute orchestration, GPU infrastructure, fine-tuned...  ...conversion, quantization, and inference rollout Manage hybrid...  ...engineering, or infra-focused ML roles Deep... 
    Training
    Full time

    Fabrion

    San Francisco, CA
    3 days ago
  •  ...seeking a Member of Technical Staff to design and optimize inference systems. The role involves managing KV cache...  ...components. Ideal candidates should have strong software engineering skills and experience with ML inference systems, particularly in Python and C++. This... 
    Senior

    Gimlet Labs

    San Francisco, CA
    4 days ago
  •  ...we offer an innovative GPU marketplace and AI inference service that promise affordability...  ...We're seeking a Senior Infrastructure Engineer to help build and scale...  ...data infrastructure for AI/ML workloads, including...  ...distributed file systems for training data and checkpoints... 
    Senior
    Training
    Remote work

    Hyperbolic Labs

    San Francisco, CA
    4 days ago
  • $220k

    Perplexity is looking for an engineer to join their team in San Francisco....  ...work on building and operating the inference engine, supporting new models, migrating GPU kernels, and developing a Rust-...  ...software engineering with a focus on ML inference, familiarity with deep... 
    Senior

    Perplexity

    San Francisco, CA
    5 days ago
