Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

ML Inference Infrastructure Engineer

Baseten

A dynamic AI company is seeking an Infrastructure Software Engineer in San Francisco to build and maintain components of an ML inference platform. The successful candidate will develop infrastructure components using Python and Go, manage Kubernetes deployments, and enhance monitoring systems for model metrics. This role offers competitive compensation with benefits like 100% medical coverage and generous PTO policies. Join a collaborative team dedicated to advancing AI and machine learning infrastructure. #J-18808-Ljbffr Baseten

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the ML Inference Infrastructure Engineer in San Francisco, CA vacancy
  • A healthcare technology firm in San Francisco is seeking an ML Infrastructure Engineer, Model Inference to build and optimize AI-driven solutions. You will design scalable Kubernetes clusters, enhance ML model serving infrastructure, and collaborate with cross-functional... 
    Suggested

    Abridge

    San Francisco, CA
    1 day ago
  • $320k

     ...Staff + Sr. Software Engineer, Cloud Inference San Francisco, CA About Anthropic Anthropic...  ...build, and own backend services and infrastructure that serve Claude across multiple CSPs...  ...about LLM serving; prior inference or ML experience is not required Thrive... 
    Suggested
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    4 days ago
  • $405k

     ...group of committed researchers, engineers, policy experts, and business...  ...organization and the Cloud Inference team: taking classifiers,...  ...reliably inside a CSP partner's infrastructure at serving-path latency and scale...  ...mitigation mechanisms for AI/ML systems, or the... 
    Suggested
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    4 days ago
  • $160k - $250k

     ...Senior Backend Engineer, Inference Platform San Francisco About the Role Together AI is...  ...speed up responses. Collaborate with ML researchers to bring new model...  ...journey in building the next generation AI infrastructure. Compensation We offer competitive... 
    Suggested
    Full time
    Local area

    Together AI

    San Francisco, CA
    5 days ago
  •  ...organization in San Francisco is seeking a Technical Program Manager for Inference to bridge their systems with the broader organization. This role...  ...experience in technical program management, ideally in ML/AI, alongside strong stakeholder management skills. This position... 
    Suggested

    Anthropic

    San Francisco, CA
    2 days ago
  • $230k - $265k

    Parafin is seeking a Software Engineer to lead the evolution of their ML Platform, ensuring robust and scalable systems for data scientists. The role requires...  ...core platform functionalities, enhance real-time inference processes, and collaborate across teams to ensure... 
    Remote job

    Parafin

    San Francisco, CA
    4 days ago
  •  ...compute into useful intelligence - the inference services that serve LLMs at scale and...  ...you honest about both. Researchers and ML engineers will hand you workloads that barely...  ...Experience operating Kubernetes-based infrastructure, including custom operators or schedulers... 
    Flexible hours

    Adaption

    San Francisco, CA
    8 days ago
  •  ...the Role We are hiring Software Engineers focused on AI Infrastructure to build the systems that enable...  ...including GPU orchestration, large-scale inference systems, performance optimization,...  ...orchestration. Familiarity with GPU-based ML workloads or distributed training/... 
    Internship
    Immediate start

    SpreeAI

    San Francisco, CA
    3 days ago
  • $170k - $216k

     ...15+ U.S. states. The Simulation Infrastructure team creates reliable, scalable, and cost...  ...a broad range of customers Software Engineers, Product, Data Science, System Engineering...  ...You will: Build and evolve ML inference infrastructure for simulations. Be... 
    Full time
    Remote work

    Waymo

    San Francisco, CA
    3 days ago
  •  ...practicing MDs AI scientists PhDs creatives technologists and engineers working together to empower people and make care make...  ...and East Liberty in Pittsburgh. The Role As an ML Infrastructure Engineer Model Inference at Abridge youll play a pivotal role in building and... 
    Hourly pay
    Full time
    Flexible hours

    Abridge

    San Francisco, CA
    7 days ago
  •  ...an innovative GPU marketplace and AI inference service that promise affordability and...  ...the Role We're seeking a Senior Infrastructure Engineer to help build and scale Hyperbolic's GPU...  ...storage and data infrastructure for AI/ML workloads, including object storage,... 
    Remote work

    Hyperbolic Labs

    San Francisco, CA
    1 day ago
  •  ...Chalk Infrastructure Engineer Chalk is building the data platform that powers the future of machine...  ...barriers that have traditionally constrained ML capabilities. Our platform combines...  ...optimize arbitrary user Python code, infers and orchestrates infrastructure implied... 
    Work at office
    Flexible hours

    CHALK INC

    San Francisco, CA
    4 days ago
  • $180k - $200k

     ...Infrastructure Engineer (Storage) New York, New York, United States; Remote; San Francisco, California...  ..., training, and production inference, with security, observability, and control...  ...storage systems that power large-scale AI/ML training, inference, and HPC workloads... 
    Remote work
    Work from home
    Flexible hours

    Lightning AI

    San Francisco, CA
    4 days ago
  • $130k - $240k

     ...Maxana is seeking an experienced Infrastructure Engineer for a confidential client — a fast-growing AI company. In this role you will...  ...and maintain the platform layer supporting large-scale ML training, inference, and deployment. This is a high-impact role at the intersection... 
    Flexible hours

    Maxana

    San Francisco, CA
    3 days ago
  • $120k - $200k

     ...Senior Infrastructure Engineer At Bland.com, our goal is to empower enterprises to make AI-phone agents at scale. Based out of San Francisco...  ...systems that handle real-time voice processing, scale ML inference, and integrate with enterprise telephony infrastructure. Your... 
    Work at office
    Night shift

    Bland AI

    San Francisco, CA
    4 days ago
  •  ...Senior HPC & GPU Infrastructure Engineer Sciforium is an AI infrastructure company developing next...  ...driver bring-up to maintaining the ML software stack (CUDA/ROCm, PyTorch, JAX...  ...vLLM, model serving optimizations, or inference systems. Hands-on experience with... 
    Flexible hours

    Sciforium

    San Francisco, CA
    3 days ago
  •  ...Baseten powers mission-critical inference for the world's most dynamic...  ...AI research, flexible infrastructure, and seamless developer tooling...  ...and help build the platform engineers turn to to ship AI products....  ...~ Exposure to a variety of ML startups, offering unparalleled... 
    Work experience placement
    Work at office
    Flexible hours

    Baseten

    San Francisco, CA
    2 days ago
  • $139.2k - $174k

     ...leading cloud services provider is looking for a Senior Engineer 2 to join their AI Infrastructure Control Plane team. This role involves architecting high...  ...networking, along with significant experience in building AI/ML products. The position offers a compensation range of $1... 
    Remote work

    DigitalOcean

    San Francisco, CA
    3 days ago
  • $150k - $200k

    Senior Infrastructure Engineer Location: On-site, San Francisco, CA (3 days/week in office) Salary: $150k - $200k + equity Industry: AI, Cloud...  .... Your work will directly impact platform reliability, ML inference performance, and the future of enterprise telephony. You’... 
    Work at office
    3 days per week

    Open Select

    San Francisco, CA
    4 days ago
  •  ...released daily. About the Role We're looking for two Infrastructure Engineers to lead the scaling of our machine learning inference system. You'll be responsible for architecting...  ...infrastructure that serves 150+ biological ML models, scaling our platform several orders of... 
    Relocation

    Tamarind Bio

    San Francisco, CA
    3 days ago
  •  ...Member of Technical Staff for RL Infrastructure in San Francisco. This role...  ...distributed RL training and inference across thousands of GPUs....  ...should have strong software engineering experience, particularly in...  ...be able to work closely with ML researchers. #J-18808-... 

    Vmax

    San Francisco, CA
    4 days ago
  • $165k - $200k

     ...You'll Do As a member of our infrastructure team, you'll be at the heart...  ...—acting as an infrastructure engineer one moment, and a developer,...  ...applications (especially in ML/AI) in AWS and/or GCP. Development...  ...machine learning inference service. Collaborating with... 
    Second job
    Remote work
    Work from home
    Relocation package
    Flexible hours

    Roboflow

    San Francisco, CA
    3 days ago
  • $250k

     ...Ready to architect AI infrastructure that powers next-generation research...  ...is now building a serverless inference platform, beginning with cost...  ...a Senior Inference Platform Engineer at an early stage and help define...  ...distributed systems (ML inference, HPC, or similar).... 
    Permanent employment
    San Francisco, CA
    more than 2 months ago
  •  ...Cohere is a team of researchers, engineers, designers, and more, who are...  ...next generation of agentic AI infrastructure at Cohere. This team sits at the intersection of ML systems, distributed...  ...emerging ML infrastructure, edge inference, or browser-native modelsOpen-... 
    Full time
    Work at office
    Remote work
    Flexible hours

    Cohere

    San Francisco, CA
    2 days ago
  • $320k - $405k

     ...group of committed researchers, engineers, policy experts, and...  ...attribution story for non-accelerator infrastructure — the network, compute, and...  ...between training clusters, inference fleets, and object storage...  ...customer. * Familiarity with AI/ML infrastructure traffic... 
    Contract work
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    2 days ago
  • $200k

     ...Infrastructure Engineer, Security San Francisco Thinking Machines Lab's mission is to empower...  ...meet all of these: Experience with ML infrastructure, GPU clusters, or large...  ...their integrations into training and inference pipelines. Logistics... 
    Local area
    Immediate start
    Visa sponsorship
    Work visa
    Relocation package

    Thinking Machines Lab

    San Francisco, CA
    4 days ago
  •  ...technology firm in San Francisco seeks an SW Engineer to enable production workloads and...  ...Candidates should have strong experience in ML systems, performance engineering, and...  ...opportunity to work with cutting-edge AI infrastructure in a vibrant team environment. #J-18808-... 

    AI Chopping Block, Inc.

    San Francisco, CA
    3 days ago
  • Zensors is seeking a Machine Learning Engineer specializing in ML Runtime & Optimization to enhance our visual sensing platform. This role involves developing technologies that improve computer vision models critical for smart spaces and cities. Your responsibilities include... 

    Zensors

    San Francisco, CA
    1 day ago
  •  ...Workshop Labs Job Posting Build the infrastructure to serve personal AI models privately and...  ...tech ever seeing your data. Our core ML systems challenge: how do we serve the world...  ...architecture with the finetuning & inference code You Have • A deep understanding... 
    Remote work
    Shift work

    Workshop Labs

    San Francisco, CA
    1 day ago
  • A cutting-edge AI infrastructure company in San Francisco seeks an experienced network engineer to optimize high-performance networking protocols for AI models. The ideal...  ...will integrate RDMA and InfiniBand into the inference stack, ensuring efficient communication and... 

    Baseten

    San Francisco, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Inference Infrastructure Engineer. Be the first to apply!