Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

ML Inference Infrastructure Engineer

Baseten

A dynamic AI company is seeking an Infrastructure Software Engineer in San Francisco to build and maintain components of an ML inference platform. The successful candidate will develop infrastructure components using Python and Go, manage Kubernetes deployments, and enhance monitoring systems for model metrics. This role offers competitive compensation with benefits like 100% medical coverage and generous PTO policies. Join a collaborative team dedicated to advancing AI and machine learning infrastructure. #J-18808-Ljbffr Baseten

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the ML Inference Infrastructure Engineer in San Francisco, CA vacancy
  • A healthcare technology firm in San Francisco is seeking an ML Infrastructure Engineer, Model Inference to build and optimize AI-driven solutions. You will design scalable Kubernetes clusters, enhance ML model serving infrastructure, and collaborate with cross-functional... 
    Suggested

    Abridge

    San Francisco, CA
    5 days ago
  • $405k

     ...group of committed researchers, engineers, policy experts, and business...  ...organization and the Cloud Inference team: taking classifiers,...  ...reliably inside a CSP partner’s infrastructure at serving‑path latency and scale...  ...mitigation mechanisms for AI/ML systems, or the... 
    Suggested
    Visa sponsorship

    United States Digital Space LLC

    San Francisco, CA
    1 day ago
  • $320k

    About the Role The Cloud Inference team scales and optimizes Claude...  ...day‑to‑day operations. Our engineers are extremely high leverage:...  ...and own backend services and infrastructure that serve Claude across multiple...  ...serving; prior inference or ML experience is not required.... 
    Suggested
    Visa sponsorship

    United States Digital Space LLC

    San Francisco, CA
    1 day ago
  •  ...organization in San Francisco is seeking a Technical Program Manager for Inference to bridge their systems with the broader organization. This role...  ...experience in technical program management, ideally in ML/AI, alongside strong stakeholder management skills. This position... 
    Suggested

    Anthropic

    San Francisco, CA
    1 day ago
  •  ...released daily. About the Role We're looking for two Infrastructure Engineers to lead the scaling of our machine learning inference system. You'll be responsible for architecting...  ...infrastructure that serves 150+ biological ML models, scaling our platform several orders of... 
    Suggested
    Relocation

    Tamarind Bio

    San Francisco, CA
    3 days ago
  • Software Engineer (AI Infrastructure / Training / Inference) About the Role We are hiring Software Engineers focused on AI Infrastructure to build the systems...  ...container orchestration. Familiarity with GPU-based ML workloads or distributed training/inference systems.... 

    SpreeAI

    San Francisco, CA
    2 days ago
  • $130k - $240k

     ...Maxana is seeking an experienced Infrastructure Engineer for a confidential client — a fast-growing AI company. In this role you will...  ...and maintain the platform layer supporting large-scale ML training, inference, and deployment. This is a high-impact role at the intersection... 
    Flexible hours

    Maxana

    San Francisco, CA
    2 days ago
  • $180k - $200k

     ...Infrastructure Engineer (Storage) Lightning AI is the company behind PyTorch Lightning. Founded...  ...experimentation, training, and production inference, with security, observability, and...  ...storage systems that power large-scale AI/ML training, inference, and HPC workloads... 
    Remote work
    Work from home
    Flexible hours

    Lightning AI

    San Francisco, CA
    3 days ago
  •  ...Relace is building the models and infrastructure that code agents reach for....  ...As an Infrastructure Engineer at Relace, you'll design and...  ...that power our high-performance inference and training infrastructure....  ...systems for deploying and scaling ML workloads globally. - Work... 
    Work at office

    Relace Inc

    San Francisco, CA
    1 hour ago
  • $120k - $200k

     ..., and many more. About the Role As a Senior Infrastructure Engineer at Bland, you'll help us to build the backbone that enables...  ...systems that handle real-time voice processing, scale ML inference, and integrate with enterprise telephony infrastructure. Your... 
    Work at office
    Night shift

    Bland Company

    San Francisco, CA
    1 hour ago
  •  ...Infrastructure Engineer Chalk is building the data platform that powers the future of machine learning...  ...that have traditionally constrained ML capabilities. Our platform combines Rust...  ...optimize arbitrary user Python code, infers and orchestrates infrastructure implied... 
    Work at office
    Flexible hours

    CHALK INC

    San Francisco, CA
    4 days ago
  • $320k - $405k

     ...group of committed researchers, engineers, policy experts, and...  ...attribution story for non-accelerator infrastructure — the network, compute, and...  ...between training clusters, inference fleets, and object storage...  ...customer. * Familiarity with AI/ML infrastructure traffic... 
    Contract work
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    1 day ago
  • $150k - $200k

    Senior Infrastructure Engineer Location: On-site, San Francisco, CA (3 days/week in office) Salary: $150k - $200k + equity Industry: AI, Cloud...  .... Your work will directly impact platform reliability, ML inference performance, and the future of enterprise telephony. You’... 
    Work at office
    3 days per week

    Open Select

    San Francisco, CA
    3 days ago
  •  ...an innovative GPU marketplace and AI inference service that promise affordability and...  ...About the Role We're seeking a Senior Infrastructure Engineer to help build and scale Hyperbolic's GPU...  ...and data infrastructure for AI/ML workloads, including object storage, high... 
    Remote work

    Hyperbolic Labs

    San Francisco, CA
    5 days ago
  • $300k

    A rare infrastructure role in a frontier RL research operation. Compensation...  ...‑funded AI company, a small engineering team works with top...  ...enables researchers and applied ML engineers to run, debug, and...  ...rollouts, training orchestration, inference, evaluation, data pipelines,... 
    H1b

    Aionia Group

    San Francisco, CA
    5 days ago
  •  ...Member of Technical Staff for RL Infrastructure in San Francisco. This role...  ...distributed RL training and inference across thousands of GPUs....  ...should have strong software engineering experience, particularly in...  ...be able to work closely with ML researchers. #J-18808-... 

    VMAX LLC

    San Francisco, CA
    3 days ago
  • $165k - $200k

     ...You'll Do As a member of our infrastructure team, you'll be at the heart...  ...—acting as an infrastructure engineer one moment, and a developer,...  ...applications (especially in ML/AI) in AWS and/or GCP. Development...  ...machine learning inference service. Collaborating with... 
    Second job
    Remote work
    Work from home
    Relocation package
    Flexible hours

    Roboflow

    San Francisco, CA
    2 days ago
  • $200k - $260k

    Rebuild Matterhaul's infrastructure and core systems from zero — AWS, Kubernetes...  ...choices that the rest of engineering will build on for years. Founding...  ...Postgres logical replication. AI/ML infrastructure: vector stores, GPU‑backed inference, embedding pipelines, prompt/... 
    Full time
    Work at office
    Local area

    Matterhaul Inc.

    San Francisco, CA
    2 days ago
  •  ...Baseten powers mission-critical inference for the world's most dynamic...  ...AI research, flexible infrastructure, and seamless developer tooling...  ...and help build the platform engineers turn to to ship AI products....  ...k) Exposure to a variety of ML startups, offering unparalleled... 
    Work experience placement
    Work at office
    Flexible hours

    Baseten

    San Francisco, CA
    5 days ago
  • $139.2k - $174k

     ...leading cloud services provider is looking for a Senior Engineer 2 to join their AI Infrastructure Control Plane team. This role involves architecting high...  ...networking, along with significant experience in building AI/ML products. The position offers a compensation range of $1... 
    Remote work

    DigitalOcean

    San Francisco, CA
    1 day ago
  •  ...technology firm in San Francisco seeks an SW Engineer to enable production workloads and...  ...Candidates should have strong experience in ML systems, performance engineering, and...  ...opportunity to work with cutting-edge AI infrastructure in a vibrant team environment. #J-18808-... 

    AI Chopping Block, Inc.

    San Francisco, CA
    2 days ago
  • $183.7k - $248.6k

     ...looking for a Senior Machine Learning Infrastructure Engineer to join our Vector Ads team, where we...  ...operate the infrastructure that brings ML models from training into production,...  ...feature serving, model versioning, and inference optimization What we're looking for... 
    Work at office
    Remote work
    Worldwide
    Relocation package

    UNITY

    San Francisco, CA
    1 day ago
  • $250k

     ...Ready to architect AI infrastructure that powers next-generation research...  ...is now building a serverless inference platform, beginning with cost...  ...a Senior Inference Platform Engineer at an early stage and help define...  ...distributed systems (ML inference, HPC, or similar).... 
    Permanent employment
    San Francisco, CA
    more than 2 months ago
  •  ...Cohere is a team of researchers, engineers, designers, and more, who are...  ...next generation of agentic AI infrastructure at Cohere. This team sits at the intersection of ML systems, distributed...  ...emerging ML infrastructure, edge inference, or browser-native models Open... 
    Remote job
    Full time
    Work at office
    Flexible hours

    Cohere

    San Francisco, CA
    2 days ago
  • A cutting-edge AI infrastructure company in San Francisco seeks an experienced network engineer to optimize high-performance networking protocols for AI models. The ideal...  ...will integrate RDMA and InfiniBand into the inference stack, ensuring efficient communication and... 

    Baseten

    San Francisco, CA
    2 days ago
  •  ...financial crime at an unprecedented scale. As a Senior Software Engineer, ML Infrastructure at TRM Labs, you will collaborate with data scientists,...  ...concurrent models and users. Optimize high-throughput inference. Implement and tune serving systems that maximize token... 
    Worldwide

    TRM Labs

    San Francisco, CA
    1 day ago
  • $200k - $280k

    Engineering San Francisco Full-time $200,000 - $280,000 About the Role Join our ML Infrastructure team to build the systems that train, deploy, and serve our AI models at scale....  ...Design model serving systems for low-latency inference Implement monitoring and observability... 
    Full time
    Work at office

    Lattice, Inc.

    San Francisco, CA
    1 day ago
  • $160k - $250k

     ...Together AI is building the Inference Platform that brings the most...  ...across data centers and model engine pods. Develop auto-scaling...  ...responses. Collaborate with ML researchers to bring new...  ...building the next generation AI infrastructure. Compensation We offer... 
    Full time
    Local area

    Together AI

    San Francisco, CA
    more than 2 months ago
  •  ...Network Engineer (Data Centers) Baseten powers mission-critical inference for the world's most dynamic AI companies. By uniting...  ...applied AI research, flexible infrastructure, and seamless developer...  ...~ Exposure to a variety of ML startups, offering unparalleled... 
    Flexible hours

    Baseten

    San Francisco, CA
    3 days ago
  •  ...humans once had to do. Role Overview We're looking for a Senior Infrastructure Engineer to own and evolve the foundational systems that power Casca'...  ...and compliance are first‑class concerns. Familiarity with ML pipeline orchestration. Experience with multi‑tenant SaaS... 

    Casca

    San Francisco, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Inference Infrastructure Engineer. Be the first to apply!