ML Inference Infrastructure Engineer

Baseten

A dynamic AI company is seeking an Infrastructure Software Engineer in San Francisco to build and maintain components of an ML inference platform. The successful candidate will develop infrastructure components using Python and Go, manage Kubernetes deployments, and enhance monitoring systems for model metrics. This role offers competitive compensation with benefits like 100% medical coverage and generous PTO policies. Join a collaborative team dedicated to advancing AI and machine learning infrastructure. #J-18808-Ljbffr Baseten

Apply

Vacancy posted 2 days ago

Similar jobs that could be interesting for youBased on the ML Inference Infrastructure Engineer in San Francisco, CA vacancy

ML Infrastructure Engineer - Model Inference & Scale
A healthcare technology firm in San Francisco is seeking an ML Infrastructure Engineer, Model Inference to build and optimize AI-driven solutions. You will design scalable Kubernetes clusters, enhance ML model serving infrastructure, and collaborate with cross-functional...
Suggested
Abridge
San Francisco, CA
5 days ago
Staff Software Engineer, Cloud Inference Safeguards
$405k
...group of committed researchers, engineers, policy experts, and business... ...organization and the Cloud Inference team: taking classifiers,... ...reliably inside a CSP partner’s infrastructure at serving‑path latency and scale... ...mitigation mechanisms for AI/ML systems, or the...
Suggested
Visa sponsorship
United States Digital Space LLC
San Francisco, CA
1 day ago
Staff + Sr. Software Engineer, Cloud Inference
$320k
About the Role The Cloud Inference team scales and optimizes Claude... ...day‑to‑day operations. Our engineers are extremely high leverage:... ...and own backend services and infrastructure that serve Claude across multiple... ...serving; prior inference or ML experience is not required....
Suggested
Visa sponsorship
United States Digital Space LLC
San Francisco, CA
1 day ago
Inference Performance TPM: AI Infrastructure Lead
...organization in San Francisco is seeking a Technical Program Manager for Inference to bridge their systems with the broader organization. This role... ...experience in technical program management, ideally in ML/AI, alongside strong stakeholder management skills. This position...
Suggested
Anthropic
San Francisco, CA
1 day ago
Infrastructure Engineer
...released daily. About the Role We're looking for two Infrastructure Engineers to lead the scaling of our machine learning inference system. You'll be responsible for architecting... ...infrastructure that serves 150+ biological ML models, scaling our platform several orders of...
Suggested
Relocation
Tamarind Bio
San Francisco, CA
3 days ago
Software Engineer (AI Infrastructure / Training / Inference)
Software Engineer (AI Infrastructure / Training / Inference) About the Role We are hiring Software Engineers focused on AI Infrastructure to build the systems... ...container orchestration. Familiarity with GPU-based ML workloads or distributed training/inference systems....
SpreeAI
San Francisco, CA
2 days ago
Infrastructure Engineer
$130k - $240k
...Maxana is seeking an experienced Infrastructure Engineer for a confidential client — a fast-growing AI company. In this role you will... ...and maintain the platform layer supporting large-scale ML training, inference, and deployment. This is a high-impact role at the intersection...
Flexible hours
Maxana
San Francisco, CA
2 days ago
Infrastructure Engineer (Storage)
$180k - $200k
...Infrastructure Engineer (Storage) Lightning AI is the company behind PyTorch Lightning. Founded... ...experimentation, training, and production inference, with security, observability, and... ...storage systems that power large-scale AI/ML training, inference, and HPC workloads...
Remote work
Work from home
Flexible hours
Lightning AI
San Francisco, CA
3 days ago
Infrastructure Engineer
...Relace is building the models and infrastructure that code agents reach for.... ...As an Infrastructure Engineer at Relace, you'll design and... ...that power our high-performance inference and training infrastructure.... ...systems for deploying and scaling ML workloads globally. - Work...
Work at office
Relace Inc
San Francisco, CA
1 hour ago
Senior Infrastructure Engineer
$120k - $200k
..., and many more. About the Role As a Senior Infrastructure Engineer at Bland, you'll help us to build the backbone that enables... ...systems that handle real-time voice processing, scale ML inference, and integrate with enterprise telephony infrastructure. Your...
Work at office
Night shift
Bland Company
San Francisco, CA
1 hour ago
Infrastructure Engineer
...Infrastructure Engineer Chalk is building the data platform that powers the future of machine learning... ...that have traditionally constrained ML capabilities. Our platform combines Rust... ...optimize arbitrary user Python code, infers and orchestrates infrastructure implied...
Work at office
Flexible hours
CHALK INC
San Francisco, CA
4 days ago
Network Engineer, Capacity and Efficiency
$320k - $405k
...group of committed researchers, engineers, policy experts, and... ...attribution story for non-accelerator infrastructure — the network, compute, and... ...between training clusters, inference fleets, and object storage... ...customer. * Familiarity with AI/ML infrastructure traffic...
Contract work
Work at office
Visa sponsorship
Flexible hours
Anthropic
San Francisco, CA
1 day ago
Senior Infrastructure Engineer - AI
$150k - $200k
Senior Infrastructure Engineer Location: On-site, San Francisco, CA (3 days/week in office) Salary: $150k - $200k + equity Industry: AI, Cloud... .... Your work will directly impact platform reliability, ML inference performance, and the future of enterprise telephony. You’...
Work at office
3 days per week
Open Select
San Francisco, CA
3 days ago
Senior GPU Infrastructure Engineer
...an innovative GPU marketplace and AI inference service that promise affordability and... ...About the Role We're seeking a Senior Infrastructure Engineer to help build and scale Hyperbolic's GPU... ...and data infrastructure for AI/ML workloads, including object storage, high...
Remote work
Hyperbolic Labs
San Francisco, CA
5 days ago
RL Infrastructure Engineer — Frontier AI Research
$300k
A rare infrastructure role in a frontier RL research operation. Compensation... ...‑funded AI company, a small engineering team works with top... ...enables researchers and applied ML engineers to run, debug, and... ...rollouts, training orchestration, inference, evaluation, data pipelines,...
H1b
Aionia Group
San Francisco, CA
5 days ago
Hybrid SF RL Infrastructure Engineer GPU-Scale Orchestration
...Member of Technical Staff for RL Infrastructure in San Francisco. This role... ...distributed RL training and inference across thousands of GPUs.... ...should have strong software engineering experience, particularly in... ...be able to work closely with ML researchers. #J-18808-...
VMAX LLC
San Francisco, CA
3 days ago
Infrastructure Engineer
$165k - $200k
...You'll Do As a member of our infrastructure team, you'll be at the heart... ...—acting as an infrastructure engineer one moment, and a developer,... ...applications (especially in ML/AI) in AWS and/or GCP. Development... ...machine learning inference service. Collaborating with...
Second job
Remote work
Work from home
Relocation package
Flexible hours
Roboflow
San Francisco, CA
2 days ago
Founding Infrastructure Engineer
$200k - $260k
Rebuild Matterhaul's infrastructure and core systems from zero — AWS, Kubernetes... ...choices that the rest of engineering will build on for years. Founding... ...Postgres logical replication. AI/ML infrastructure: vector stores, GPU‑backed inference, embedding pipelines, prompt/...
Full time
Work at office
Local area
Matterhaul Inc.
San Francisco, CA
2 days ago
Infrastructure Ops Engineer
...Baseten powers mission-critical inference for the world's most dynamic... ...AI research, flexible infrastructure, and seamless developer tooling... ...and help build the platform engineers turn to to ship AI products.... ...k) Exposure to a variety of ML startups, offering unparalleled...
Work experience placement
Work at office
Flexible hours
Baseten
San Francisco, CA
5 days ago
Senior Engineer, AI Inference Platform
$139.2k - $174k
...leading cloud services provider is looking for a Senior Engineer 2 to join their AI Infrastructure Control Plane team. This role involves architecting high... ...networking, along with significant experience in building AI/ML products. The position offers a compensation range of $1...
Remote work
DigitalOcean
San Francisco, CA
1 day ago
Platform Workload Engineer: AI Inference & Benchmarking
...technology firm in San Francisco seeks an SW Engineer to enable production workloads and... ...Candidates should have strong experience in ML systems, performance engineering, and... ...opportunity to work with cutting-edge AI infrastructure in a vibrant team environment. #J-18808-...
AI Chopping Block, Inc.
San Francisco, CA
2 days ago
Senior Machine Learning Infrastructure Engineer
$183.7k - $248.6k
...looking for a Senior Machine Learning Infrastructure Engineer to join our Vector Ads team, where we... ...operate the infrastructure that brings ML models from training into production,... ...feature serving, model versioning, and inference optimization What we're looking for...
Work at office
Remote work
Worldwide
Relocation package
UNITY
San Francisco, CA
1 day ago
Senior Inference Engineer - AI Infrastructure
$250k
...Ready to architect AI infrastructure that powers next-generation research... ...is now building a serverless inference platform, beginning with cost... ...a Senior Inference Platform Engineer at an early stage and help define... ...distributed systems (ML inference, HPC, or similar)....
Permanent employment
San Francisco, CA
more than 2 months ago
Senior Agent Infrastructure Engineer (Remote‑Flexible)
...Cohere is a team of researchers, engineers, designers, and more, who are... ...next generation of agentic AI infrastructure at Cohere. This team sits at the intersection of ML systems, distributed... ...emerging ML infrastructure, edge inference, or browser-native models Open...
Remote job
Full time
Work at office
Flexible hours
Cohere
San Francisco, CA
2 days ago
GPU Networking Engineer - RDMA & Distributed Inference
A cutting-edge AI infrastructure company in San Francisco seeks an experienced network engineer to optimize high-performance networking protocols for AI models. The ideal... ...will integrate RDMA and InfiniBand into the inference stack, ensuring efficient communication and...
Baseten
San Francisco, CA
2 days ago
Machine Learning Infrastructure Engineer
...financial crime at an unprecedented scale. As a Senior Software Engineer, ML Infrastructure at TRM Labs, you will collaborate with data scientists,... ...concurrent models and users. Optimize high-throughput inference. Implement and tune serving systems that maximize token...
Worldwide
TRM Labs
San Francisco, CA
1 day ago
ML Infrastructure Engineer
$200k - $280k
Engineering San Francisco Full-time $200,000 - $280,000 About the Role Join our ML Infrastructure team to build the systems that train, deploy, and serve our AI models at scale.... ...Design model serving systems for low-latency inference Implement monitoring and observability...
Full time
Work at office
Lattice, Inc.
San Francisco, CA
1 day ago
Senior Backend Engineer, Inference Platform
$160k - $250k
...Together AI is building the Inference Platform that brings the most... ...across data centers and model engine pods. Develop auto-scaling... ...responses. Collaborate with ML researchers to bring new... ...building the next generation AI infrastructure. Compensation We offer...
Full time
Local area
Together AI
San Francisco, CA
more than 2 months ago
Data Center Network Engineer
...Network Engineer (Data Centers) Baseten powers mission-critical inference for the world's most dynamic AI companies. By uniting... ...applied AI research, flexible infrastructure, and seamless developer... ...~ Exposure to a variety of ML startups, offering unparalleled...
Flexible hours
Baseten
San Francisco, CA
3 days ago
Senior Infrastructure Engineer
...humans once had to do. Role Overview We're looking for a Senior Infrastructure Engineer to own and evolve the foundational systems that power Casca'... ...and compliance are first‑class concerns. Familiarity with ML pipeline orchestration. Experience with multi‑tenant SaaS...
Casca
San Francisco, CA
1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Inference Infrastructure Engineer. Be the first to apply!