ML Inference Infrastructure Engineer
Baseten
A dynamic AI company is seeking an Infrastructure Software Engineer in San Francisco to build and maintain components of an ML inference platform. The successful candidate will develop infrastructure components using Python and Go, manage Kubernetes deployments, and enhance monitoring systems for model metrics. This role offers competitive compensation with benefits like 100% medical coverage and generous PTO policies. Join a collaborative team dedicated to advancing AI and machine learning infrastructure.
- A healthcare technology firm in San Francisco is seeking an ML Infrastructure Engineer, Model Inference to build and optimize AI-driven solutions. You will design scalable Kubernetes clusters, enhance ML model serving infrastructure, and collaborate with cross-functional... Suggested
$320k
...Staff + Sr. Software Engineer, Cloud Inference San Francisco, CA About Anthropic Anthropic... ...build, and own backend services and infrastructure that serve Claude across multiple CSPs... ...about LLM serving; prior inference or ML experience is not required Thrive... Suggested, Work at office, Visa sponsorship, Flexible hours, $405k
...group of committed researchers, engineers, policy experts, and business... ...organization and the Cloud Inference team: taking classifiers,... ...reliably inside a CSP partner's infrastructure at serving-path latency and scale... ...mitigation mechanisms for AI/ML systems, or the... Suggested, Work at office, Visa sponsorship, Flexible hours, $160k - $250k
...Senior Backend Engineer, Inference Platform San Francisco About the Role Together AI is... ...speed up responses. Collaborate with ML researchers to bring new model... ...journey in building the next generation AI infrastructure. Compensation We offer competitive... Suggested, Full time, Local area
- ...organization in San Francisco is seeking a Technical Program Manager for Inference to bridge their systems with the broader organization. This role... ...experience in technical program management, ideally in ML/AI, alongside strong stakeholder management skills. This position... Suggested
$230k - $265k
Parafin is seeking a Software Engineer to lead the evolution of their ML Platform, ensuring robust and scalable systems for data scientists. The role requires... ...core platform functionalities, enhance real-time inference processes, and collaborate across teams to ensure... Remote job
- ...compute into useful intelligence - the inference services that serve LLMs at scale and... ...you honest about both. Researchers and ML engineers will hand you workloads that barely... ...Experience operating Kubernetes-based infrastructure, including custom operators or schedulers... Flexible hours
- ...the Role We are hiring Software Engineers focused on AI Infrastructure to build the systems that enable... ...including GPU orchestration, large-scale inference systems, performance optimization,... ...orchestration. Familiarity with GPU-based ML workloads or distributed training/... Internship, Immediate start
$170k - $216k
...15+ U.S. states. The Simulation Infrastructure team creates reliable, scalable, and cost... ...a broad range of customers Software Engineers, Product, Data Science, System Engineering... ...You will: Build and evolve ML inference infrastructure for simulations. Be... Full time, Remote work
- ...practicing MDs, AI scientists, PhDs, creatives, technologists, and engineers working together to empower people and make care make... ...and East Liberty in Pittsburgh. The Role As an ML Infrastructure Engineer, Model Inference at Abridge you'll play a pivotal role in building and... Hourly pay, Full time, Flexible hours
- ...an innovative GPU marketplace and AI inference service that promise affordability and... ...the Role We're seeking a Senior Infrastructure Engineer to help build and scale Hyperbolic's GPU... ...storage and data infrastructure for AI/ML workloads, including object storage,... Remote work
- ...Chalk Infrastructure Engineer Chalk is building the data platform that powers the future of machine... ...barriers that have traditionally constrained ML capabilities. Our platform combines... ...optimize arbitrary user Python code, infers and orchestrates infrastructure implied... Work at office, Flexible hours
$180k - $200k
...Infrastructure Engineer (Storage) New York, New York, United States; Remote; San Francisco, California... ..., training, and production inference, with security, observability, and control... ...storage systems that power large-scale AI/ML training, inference, and HPC workloads... Remote work, Work from home, Flexible hours, $130k - $240k
...Maxana is seeking an experienced Infrastructure Engineer for a confidential client, a fast-growing AI company. In this role you will... ...and maintain the platform layer supporting large-scale ML training, inference, and deployment. This is a high-impact role at the intersection... Flexible hours, $120k - $200k
...Senior Infrastructure Engineer At Bland.com, our goal is to empower enterprises to make AI-phone agents at scale. Based out of San Francisco... ...systems that handle real-time voice processing, scale ML inference, and integrate with enterprise telephony infrastructure. Your... Work at office, Night shift
- ...Senior HPC & GPU Infrastructure Engineer Sciforium is an AI infrastructure company developing next... ...driver bring-up to maintaining the ML software stack (CUDA/ROCm, PyTorch, JAX... ...vLLM, model serving optimizations, or inference systems. Hands-on experience with... Flexible hours
- ...Baseten powers mission-critical inference for the world's most dynamic... ...AI research, flexible infrastructure, and seamless developer tooling... ...and help build the platform engineers turn to to ship AI products.... ...~ Exposure to a variety of ML startups, offering unparalleled... Work experience placement, Work at office, Flexible hours
$139.2k - $174k
...leading cloud services provider is looking for a Senior Engineer 2 to join their AI Infrastructure Control Plane team. This role involves architecting high... ...networking, along with significant experience in building AI/ML products. The position offers a compensation range of $1... Remote work, $150k - $200k
Senior Infrastructure Engineer Location: On-site, San Francisco, CA (3 days/week in office) Salary: $150k - $200k + equity Industry: AI, Cloud... .... Your work will directly impact platform reliability, ML inference performance, and the future of enterprise telephony. You’... Work at office, 3 days per week
- ...released daily. About the Role We're looking for two Infrastructure Engineers to lead the scaling of our machine learning inference system. You'll be responsible for architecting... ...infrastructure that serves 150+ biological ML models, scaling our platform several orders of... Relocation
- ...Member of Technical Staff for RL Infrastructure in San Francisco. This role... ...distributed RL training and inference across thousands of GPUs.... ...should have strong software engineering experience, particularly in... ...be able to work closely with ML researchers.
$165k - $200k
...You'll Do As a member of our infrastructure team, you'll be at the heart... ...acting as an infrastructure engineer one moment, and a developer,... ...applications (especially in ML/AI) in AWS and/or GCP. Development... ...machine learning inference service. Collaborating with... Second job, Remote work, Work from home, Relocation package, Flexible hours, $250k
...Ready to architect AI infrastructure that powers next-generation research... ...is now building a serverless inference platform, beginning with cost... ...a Senior Inference Platform Engineer at an early stage and help define... ...distributed systems (ML inference, HPC, or similar).... Permanent employment
- ...Cohere is a team of researchers, engineers, designers, and more, who are... ...next generation of agentic AI infrastructure at Cohere. This team sits at the intersection of ML systems, distributed... ...emerging ML infrastructure, edge inference, or browser-native models Open-... Full time, Work at office, Remote work, Flexible hours
$320k - $405k
...group of committed researchers, engineers, policy experts, and... ...attribution story for non-accelerator infrastructure: the network, compute, and... ...between training clusters, inference fleets, and object storage... ...customer. * Familiarity with AI/ML infrastructure traffic... Contract work, Work at office, Visa sponsorship, Flexible hours, $200k
...Infrastructure Engineer, Security San Francisco Thinking Machines Lab's mission is to empower... ...meet all of these: Experience with ML infrastructure, GPU clusters, or large... ...their integrations into training and inference pipelines. Logistics... Local area, Immediate start, Visa sponsorship, Work visa, Relocation package
- ...technology firm in San Francisco seeks an SW Engineer to enable production workloads and... ...Candidates should have strong experience in ML systems, performance engineering, and... ...opportunity to work with cutting-edge AI infrastructure in a vibrant team environment.
- Zensors is seeking a Machine Learning Engineer specializing in ML Runtime & Optimization to enhance our visual sensing platform. This role involves developing technologies that improve computer vision models critical for smart spaces and cities. Your responsibilities include...
- ...Workshop Labs Job Posting Build the infrastructure to serve personal AI models privately and... ...tech ever seeing your data. Our core ML systems challenge: how do we serve the world... ...architecture with the finetuning & inference code You Have • A deep understanding... Remote work, Shift work
- A cutting-edge AI infrastructure company in San Francisco seeks an experienced network engineer to optimize high-performance networking protocols for AI models. The ideal... ...will integrate RDMA and InfiniBand into the inference stack, ensuring efficient communication and...

