ML Inference Infrastructure Engineer
Baseten
A dynamic AI company is seeking an Infrastructure Software Engineer in San Francisco to build and maintain components of an ML inference platform. The successful candidate will develop infrastructure components using Python and Go, manage Kubernetes deployments, and enhance monitoring systems for model metrics. This role offers competitive compensation with benefits like 100% medical coverage and generous PTO policies. Join a collaborative team dedicated to advancing AI and machine learning infrastructure. #J-18808-Ljbffr Baseten
- A healthcare technology firm in San Francisco is seeking an ML Infrastructure Engineer, Model Inference to build and optimize AI-driven solutions. You will design scalable Kubernetes clusters, enhance ML model serving infrastructure, and collaborate with cross-functional...Suggested
$405k
...group of committed researchers, engineers, policy experts, and business... ...organization and the Cloud Inference team: taking classifiers,... ...reliably inside a CSP partner’s infrastructure at serving‑path latency and scale... ...mitigation mechanisms for AI/ML systems, or the...SuggestedVisa sponsorship$320k
About the Role The Cloud Inference team scales and optimizes Claude... ...day‑to‑day operations. Our engineers are extremely high leverage:... ...and own backend services and infrastructure that serve Claude across multiple... ...serving; prior inference or ML experience is not required....SuggestedVisa sponsorship- ...organization in San Francisco is seeking a Technical Program Manager for Inference to bridge their systems with the broader organization. This role... ...experience in technical program management, ideally in ML/AI, alongside strong stakeholder management skills. This position...Suggested
- ...released daily. About the Role We're looking for two Infrastructure Engineers to lead the scaling of our machine learning inference system. You'll be responsible for architecting... ...infrastructure that serves 150+ biological ML models, scaling our platform several orders of...SuggestedRelocation
- Software Engineer (AI Infrastructure / Training / Inference) About the Role We are hiring Software Engineers focused on AI Infrastructure to build the systems... ...container orchestration. Familiarity with GPU-based ML workloads or distributed training/inference systems....
$130k - $240k
...Maxana is seeking an experienced Infrastructure Engineer for a confidential client — a fast-growing AI company. In this role you will... ...and maintain the platform layer supporting large-scale ML training, inference, and deployment. This is a high-impact role at the intersection...Flexible hours$180k - $200k
...Infrastructure Engineer (Storage) Lightning AI is the company behind PyTorch Lightning. Founded... ...experimentation, training, and production inference, with security, observability, and... ...storage systems that power large-scale AI/ML training, inference, and HPC workloads...Remote workWork from homeFlexible hours- ...Relace is building the models and infrastructure that code agents reach for.... ...As an Infrastructure Engineer at Relace, you'll design and... ...that power our high-performance inference and training infrastructure.... ...systems for deploying and scaling ML workloads globally. - Work...Work at office
$120k - $200k
..., and many more. About the Role As a Senior Infrastructure Engineer at Bland, you'll help us to build the backbone that enables... ...systems that handle real-time voice processing, scale ML inference, and integrate with enterprise telephony infrastructure. Your...Work at officeNight shift- ...Infrastructure Engineer Chalk is building the data platform that powers the future of machine learning... ...that have traditionally constrained ML capabilities. Our platform combines Rust... ...optimize arbitrary user Python code, infers and orchestrates infrastructure implied...Work at officeFlexible hours
$320k - $405k
...group of committed researchers, engineers, policy experts, and... ...attribution story for non-accelerator infrastructure — the network, compute, and... ...between training clusters, inference fleets, and object storage... ...customer. * Familiarity with AI/ML infrastructure traffic...Contract workWork at officeVisa sponsorshipFlexible hours$150k - $200k
Senior Infrastructure Engineer Location: On-site, San Francisco, CA (3 days/week in office) Salary: $150k - $200k + equity Industry: AI, Cloud... .... Your work will directly impact platform reliability, ML inference performance, and the future of enterprise telephony. You’...Work at office3 days per week- ...an innovative GPU marketplace and AI inference service that promise affordability and... ...About the Role We're seeking a Senior Infrastructure Engineer to help build and scale Hyperbolic's GPU... ...and data infrastructure for AI/ML workloads, including object storage, high...Remote work
$300k
A rare infrastructure role in a frontier RL research operation. Compensation... ...‑funded AI company, a small engineering team works with top... ...enables researchers and applied ML engineers to run, debug, and... ...rollouts, training orchestration, inference, evaluation, data pipelines,...H1b- ...Member of Technical Staff for RL Infrastructure in San Francisco. This role... ...distributed RL training and inference across thousands of GPUs.... ...should have strong software engineering experience, particularly in... ...be able to work closely with ML researchers. #J-18808-...
$165k - $200k
...You'll Do As a member of our infrastructure team, you'll be at the heart... ...—acting as an infrastructure engineer one moment, and a developer,... ...applications (especially in ML/AI) in AWS and/or GCP. Development... ...machine learning inference service. Collaborating with...Second jobRemote workWork from homeRelocation packageFlexible hours$200k - $260k
Rebuild Matterhaul's infrastructure and core systems from zero — AWS, Kubernetes... ...choices that the rest of engineering will build on for years. Founding... ...Postgres logical replication. AI/ML infrastructure: vector stores, GPU‑backed inference, embedding pipelines, prompt/...Full timeWork at officeLocal area- ...Baseten powers mission-critical inference for the world's most dynamic... ...AI research, flexible infrastructure, and seamless developer tooling... ...and help build the platform engineers turn to to ship AI products.... ...k) Exposure to a variety of ML startups, offering unparalleled...Work experience placementWork at officeFlexible hours
$139.2k - $174k
...leading cloud services provider is looking for a Senior Engineer 2 to join their AI Infrastructure Control Plane team. This role involves architecting high... ...networking, along with significant experience in building AI/ML products. The position offers a compensation range of $1...Remote work- ...technology firm in San Francisco seeks an SW Engineer to enable production workloads and... ...Candidates should have strong experience in ML systems, performance engineering, and... ...opportunity to work with cutting-edge AI infrastructure in a vibrant team environment. #J-18808-...
$183.7k - $248.6k
...looking for a Senior Machine Learning Infrastructure Engineer to join our Vector Ads team, where we... ...operate the infrastructure that brings ML models from training into production,... ...feature serving, model versioning, and inference optimization What we're looking for...Work at officeRemote workWorldwideRelocation package$250k
...Ready to architect AI infrastructure that powers next-generation research... ...is now building a serverless inference platform, beginning with cost... ...a Senior Inference Platform Engineer at an early stage and help define... ...distributed systems (ML inference, HPC, or similar)....Permanent employment- ...Cohere is a team of researchers, engineers, designers, and more, who are... ...next generation of agentic AI infrastructure at Cohere. This team sits at the intersection of ML systems, distributed... ...emerging ML infrastructure, edge inference, or browser-native models Open...Remote jobFull timeWork at officeFlexible hours
- A cutting-edge AI infrastructure company in San Francisco seeks an experienced network engineer to optimize high-performance networking protocols for AI models. The ideal... ...will integrate RDMA and InfiniBand into the inference stack, ensuring efficient communication and...
- ...financial crime at an unprecedented scale. As a Senior Software Engineer, ML Infrastructure at TRM Labs, you will collaborate with data scientists,... ...concurrent models and users. Optimize high-throughput inference. Implement and tune serving systems that maximize token...Worldwide
$200k - $280k
Engineering San Francisco Full-time $200,000 - $280,000 About the Role Join our ML Infrastructure team to build the systems that train, deploy, and serve our AI models at scale.... ...Design model serving systems for low-latency inference Implement monitoring and observability...Full timeWork at office$160k - $250k
...Together AI is building the Inference Platform that brings the most... ...across data centers and model engine pods. Develop auto-scaling... ...responses. Collaborate with ML researchers to bring new... ...building the next generation AI infrastructure. Compensation We offer...Full timeLocal area- ...Network Engineer (Data Centers) Baseten powers mission-critical inference for the world's most dynamic AI companies. By uniting... ...applied AI research, flexible infrastructure, and seamless developer... ...~ Exposure to a variety of ML startups, offering unparalleled...Flexible hours
- ...humans once had to do. Role Overview We're looking for a Senior Infrastructure Engineer to own and evolve the foundational systems that power Casca'... ...and compliance are first‑class concerns. Familiarity with ML pipeline orchestration. Experience with multi‑tenant SaaS...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to ML Inference Infrastructure Engineer. Be the first to apply!
- graduate machine learning engineer San Francisco, CA
- machine learning engineer San Francisco, CA
- data scientist machine learning engineer San Francisco, CA
- junior machine learning research engineer San Francisco, CA
- senior ml engineer San Francisco, CA
- computer vision machine learning engineer San Francisco, CA
- ai ml engineer San Francisco, CA
- machine learning software engineer San Francisco, CA
- machine learning ai engineer San Francisco, CA
- security infrastructure engineer San Francisco, CA

