ML Infrastructure Engineer - Model Inference & Scale

Abridge

A healthcare technology firm in San Francisco is seeking an ML Infrastructure Engineer, Model Inference to build and optimize AI-driven solutions. You will design scalable Kubernetes clusters, enhance ML model serving infrastructure, and collaborate with cross-functional teams. Ideal candidates have strong experience in deploying models in production environments and expertise in Kubernetes. This innovative firm promotes a culture of ownership and offers comprehensive benefits for personal and professional growth. #J-18808-Ljbffr

Apply

Vacancy posted 1 day ago

Similar jobs that could be interesting for youBased on the ML Infrastructure Engineer - Model Inference & Scale in San Francisco, CA vacancy

Staff ML Infrastructure Engineer: Scale Training & Inference
$300k - $430k
...team. About the Team The ML Infrastructure team builds the... ...every stage of Decagon's model lifecycle. We own the... ...routing layer that manages inference across multiple... ...Staff ML Infrastructure Engineer to own the platforms powering... ...and post-training at scale Implement and...
Suggested
Work at office
Decagon
San Francisco, CA
11 hours ago
Machine Learning Infrastructure Engineer- Model Inference
...scientists PhDs creatives technologists and engineers working together to empower people... ...Pittsburgh. The Role As an ML Infrastructure Engineer Model Inference at Abridge youll play a pivotal... ...with ML and product teams to scale backend infrastructure for AI-driven...
Suggested
Hourly pay
Full time
Flexible hours
Abridge
San Francisco, CA
10 days ago
Founding ML Infra Engineer: Scale Real-Time Inference
...URun in San Francisco is searching for an ML Infrastructure and Platform Engineer. In this role, you will lead the architecture and scaling of our GPU compute platform from the... ...ensuring high availability and low-latency inference. This is a founding technical hire...
Suggested
U-Run
San Francisco, CA
11 hours ago
ML Infra Engineer: Scale GPU Training & Inference
...Reducto, a fast-growing AI company in San Francisco, is hiring a Machine Learning Infra Engineer. This role involves building and maintaining the training and inference frameworks necessary for optimal performance. Ideal candidates should possess strong Python skills,...
Suggested
Reducto
San Francisco, CA
11 hours ago
Staff ML Inference Engineer — Model Efficiency (Remote)
Jaide Health is seeking an engineer for their Model Efficiency team in San Francisco. The role focuses on building reliable ML systems while enhancing core performance metrics across... ...C++ or Python and insights into the LLM inference ecosystem. A commitment to diversity and...
Suggested
Remote job
Jaide Health
San Francisco, CA
4 days ago
ML Inference Infrastructure Engineer
A dynamic AI company is seeking an Infrastructure Software Engineer in San Francisco to build and maintain components of an ML inference platform. The successful candidate will develop... ..., and enhance monitoring systems for model metrics. This role offers competitive compensation...
Baseten
San Francisco, CA
1 day ago
ML Infrastructure Engineer — Scale Training Pipelines
...to take on a hands-on role focused on scaling and optimizing ML training systems. Key responsibilities include owning the training infrastructure, improving performance, and managing... ...candidates will have strong software engineering foundations, hands-on experience in JAX...
Physical Intelligence
San Francisco, CA
4 days ago
ML Infrastructure Engineer Large-Scale AI Systems
...A leading AI research organization in San Francisco seeks an Infrastructure Engineer to design and maintain large distributed ML training and inference clusters. The ideal candidate will have a strong grasp of optimizing training workloads and experience with distributed...
Causal Labs
San Francisco, CA
3 days ago
ML Model Serving Engineer
...variety of LLM, speech, and vision models. Partner with ML infrastructure and training engineers to build a fast, cost-effective... ...and custom kernels to speed up inference. Find ways to reduce model... .... Bottleneck analysis in high-scale server systems or profiling low...
Full time
Contract work
Flexible hours
SESAME
San Francisco, CA
3 days ago
Senior ML Platform Engineer - Remote, Scalable Inference
$230k - $265k
...Parafin is seeking a Software Engineer to lead the evolution of their ML Platform, ensuring robust and scalable systems for data scientists. The role... ...maintain core platform functionalities, enhance real-time inference processes, and collaborate across teams to ensure...
Remote work
Parafin Inc
San Francisco, CA
1 day ago
Remote ML Platform Engineer - Scale AI Infrastructure
...shopping platform is looking for an AI/ML Platform Engineer to shape the future of AI and ML... ...systems. This role involves designing the infrastructure that powers machine learning... ...working alongside experts to deploy models at scale. Candidates should have extensive experience...
Remote work
Flexible hours
Whatnot
San Francisco, CA
1 day ago
Senior ML Infra Engineer Scale ML Platforms & Data
...neurotechnology is seeking a Senior Machine Learning Infrastructure Engineer to design and scale critical infrastructure powering ML applications. This role involves creating robust data pipelines and optimizing modeling processes, essential for developing innovative...
Echo Neurotechnologies
San Francisco, CA
11 hours ago
ML Infrastructure Engineer
$250k - $350k
...Most AI roles build on top of models. This one builds what makes... ...work. We’re hiring ML Infrastructure Engineers to tackle a hard, real-world... ...using wearable devices, large-scale video, and AI. This isn’t clean... ...hours of data Training and inference systems for multimodal /...
Trades Workforce Solutions
San Francisco, CA
11 hours ago
ML Infrastructure Engineer
$200k - $280k
...Engineering San Francisco Full-time $200,000 - $280,000 About the Role Join our ML Infrastructure team to build the systems that train, deploy, and serve our AI models at scale. You'll work at the intersection of machine... ...for low-latency inference Implement monitoring...
Full time
Work at office
Lattice
San Francisco, CA
11 hours ago
Cloud-Scale AI Inference Architect
...workloads effectively. The role involves designing large-scale deployment architectures, solving AI inference challenges, and collaborating closely with... ...teams. Ideal candidates will have 3+ years in cloud infrastructure or DevOps, strong skills in Kubernetes, Docker,...
Flexible hours
FriendliAI
San Francisco, CA
11 hours ago
ML Infrastructure Engineer
...Accelerated AI Server Engineer Sygaldry... ...speed up training and inference for AI. By... ...combination of cost, scale, and speed necessary... ...They need compute infrastructure that stays out of... ...numerical optimization, model training, tensor... ...) Python-based ML and scientific...
Casual work
Local area
Visa sponsorship
Sygaldry
San Francisco, CA
3 days ago
ML Infrastructure Engineer, Safeguards
$320k - $405k
...committed researchers, engineers, policy experts,... ...Machine Learning Infrastructure Engineer to join... ...you'll build and scale the critical infrastructure... ...and implement ML infrastructure... ...values, ensuring our models operate safely as... ...Optimize inference latency and throughput...
Work at office
Visa sponsorship
Flexible hours
Anthropic
San Francisco, CA
2 days ago
Founding ML infrastructure Engineer
...problem we saw Most AI infrastructure is built for batch:... ...that hold state, models that stay alive... ...to deliver that at scale doesn't really exist... ...fix it uRun is the inference cloud for interactive... .... As our ML Infrastructure and Platform Engineer, you will own the architecture...
Flexible hours
Shift work
U-Run
San Francisco, CA
12 hours ago
ML Infra Engineer: Scale Training & Inference (Hybrid)
...A leading technology company is looking for an ML Infrastructure Engineer in San Francisco. The successful candidate will build and maintain ML training pipelines and ensure low-latency model serving. Candidates should have over 4 years of experience in ML engineering...
Work at office
Lattice
San Francisco, CA
11 hours ago
Data/ML Infrastructure Engineer
...We are seeking a Data Infrastructure Engineer to build and operate the... ...production datasets, models, and customer-facing... ...complexity, and product usage scale. What You'll Do... ...scalable data and ML infrastructure on AWS,... ...training, evaluation, batch inference, or model deployment...
Permanent employment
Full time
Matter Intelligence
San Francisco, CA
2 days ago
Senior GPU ML Infra Engineer — Mid-Training & Inference
...based in San Francisco is seeking a specialist to design and operate large-scale GPU infrastructure. This role requires expertise in deploying GPU systems for high-throughput inference and model performance optimization. The ideal candidate will have hands-on experience...
Reflection AI
San Francisco, CA
1 day ago
Staff, ML Infrastructure Engineer
$227.2k - $324.5k
...the Role: This Software Engineering team works closely... ...The team’s efforts take inference systems to the next level... ...latency. Work with ML engineers to... ...online microservices at scale with low‑latency serving... ...the machine‑learning infrastructure. Previous experience...
Full time
Flexible hours
Tubi Tv
San Francisco, CA
10 hours ago
ML Infrastructure Engineer
$180k - $250k
...building the pre-model intelligence layer... ...developing the context engine layer that solves... ...squared scaling inherent to attention... ...come from better infrastructure around models: Better... ...PhD in Robotics and ML. Clark Zhang, CTO... ...pipelines, inference/serving systems, data...
Full time
Graphon.AI
San Francisco, CA
1 day ago
Engineering Manager, Model Inference
...medicine—and the inference systems that power... ...re looking for an Engineering Manager to lead and grow our Model Inference team. The... ..., high-throughput infrastructure to pushing the frontier... ...closely with ML Research and the broader... ...on building and scaling infrastructure for...
Hourly pay
Full time
Flexible hours
AI Chopping Block, Inc.
San Francisco, CA
1 day ago
Engineering Manager, Model Routing & Inference Engineering San Francisco Apply
...combination of inventive research, design, and engineering. Our organization is very flat, and... .... About the Role You will lead the Model Routing & Inference team at Cursor, owning the inference... ..., and more cost‑effective at a scale few teams in the world get to operate...
Anysphere
San Francisco, CA
11 hours ago
Real-Time Inference & Model Serving Engineer (Equity)
$220k - $320k
...ML Model Serving Engineer Want to build the layer that actually makes AI usable... ...ll join a team focused on inference, where performance is the... ...instantly, reliably, and at scale. That means solving hard... ...working across model serving, infrastructure, and performance...
3 days per week
Trades Workforce Solutions
San Francisco, CA
12 hours ago
Senior Model Inference Engineer for Production-Scale AI
$325k
...leading AI research company in San Francisco seeks an engineer to optimize their powerful AI models for high-volume production environments. The ideal candidate... ...engineering experience, strong familiarity with ML architectures, and experience with distributed systems....
OpenAI
San Francisco, CA
11 hours ago
ML Runtime & Vision Inference Engineer (Edge/Cloud)
...Zensors is seeking a Machine Learning Engineer specializing in ML Runtime & Optimization to enhance our visual sensing platform. This role involves developing technologies that improve computer vision models critical for smart spaces and cities. Your responsibilities include...
Zensors
San Francisco, CA
10 hours ago
ML Infrastructure Engineer Scale ML Pipelines & Cloud (Equity)
...Delphina-Hotels- is looking for an experienced ML Infrastructure Engineer to join their Technical Staff in San Francisco. In this pivotal role,... ...include developing platforms for ML jobs, establishing CI/CD models, and leading cross-functional initiatives. Candidates should...
Delphina-Hotels-
San Francisco, CA
12 hours ago
Machine Learning Engineer - Speech Model Training
$250k - $300k
...Machine Learning Engineer - Speech Model Training $250,000 -... ...through to production inference on edge devices. At a... ...Design and train large-scale speech models end-to-... ...in distributed infrastructure and ship solutions... ...traversing the entire ML stack from signal processing...
Permanent employment
Full time
Work at office
Immediate start
Worldwide
DeepRec.ai
San Francisco, CA
1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to ML Infrastructure Engineer - Model Inference & Scale. Be the first to apply!