ML Infrastructure Engineer - Model Inference & Scale
Abridge
A healthcare technology firm in San Francisco is seeking an ML Infrastructure Engineer, Model Inference to build and optimize AI-driven solutions. You will design scalable Kubernetes clusters, enhance ML model serving infrastructure, and collaborate with cross-functional teams. Ideal candidates have strong experience in deploying models in production environments and expertise in Kubernetes. This innovative firm promotes a culture of ownership and offers comprehensive benefits for personal and professional growth. #J-18808-Ljbffr
$300k - $430k
...team. About the Team The ML Infrastructure team builds the... ...every stage of Decagon's model lifecycle. We own the... ...routing layer that manages inference across multiple... ...Staff ML Infrastructure Engineer to own the platforms powering... ...and post-training at scale Implement and...SuggestedWork at office- ...scientists PhDs creatives technologists and engineers working together to empower people... ...Pittsburgh. The Role As an ML Infrastructure Engineer Model Inference at Abridge youll play a pivotal... ...with ML and product teams to scale backend infrastructure for AI-driven...SuggestedHourly payFull timeFlexible hours
- ...URun in San Francisco is searching for an ML Infrastructure and Platform Engineer. In this role, you will lead the architecture and scaling of our GPU compute platform from the... ...ensuring high availability and low-latency inference. This is a founding technical hire...Suggested
- ...Reducto, a fast-growing AI company in San Francisco, is hiring a Machine Learning Infra Engineer. This role involves building and maintaining the training and inference frameworks necessary for optimal performance. Ideal candidates should possess strong Python skills,...Suggested
- Jaide Health is seeking an engineer for their Model Efficiency team in San Francisco. The role focuses on building reliable ML systems while enhancing core performance metrics across... ...C++ or Python and insights into the LLM inference ecosystem. A commitment to diversity and...SuggestedRemote job
- A dynamic AI company is seeking an Infrastructure Software Engineer in San Francisco to build and maintain components of an ML inference platform. The successful candidate will develop... ..., and enhance monitoring systems for model metrics. This role offers competitive compensation...
- ...to take on a hands-on role focused on scaling and optimizing ML training systems. Key responsibilities include owning the training infrastructure, improving performance, and managing... ...candidates will have strong software engineering foundations, hands-on experience in JAX...
- ...A leading AI research organization in San Francisco seeks an Infrastructure Engineer to design and maintain large distributed ML training and inference clusters. The ideal candidate will have a strong grasp of optimizing training workloads and experience with distributed...
- ...variety of LLM, speech, and vision models. Partner with ML infrastructure and training engineers to build a fast, cost-effective... ...and custom kernels to speed up inference. Find ways to reduce model... .... Bottleneck analysis in high-scale server systems or profiling low...Full timeContract workFlexible hours
$230k - $265k
...Parafin is seeking a Software Engineer to lead the evolution of their ML Platform, ensuring robust and scalable systems for data scientists. The role... ...maintain core platform functionalities, enhance real-time inference processes, and collaborate across teams to ensure...Remote work- ...shopping platform is looking for an AI/ML Platform Engineer to shape the future of AI and ML... ...systems. This role involves designing the infrastructure that powers machine learning... ...working alongside experts to deploy models at scale. Candidates should have extensive experience...Remote workFlexible hours
- ...neurotechnology is seeking a Senior Machine Learning Infrastructure Engineer to design and scale critical infrastructure powering ML applications. This role involves creating robust data pipelines and optimizing modeling processes, essential for developing innovative...
$250k - $350k
...Most AI roles build on top of models. This one builds what makes... ...work. We’re hiring ML Infrastructure Engineers to tackle a hard, real-world... ...using wearable devices, large-scale video, and AI. This isn’t clean... ...hours of data Training and inference systems for multimodal /...$200k - $280k
...Engineering San Francisco Full-time $200,000 - $280,000 About the Role Join our ML Infrastructure team to build the systems that train, deploy, and serve our AI models at scale. You'll work at the intersection of machine... ...for low-latency inference Implement monitoring...Full timeWork at office- ...workloads effectively. The role involves designing large-scale deployment architectures, solving AI inference challenges, and collaborating closely with... ...teams. Ideal candidates will have 3+ years in cloud infrastructure or DevOps, strong skills in Kubernetes, Docker,...Flexible hours
- ...Accelerated AI Server Engineer Sygaldry... ...speed up training and inference for AI. By... ...combination of cost, scale, and speed necessary... ...They need compute infrastructure that stays out of... ...numerical optimization, model training, tensor... ...) Python-based ML and scientific...Casual workLocal areaVisa sponsorship
$320k - $405k
...committed researchers, engineers, policy experts,... ...Machine Learning Infrastructure Engineer to join... ...you'll build and scale the critical infrastructure... ...and implement ML infrastructure... ...values, ensuring our models operate safely as... ...Optimize inference latency and throughput...Work at officeVisa sponsorshipFlexible hours- ...problem we saw Most AI infrastructure is built for batch:... ...that hold state, models that stay alive... ...to deliver that at scale doesn't really exist... ...fix it uRun is the inference cloud for interactive... .... As our ML Infrastructure and Platform Engineer, you will own the architecture...Flexible hoursShift work
- ...A leading technology company is looking for an ML Infrastructure Engineer in San Francisco. The successful candidate will build and maintain ML training pipelines and ensure low-latency model serving. Candidates should have over 4 years of experience in ML engineering...Work at office
- ...We are seeking a Data Infrastructure Engineer to build and operate the... ...production datasets, models, and customer-facing... ...complexity, and product usage scale. What You'll Do... ...scalable data and ML infrastructure on AWS,... ...training, evaluation, batch inference, or model deployment...Permanent employmentFull time
- ...based in San Francisco is seeking a specialist to design and operate large-scale GPU infrastructure. This role requires expertise in deploying GPU systems for high-throughput inference and model performance optimization. The ideal candidate will have hands-on experience...
$227.2k - $324.5k
...the Role: This Software Engineering team works closely... ...The team’s efforts take inference systems to the next level... ...latency. Work with ML engineers to... ...online microservices at scale with low‑latency serving... ...the machine‑learning infrastructure. Previous experience...Full timeFlexible hours$180k - $250k
...building the pre-model intelligence layer... ...developing the context engine layer that solves... ...squared scaling inherent to attention... ...come from better infrastructure around models: Better... ...PhD in Robotics and ML. Clark Zhang, CTO... ...pipelines, inference/serving systems, data...Full time- ...medicine—and the inference systems that power... ...re looking for an Engineering Manager to lead and grow our Model Inference team. The... ..., high-throughput infrastructure to pushing the frontier... ...closely with ML Research and the broader... ...on building and scaling infrastructure for...Hourly payFull timeFlexible hours
- ...combination of inventive research, design, and engineering. Our organization is very flat, and... .... About the Role You will lead the Model Routing & Inference team at Cursor, owning the inference... ..., and more cost‑effective at a scale few teams in the world get to operate...
$220k - $320k
...ML Model Serving Engineer Want to build the layer that actually makes AI usable... ...ll join a team focused on inference, where performance is the... ...instantly, reliably, and at scale. That means solving hard... ...working across model serving, infrastructure, and performance...3 days per week$325k
...leading AI research company in San Francisco seeks an engineer to optimize their powerful AI models for high-volume production environments. The ideal candidate... ...engineering experience, strong familiarity with ML architectures, and experience with distributed systems....- ...Zensors is seeking a Machine Learning Engineer specializing in ML Runtime & Optimization to enhance our visual sensing platform. This role involves developing technologies that improve computer vision models critical for smart spaces and cities. Your responsibilities include...
- ...Delphina-Hotels- is looking for an experienced ML Infrastructure Engineer to join their Technical Staff in San Francisco. In this pivotal role,... ...include developing platforms for ML jobs, establishing CI/CD models, and leading cross-functional initiatives. Candidates should...
$250k - $300k
...Machine Learning Engineer - Speech Model Training $250,000 -... ...through to production inference on edge devices. At a... ...Design and train large-scale speech models end-to-... ...in distributed infrastructure and ship solutions... ...traversing the entire ML stack from signal processing...Permanent employmentFull timeWork at officeImmediate startWorldwide
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to ML Infrastructure Engineer - Model Inference & Scale. Be the first to apply!
- machine learning software engineer San Francisco, CA
- ai ml engineer San Francisco, CA
- graduate machine learning engineer San Francisco, CA
- computer vision machine learning engineer San Francisco, CA
- machine learning engineer San Francisco, CA
- senior ml engineer San Francisco, CA
- junior machine learning research engineer San Francisco, CA
- machine learning ai engineer San Francisco, CA
- data scientist machine learning engineer San Francisco, CA
- security infrastructure engineer San Francisco, CA

