ML Infrastructure Engineer - Model Inference & Scale

Abridge

A healthcare technology firm in San Francisco is seeking an ML Infrastructure Engineer, Model Inference, to build and optimize AI-driven solutions. You will design scalable Kubernetes clusters, enhance ML model serving infrastructure, and collaborate with cross-functional teams. Ideal candidates have strong experience in deploying models in production environments and expertise in Kubernetes. This innovative firm promotes a culture of ownership and offers comprehensive benefits for personal and professional growth.

Vacancy posted 1 day ago
Similar jobs that could be interesting for you
Based on the ML Infrastructure Engineer - Model Inference & Scale in San Francisco, CA vacancy
  • $300k - $430k

     ...team. About the Team The ML Infrastructure team builds the...  ...every stage of Decagon's model lifecycle. We own the...  ...routing layer that manages inference across multiple...  ...Staff ML Infrastructure Engineer to own the platforms powering...  ...and post-training at scale Implement and... 
    Suggested
    Work at office

    Decagon

    San Francisco, CA
    11 hours ago
  •  ...scientists, PhDs, creatives, technologists, and engineers working together to empower people...  ...Pittsburgh. The Role As an ML Infrastructure Engineer, Model Inference at Abridge, you'll play a pivotal...  ...with ML and product teams to scale backend infrastructure for AI-driven... 
    Suggested
    Hourly pay
    Full time
    Flexible hours

    Abridge

    San Francisco, CA
    10 days ago
  •  ...URun in San Francisco is searching for an ML Infrastructure and Platform Engineer. In this role, you will lead the architecture and scaling of our GPU compute platform from the...  ...ensuring high availability and low-latency inference. This is a founding technical hire... 
    Suggested

    U-Run

    San Francisco, CA
    11 hours ago
  •  ...Reducto, a fast-growing AI company in San Francisco, is hiring a Machine Learning Infra Engineer. This role involves building and maintaining the training and inference frameworks necessary for optimal performance. Ideal candidates should possess strong Python skills,... 
    Suggested

    Reducto

    San Francisco, CA
    11 hours ago
  • Jaide Health is seeking an engineer for their Model Efficiency team in San Francisco. The role focuses on building reliable ML systems while enhancing core performance metrics across...  ...C++ or Python and insights into the LLM inference ecosystem. A commitment to diversity and... 
    Suggested
    Remote job

    Jaide Health

    San Francisco, CA
    4 days ago
  • A dynamic AI company is seeking an Infrastructure Software Engineer in San Francisco to build and maintain components of an ML inference platform. The successful candidate will develop...  ..., and enhance monitoring systems for model metrics. This role offers competitive compensation... 

    Baseten

    San Francisco, CA
    1 day ago
  •  ...to take on a hands-on role focused on scaling and optimizing ML training systems. Key responsibilities include owning the training infrastructure, improving performance, and managing...  ...candidates will have strong software engineering foundations, hands-on experience in JAX... 

    Physical Intelligence

    San Francisco, CA
    4 days ago
  •  ...A leading AI research organization in San Francisco seeks an Infrastructure Engineer to design and maintain large distributed ML training and inference clusters. The ideal candidate will have a strong grasp of optimizing training workloads and experience with distributed... 

    Causal Labs

    San Francisco, CA
    3 days ago
  •  ...variety of LLM, speech, and vision models. Partner with ML infrastructure and training engineers to build a fast, cost-effective...  ...and custom kernels to speed up inference. Find ways to reduce model...  .... Bottleneck analysis in high-scale server systems or profiling low... 
    Full time
    Contract work
    Flexible hours

    SESAME

    San Francisco, CA
    3 days ago
  • $230k - $265k

     ...Parafin is seeking a Software Engineer to lead the evolution of their ML Platform, ensuring robust and scalable systems for data scientists. The role...  ...maintain core platform functionalities, enhance real-time inference processes, and collaborate across teams to ensure... 
    Remote work

    Parafin Inc

    San Francisco, CA
    1 day ago
  •  ...shopping platform is looking for an AI/ML Platform Engineer to shape the future of AI and ML...  ...systems. This role involves designing the infrastructure that powers machine learning...  ...working alongside experts to deploy models at scale. Candidates should have extensive experience... 
    Remote work
    Flexible hours

    Whatnot

    San Francisco, CA
    1 day ago
  •  ...neurotechnology is seeking a Senior Machine Learning Infrastructure Engineer to design and scale critical infrastructure powering ML applications. This role involves creating robust data pipelines and optimizing modeling processes, essential for developing innovative... 

    Echo Neurotechnologies

    San Francisco, CA
    11 hours ago
  • $250k - $350k

     ...Most AI roles build on top of models. This one builds what makes...  ...work. We’re hiring ML Infrastructure Engineers to tackle a hard, real-world...  ...using wearable devices, large-scale video, and AI. This isn’t clean...  ...hours of data Training and inference systems for multimodal /... 

    Trades Workforce Solutions

    San Francisco, CA
    11 hours ago
  • $200k - $280k

     ...Engineering San Francisco Full-time $200,000 - $280,000 About the Role Join our ML Infrastructure team to build the systems that train, deploy, and serve our AI models at scale. You'll work at the intersection of machine...  ...for low-latency inference Implement monitoring... 
    Full time
    Work at office

    Lattice

    San Francisco, CA
    11 hours ago
  •  ...workloads effectively. The role involves designing large-scale deployment architectures, solving AI inference challenges, and collaborating closely with...  ...teams. Ideal candidates will have 3+ years in cloud infrastructure or DevOps, strong skills in Kubernetes, Docker,... 
    Flexible hours

    FriendliAI

    San Francisco, CA
    11 hours ago
  •  ...Accelerated AI Server Engineer Sygaldry...  ...speed up training and inference for AI. By...  ...combination of cost, scale, and speed necessary...  ...They need compute infrastructure that stays out of...  ...numerical optimization, model training, tensor...  ...) Python-based ML and scientific... 
    Casual work
    Local area
    Visa sponsorship

    Sygaldry

    San Francisco, CA
    3 days ago
  • $320k - $405k

     ...committed researchers, engineers, policy experts,...  ...Machine Learning Infrastructure Engineer to join...  ...you'll build and scale the critical infrastructure...  ...and implement ML infrastructure...  ...values, ensuring our models operate safely as...  ...Optimize inference latency and throughput... 
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    2 days ago
  •  ...problem we saw Most AI infrastructure is built for batch:...  ...that hold state, models that stay alive...  ...to deliver that at scale doesn't really exist...  ...fix it uRun is the inference cloud for interactive...  .... As our ML Infrastructure and Platform Engineer, you will own the architecture... 
    Flexible hours
    Shift work

    U-Run

    San Francisco, CA
    12 hours ago
  •  ...A leading technology company is looking for an ML Infrastructure Engineer in San Francisco. The successful candidate will build and maintain ML training pipelines and ensure low-latency model serving. Candidates should have over 4 years of experience in ML engineering... 
    Work at office

    Lattice

    San Francisco, CA
    11 hours ago
  •  ...We are seeking a Data Infrastructure Engineer to build and operate the...  ...production datasets, models, and customer-facing...  ...complexity, and product usage scale. What You'll Do...  ...scalable data and ML infrastructure on AWS,...  ...training, evaluation, batch inference, or model deployment... 
    Permanent employment
    Full time

    Matter Intelligence

    San Francisco, CA
    2 days ago
  •  ...based in San Francisco is seeking a specialist to design and operate large-scale GPU infrastructure. This role requires expertise in deploying GPU systems for high-throughput inference and model performance optimization. The ideal candidate will have hands-on experience... 

    Reflection AI

    San Francisco, CA
    1 day ago
  • $227.2k - $324.5k

     ...the Role: This Software Engineering team works closely...  ...The team’s efforts take inference systems to the next level...  ...latency. Work with ML engineers to...  ...online microservices at scale with low‑latency serving...  ...the machine‑learning infrastructure. Previous experience... 
    Full time
    Flexible hours

    Tubi Tv

    San Francisco, CA
    10 hours ago
  • $180k - $250k

     ...building the pre-model intelligence layer...  ...developing the context engine layer that solves...  ...squared scaling inherent to attention...  ...come from better infrastructure around models: Better...  ...PhD in Robotics and ML. Clark Zhang, CTO...  ...pipelines, inference/serving systems, data... 
    Full time

    Graphon.AI

    San Francisco, CA
    1 day ago
  •  ...medicine—and the inference systems that power...  ...re looking for an Engineering Manager to lead and grow our Model Inference team. The...  ..., high-throughput infrastructure to pushing the frontier...  ...closely with ML Research and the broader...  ...on building and scaling infrastructure for... 
    Hourly pay
    Full time
    Flexible hours

    AI Chopping Block, Inc.

    San Francisco, CA
    1 day ago
  •  ...combination of inventive research, design, and engineering. Our organization is very flat, and...  .... About the Role You will lead the Model Routing & Inference team at Cursor, owning the inference...  ..., and more cost‑effective at a scale few teams in the world get to operate... 

    Anysphere

    San Francisco, CA
    11 hours ago
  • $220k - $320k

     ...ML Model Serving Engineer Want to build the layer that actually makes AI usable...  ...ll join a team focused on inference, where performance is the...  ...instantly, reliably, and at scale. That means solving hard...  ...working across model serving, infrastructure, and performance... 
    3 days per week

    Trades Workforce Solutions

    San Francisco, CA
    12 hours ago
  • $325k

     ...leading AI research company in San Francisco seeks an engineer to optimize their powerful AI models for high-volume production environments. The ideal candidate...  ...engineering experience, strong familiarity with ML architectures, and experience with distributed systems.... 

    OpenAI

    San Francisco, CA
    11 hours ago
  •  ...Zensors is seeking a Machine Learning Engineer specializing in ML Runtime & Optimization to enhance our visual sensing platform. This role involves developing technologies that improve computer vision models critical for smart spaces and cities. Your responsibilities include... 

    Zensors

    San Francisco, CA
    10 hours ago
  •  ...Delphina-Hotels- is looking for an experienced ML Infrastructure Engineer to join their Technical Staff in San Francisco. In this pivotal role,...  ...include developing platforms for ML jobs, establishing CI/CD models, and leading cross-functional initiatives. Candidates should... 

    Delphina-Hotels-

    San Francisco, CA
    12 hours ago
  • $250k - $300k

     ...Machine Learning Engineer - Speech Model Training $250,000 -...  ...through to production inference on edge devices. At a...  ...Design and train large-scale speech models end-to-...  ...in distributed infrastructure and ship solutions...  ...traversing the entire ML stack from signal processing... 
    Permanent employment
    Full time
    Work at office
    Immediate start
    Worldwide

    DeepRec.ai

    San Francisco, CA
    1 day ago
