Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Model Inference Engineer for Production-Scale AI

$325k

Jobleads-US

A leading AI research company in San Francisco seeks an engineer to optimize their powerful AI models for high-volume production environments. The ideal candidate has over 5 years of software engineering experience, strong familiarity with ML architectures, and experience with distributed systems. This role involves collaboration with researchers and focus on performance optimization. Compensation ranges from $325K to $490K. #J-18808-Ljbffr Jobleads-US

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Senior Model Inference Engineer for Production-Scale AI in San Francisco, CA vacancy
  • MakerMaker.AI is looking for a Senior Machine Learning Systems Engineer in San Francisco. In this role, you will build and operate production inference systems, optimizing for performance and reliability. The ideal candidate will have 3+ years of experience in production... 
    Senior

    MakerMaker.AI

    San Francisco, CA
    1 day ago
  • $220k - $320k

    ML Model Serving Engineer Want to build the layer that actually makes AI usable in real time? You’ll join a team focused on inference, where performance is the product. This is about delivering low-latency, high-throughput...  ..., reliably, and at scale. That means solving hard... 
    Suggested
    3 days per week

    Trades Workforce Solutions

    San Francisco, CA
    2 days ago
  • $220k - $320k

     ...Help us make inference blazingly fast. If you...  ...optimization techniques into production systems, we'd love...  ...language models for companies that...  ...frontier-quality AI at a fraction of the...  ..., and planet-scale hosting. We are a...  ...ten-person team of engineers who work in-person... 
    Senior
    Work at office

    Inference

    San Francisco, CA
    3 days ago
  • B Capital is looking for a Production Support Engineer in San Francisco. You'll play a key role in ensuring the reliability of the Agentforce Supply Chain platform and work with an agile team on scaling the product and automating infrastructure. The ideal candidate has... 
    Senior

    B Capital

    San Francisco, CA
    4 days ago
  • A dynamic AI company in San Francisco is looking for an Applied AI Inference Engineer to develop and deploy high-scale production AI applications. You will partner with customers to transform business goals into reliable services while engaging in software development and... 
    Suggested
    Flexible hours

    Baseten

    San Francisco, CA
    4 days ago
  • $167.2k - $209k

     ...DigitalOcean is expanding its AI Infrastructure...  .... We are seeking a Senior Engineer 2 to join our AI Inference Data Plane team. In...  ...delivering high-scale, resilient data...  ...deploy and scale their models with industry-...  ...availing of all the productivity boosts granted by the... 
    Senior
    Local area
    Remote work
    Worldwide
    Flexible hours

    DigitalOcean

    San Francisco, CA
    3 days ago
  • Clera is seeking a Senior AI/ML Engineer to build production-grade ML infrastructure. You will design and ship end-to-end ML systems including data pipelines, training, and deployment. The role requires 4+ years of applied ML engineering experience in production settings... 
    Senior
    Full time

    Clera

    San Francisco, CA
    2 days ago
  • Intersect is looking for a Civil Engineer to provide civil and structural design guidance for grid-scale solar and data center projects in Canada. The role involves collaborating with various teams while ensuring code-compliant designs. Candidates should have a Bachelor... 
    Senior

    Intersect

    San Francisco, CA
    2 days ago
  •  ...seeking a talented software engineer to join their dynamic Inference team. This role involves...  ...implementing infrastructure for large-scale multimodal models, focusing on high-...  ...closely with researchers and product teams to push the boundaries of AI technology, ensuring... 

    Jobleads-US

    San Francisco, CA
    4 days ago
  • Anyscale is seeking a Distributed LLM Inference Engineer in San Francisco, California. This pivotal role involves pushing the boundaries of performance for ML inference at scale. You'll work closely with product teams to deliver end-to-end solutions while leveraging open... 

    Anyscale

    San Francisco, CA
    4 days ago
  • A leading data and AI company in San Francisco is seeking a Senior Engineer to enhance their Model Serving platform. This role requires expertise in building large-scale distributed systems and collaboration across teams to optimize performance and reliability. Ideal candidates... 
    Senior

    Jobleads-US

    San Francisco, CA
    1 day ago
  •  ...in San Francisco is seeking a Senior Engineer to build infrastructure that...  ...enhances the efficiency of multiple product teams. The role requires over...  ...in distributed systems and scaling production environments....  ...environment, with a hybrid work model requiring in-office presence... 
    Senior
    Work at office
    3 days per week

    Nooks

    San Francisco, CA
    4 days ago
  • YO IT Consulting is seeking an experienced Senior Civil Engineer specializing in evaluating AI-generated content. This remote role involves ensuring technical accuracy, challenging AI models with real-world engineering scenarios, and shaping AI communication standards.... 
    Senior
    Remote job

    YO IT Consulting

    San Francisco, CA
    2 days ago
  • $166.9k - $225.9k

     ...operates as both a central engineering function and an...  ...partner for one of Drata's product engineering teams across...  ...stack to help Drata scale reliably for a rapidly...  ...Experience with AIOps—using AI/ML‑based tooling for...  ...backed services (e.g., LLM inference latency, non‑... 
    Senior
    Flexible hours

    Drata

    San Francisco, CA
    3 days ago
  •  ...firm in San Francisco is seeking an ML Infrastructure Engineer, Model Inference to build and optimize AI-driven solutions. You will design scalable...  ...candidates have strong experience in deploying models in production environments and expertise in Kubernetes. This innovative... 

    Abridge

    San Francisco, CA
    2 days ago
  •  ...a Member of Technical Staff focused on AI Safety to lead red-teaming efforts and ensure...  ..., partner with researchers to define production safety standards, and research advanced...  ...expertise in LLM safety, strong software engineering skills, and relevant academic... 
    Senior

    Xcede

    San Francisco, CA
    3 days ago
  • A technology startup in San Francisco is seeking a skilled individual to enhance the API infrastructure supporting AI models. The role involves designing and optimizing backend services, focusing on performance and reliability. Candidates should have over 3 years of experience... 

    Baseten

    San Francisco, CA
    4 days ago
  • A tech startup focused on AI workloads is seeking a Member of Technical Staff to design and optimize inference systems. The role involves managing KV cache allocation and...  ...Ideal candidates should have strong software engineering skills and experience with ML inference... 
    Senior

    Gimlet Labs

    San Francisco, CA
    2 days ago
  • Magnitude, located in San Francisco, is seeking a Sr. GTM Engineer to reimagine and scale the go-to-market motion. In this vital role, you will architect AI-powered experiments and integrate tools to drive growth. The ideal candidate has over 6 years in GTM or growth engineering... 
    Senior
    Flexible hours

    Magnitude

    San Francisco, CA
    2 days ago
  • $200k - $400k

     ...Senior Software Engineer - Agentic Systems We are partnered with a highly technical AI research company building advanced AI...  ...funded, executing large‑scale programs, and...  ...combining large language models with structured...  ...workflows into real production toolchains. This... 
    Senior
    Full time
    Immediate start

    Strativ Group

    San Francisco, CA
    3 days ago
  • $142.7k - $270.95k

     ...ART is seeking a Senior researcher -...  ...Systems & Efficiency Engineer to join our R&D...  ...practical, production-ready improvements in inference performance, latency...  ...intersection of model architecture, systems...  ...Intelligence (AI), ML systems,...  ...implementing and scaling large-scale... 
    Senior
    Full time
    Temporary work
    Local area
    Worldwide

    Adobe

    San Francisco, CA
    2 days ago
  • $190k - $282k

     ...Production Engineer, Security Engineering Join to apply for the Production Engineer, Security Engineering...  ...at CoreWeave . CoreWeave is the AI Hyperscaler, delivering a cloud platform...  ..., or Go. ~ Experience managing large-scale distributed systems and ensuring their... 
    Senior
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    San Francisco, CA
    3 days ago
  • A synthetic fuel startup is seeking a Senior Controls Engineer who will be responsible for the reliability of systems and architecting infrastructures for a large-scale fleet expansion. Ideal candidates should have a strong background in PLC and SCADA systems, with experience... 
    Senior

    Jack & Jill/External ATS

    San Francisco, CA
    4 days ago
  • A forward-thinking tech company in San Francisco is looking for a skilled Backend Engineer to build and optimize large-scale systems for AI-powered applications. This role entails developing APIs, microservices, and data pipelines while collaborating closely with the tech... 
    Senior
    Remote job

    Linkup Inc

    San Francisco, CA
    4 days ago
  • A leading AI company in San Francisco seeks a Senior Support Engineer to provide technical guidance and operational oversight...  ...scripting skills, you will help scale support using automation and AI advancements...  ...The role involves a hybrid work model, ensuring flexibility while... 
    Senior

    OpenAI

    San Francisco, CA
    1 day ago
  • $175k - $200k

    A forward-thinking tech company based in San Francisco is seeking a Senior Full Stack Engineer to design large-scale data ingestion and processing systems. The successful candidate will have over 5 years of engineering experience and be comfortable with Python (Django)... 
    Senior

    Scribe

    San Francisco, CA
    1 day ago
  • $196k - $294k

     ...cloud infrastructure to build, scale, and secure a faster, more...  ...behind v0, Next.js, and AI SDK, Vercel helps customers...  ...the world to ship the best products. That starts with creating a...  ...can just ship things. As a Senior Software Engineer on our Trust & Safety team at... 
    Senior
    Work at office
    Remote work
    Work from home
    Monday to Friday
    Flexible hours

    Nerdleveltech

    San Francisco, CA
    3 days ago
  • $301.75k - $355k

     ...vertically integrated AI infrastructure...  ...believe in the scale of our ambition...  ...Role The Senior Director for the Model LifeCycle team will...  ...Science, Engineering, or a related field...  ...cutting‑edge AI products and solving challenging...  ...GPU systems and inference frameworks.... 
    Senior
    Temporary work

    Jobleads-US

    San Francisco, CA
    4 days ago
  • A leading AI company is seeking a Senior Software Engineer to design and build scalable infrastructure. The role involves working on both front-end and back...  ..., along with comprehensive benefits including health, dental, and generous PTO. #J-18808-Ljbffr Scale AI, Inc.
    Senior

    Scale AI, Inc.

    San Francisco, CA
    1 day ago
  • Chef Robotics is seeking a Senior Software Engineer to help scale their robotics platform. The role involves debugging low-level systems,...  ...position offers an opportunity to work on impactful AI solutions in food production, with a focus on integrating new hardware and... 
    Senior

    Israelvcforum

    San Francisco, CA
    15 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Model Inference Engineer for Production-Scale AI. Be the first to apply!