Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Engineer, Inference & RL Systems — Scale Production ML

$225k

Dormont Manufacturing Co

Dormont Manufacturing Co is looking for a Software Engineer on the Inference & RL Systems team in San Francisco. The role involves designing distributed systems, optimizing performance, and ensuring high reliability for RL and post-training workflows. The ideal candidate will possess strong software engineering fundamentals and experience with large-scale systems. Compensation includes a competitive salary range from $225K to $550K, along with equity, health benefits, and unlimited paid time off. #J-18808-Ljbffr Dormont Manufacturing Co

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Staff Engineer, Inference & RL Systems — Scale Production ML in San Francisco, CA vacancy
  •  ...Talent is seeking a Member of Technical Staff focused on ML Systems & Inference in San Francisco, California. This...  ...that improve latency and efficiency for production AI workloads. The ideal candidate has strong software engineering roots and experience in inference systems... 
    Suggested

    Acceler8 Talent

    San Francisco, CA
    1 day ago
  •  ...area of unprecedented scale and complexity. In no...  ...You Will Make: As a staff software engineer, you will lead two areas...  ...with different AI & ML engineering teams, cross...  ...of many AI driven products for our community....  ...teams to develop backend systems and enhance AI prompt... 
    Suggested
    Work experience placement
    Flexible hours

    airbnb, Inc.

    San Francisco, CA
    2 days ago
  •  ...class researchers and engineers across AI and biology....  ...next generation of AI systems for the life sciences....  ...agentic AI platform in production. You’ll design and implement...  ...to run reliably at scale — across cloud-native deployments...  ...Partner closely with ML and science teams to... 
    Suggested

    deCircle

    San Francisco, CA
    3 days ago
  •  ...seeking a Member of Technical Staff to develop AI-powered systems in San Francisco. In this...  ...optimize performance for production-grade solutions. The...  ...s in Computer Science or Engineering and has strong Python skills...  ...integrating with large-scale data. #J-18808-Ljbffr... 
    Suggested

    ERAGON

    San Francisco, CA
    7 days ago
  • $200k

     ...San Francisco is seeking a Software Engineer for their RL Research & Environments team. The role...  ...designing and improving data and evaluation systems to enhance model capabilities....  ...engineering background and experience in large-scale data or ML systems. Compensation ranges from $20... 
    Suggested

    SupportFinity™

    San Francisco, CA
    1 day ago
  • Sail Research in San Francisco is seeking a talented engineer to design and implement robust systems that ensure fast and cost-efficient AI inference at global scale. You will be responsible for building high-performance schedulers and optimizing global routing while focusing... 

    Sail Research

    San Francisco, CA
    4 days ago
  •  ...general-purpose AI systems that run...  ...services. We are scaling rapidly and need exceptional...  ...Opportunity Our Edge Inference team compiles...  ...understanding of both ML architectures and...  ...code ships to production and directly impacts...  ...Embedded software engineering experience or work... 

    Liquid AI

    San Francisco, CA
    1 day ago
  • A leading conversational AI platform in San Francisco seeks an AI/ML Engineer to build advanced systems for unprecedented performance. The ideal candidate will have over 8 years of experience and a strong track record in AI/ML projects. You'll design state-of-the-art methods... 

    Decagon

    San Francisco, CA
    15 hours ago
  •  ...talented developer to design and implement AI systems that navigate complex healthcare...  ...will build 24/7 AI co-workers and develop production-grade AI systems, directly impacting patient...  ...Python and Go, as well as experience with ML/NLP libraries and cloud platforms. Join a... 

    Usetrellis

    San Francisco, CA
    4 hours ago
  • $150k - $300k

    Prime Intellect is looking for a skilled ML Systems Engineer to build and optimize LLM serving infrastructure and inference systems. This hybrid role involves contributing to the scalability of their reinforcement learning training. Successful candidates will have over... 
    Relocation package

    Prime Intellect

    San Francisco, CA
    3 days ago
  • B Capital is seeking a skilled engineer for GPU infrastructure in San Francisco. This role involves designing and operating high-performance systems for model inference, synthetic data generation, and reinforcement learning. The ideal candidate has strong GPU systems experience... 

    B Capital

    San Francisco, CA
    15 hours ago
  •  ...seeking a Member of Technical Staff for their infrastructure team...  ...role, you will own the cloud systems that serve our compression...  ...latency, high-throughput GPU ML inference infrastructure. The ideal candidate...  ...track record in building production environments. Additional benefits... 
    Visa sponsorship

    The Token Company

    San Francisco, CA
    3 days ago
  • $300k

     ...technical problems at a scale few companies ever see....  ...We're doubling down on ML as the future of Grindr...  ...-scale recommendation systems to serve millions, balancing...  ..., iterate, and ship production-ready ML solutions that...  ..., collaborating with engineering, data science and product... 
    Casual work
    Work at office
    Immediate start
    Worldwide
    Flexible hours

    Grindr

    San Francisco, CA
    4 days ago
  •  ...4 by a team of former Scale AI engineers and operators. In less...  ...research, engineering, product, and operations minds...  ...About this role As a Staff Full Stack Engineer at...  ...daily. Build scalable systems processing terabytes of...  ...engineering, data engineering, ML, and signal processing... 
    Work at office

    David AI

    San Francisco, CA
    2 days ago
  •  ...-time analytics systems operate in production — influencing reliability...  ...across engineering. This team operates...  ...Infrastructure, ML, and engineering...  ...to DevOps to Staff-level SRE, and you...  ...center of high-scale ML systems —...  ...scale (training + inference) AWS or cloud‑managed... 

    Valid8 Financial, Inc.

    San Francisco, CA
    4 days ago
  •  ...Francisco, CA, is looking for a Member of Technical Staff to lead the development of AI-powered systems. This full-time, on-site role requires expertise in...  ...will manage projects from concept through to production, focusing on delivering scalable AI solutions for enterprise... 
    Full time

    David Joseph & Company

    San Francisco, CA
    15 hours ago
  • $207k - $290k

     ...enterprises don't scale expertise—...  ...bringing AI systems to market...  ...actually run in production, handle real...  ...experienced AI Engineer with deep...  ...Reinforcement Learning (RL) to join our...  ...as a Senior Staff Architect. In...  ...in AI/ML engineering,...  ..., including inference-time search,... 
    Worldwide
    Flexible hours

    JazzX AI

    San Francisco, CA
    11 days ago
  • A leading AI research firm in San Francisco is seeking a Member of Technical Staff specialized in Model Efficiency. In this role, you will enhance LLM inference systems by tackling performance issues and collaborating with cross-functional teams. Ideal candidates have over... 
    Remote work

    Cohere

    San Francisco, CA
    4 days ago
  •  ...talented developers to design and build decision-making AI systems for healthcare. You will create production-grade frameworks that assist in complex reimbursement...  ..., expertise in Python, Go, and familiarity with ML/NLP libraries. Join a world-class team making significant... 

    Scritch, Inc.

    San Francisco, CA
    1 day ago
  •  ...Staff AI Engineer Goodfin is an AI-native wealth platform...  ...intelligent, agentic systems that help accredited investors...  ..."feature." AI is the product. As a Staff AI...  ...like when traditional ML metrics fall short....  ...architectural patterns that scale with product... 

    goodfin

    San Francisco, CA
    1 day ago
  •  ...hypotheticals. We are in production, generating real...  ...looking for a hands-on Staff Engineer to own security and compliance...  ...Data security for AI systems — keeping agentic and...  ...tenant isolation at scale. • Data security for...  ...securing AI/ML or LLM-based systems in... 
    Full time
    H1b
    Work at office

    interface.ai

    San Francisco, CA
    2 days ago
  •  ...Role You will own technical systems across the stack. We're a small team, post-product-market-fit, and growing. We...  ...sense, PyTorch over legacy ML frameworks. The Work GPU inference : We run our own ASR models...  ...and operating systems at scale Strong opinions about architecture... 

    Aqua Voice

    San Francisco, CA
    3 days ago
  • $220k - $240k

     ...world for all. The AI Engineering Team is chartered...  ...LLMs) and agentic systems. Our mission is to...  ..., safety, and scale. We manage petabyte...  ...needed to make AI production-ready. We’re also...  ...the market. As a Staff MLOps Engineer with...  ...infrastructure for AI/ML systems. You will:... 
    Remote work
    Worldwide

    TRM Labs

    San Francisco, CA
    1 day ago
  • Jaide Health is seeking an engineer for their Model Efficiency team in San Francisco. The role focuses on building reliable ML systems while enhancing core performance metrics across model...  ...or Python and insights into the LLM inference ecosystem. A commitment to diversity... 
    Remote job

    Jaide Health

    San Francisco, CA
    2 days ago
  •  ...justice they deserve. Our products enable law firms to secure...  ...more at Join EvenUp as a Staff Machine Learning Engineer and build the future of...  ...innovation, mentor a top-tier ML team, and set the vision...  ...and architecture of large-scale ML systems for RAG, vector search, and... 
    Full time
    Temporary work
    Work at office
    Local area
    Home office
    Flexible hours

    EvenUp Inc.

    San Francisco, CA
    2 days ago
  •  ...is to build the shared ML and AI infrastructure...  ...capabilities across Plaid’s product suite. We develop the foundational systems, models, and data...  ...— from large‑scale data curation and model...  ...innovation across Plaid. As a Staff Machine Learning Engineer, you will lead the... 
    Work experience placement
    Local area
    Immediate start

    Plaid Inc

    San Francisco, CA
    2 days ago
  • A tech-first company is seeking a Member of Technical Staff to focus on cutting-edge AI research and development. The role involves building and scaling training and inference infrastructure, designing ML kernels, and optimizing performance. Ideal candidates should have... 

    Mirendil

    San Francisco, CA
    4 days ago
  • $279.2k - $390.9k

    Team The ML Indexing & Retrieval Platform team at...  ...responsible for building and scaling the core infrastructure...  ...We design and maintain systems for ML data ingestion,...  ...from ideation to production and going beyond incremental...  ...Indexing & Retrieval engine, integrating... 
    For contractors
    Work experience placement
    Flexible hours

    Tensec

    San Francisco, CA
    4 days ago
  • $170k - $190k

    About the Role As a Staff Systems Engineer, Automations & Integrations , you will be a hands-on technical leader designing and delivering enterprise...  ..., robust error handling, and governed data flows that scale safely and reliably. This is an individual contributor role... 
    Full time

    Ironclad Inc.

    San Francisco, CA
    1 day ago
  • $273k - $345k

     ...and transport. Our systems are designed to understand...  ...scalable, and more productive. This work...  ...until they work at scale. We are roboticists, engineers, operators, and...  ...develop cutting edge RL and distillation techniques...  ...Profile real-time inference pipelines to... 
    Full time
    Internship
    Work at office
    Flexible hours

    Atoms

    San Francisco, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Engineer, Inference & RL Systems — Scale Production ML. Be the first to apply!