Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Software Engineer, ML Infrastructure

$300k - $430k

Decagon

About Decagon Decagon is the leading conversational AI platform empowering every brand to deliver concierge customer experiences. Our technology enables industry-defining enterprises like Avis Budget Group, Block’s Cash App and Square, Chime, Oura Health, and Hunter Douglas to deploy AI agents that power personalized, deeply satisfying interactions across voice, chat, email, SMS, and every other channel. We’re building a future where customer experiences are being redefined from support tickets and hold music to faster resolutions, richer conversations, and deeper relationships. We’re proud to be backed by world-class investors who share that vision, including a16z, Accel, Bain Capital Ventures, Coatue, and Index Ventures, along with many others. We’re an in-office company, driven by a shared commitment to excellence and velocity. Our values — Just Get It Done, Invent What Customers Want, Winner’s Mindset, and The Polymath Principle — shape how we work and grow as a team. About the Team The ML Infrastructure team builds the systems that power every stage of Decagon's model lifecycle. We own the platforms for model training, the infrastructure for model evaluation and experimentation, and the routing layer that manages inference across multiple providers. We work at the intersection of research and production: translating cutting-edge ML techniques into reliable, scalable systems that run in customer environments. We collaborate closely with Research, Infrastructure, and Product teams to ensure models train efficiently, serve reliably, and deliver exceptional user experiences. About the Role We're hiring a Staff ML Infrastructure Engineer to own the platforms powering Decagon's model training and inference. You'll build distributed training systems, design inference architecture across multiple providers, and create the frameworks that let our Research and Product teams ship faster. This role is for someone who thrives on technical depth, can lead multi-quarter initiatives, and wants to shape the long-term architecture of our ML stack. In this role, you will Design and build distributed training platforms for LLM and multimodal fine-tuning and post-training at scale Implement and integrate state-of-the‑art training algorithms into production pipelines Own inference architecture and multi‑provider routing, including failover and optimization Research and implement inference optimizations including quantization, speculative decoding, and batching strategies Lead initiatives to improve latency and cost efficiency across the training and serving stack Build evaluation and experimentation infrastructure that enables rapid, reliable iteration Drive technical direction, mentor engineers, and establish best practices for ML infrastructure Your background looks something like this 8+ years building ML infrastructure or production systems at scale Deep experience with distributed training: multi‑node GPU clusters, fault tolerance, and optimization Strong understanding of LLM inference: latency optimization, provider tradeoffs, and serving architecture Proficiency in Python and modern ML frameworks (PyTorch, JAX, or TensorFlow) Proven track record leading complex, multi‑quarter technical projects Benefits Medical, dental, and vision benefits Take what you need vacation policy Daily lunches, dinners and snacks in the office to keep you at your best Compensation $300K – $430K + Offers Equity #J-18808-Ljbffr

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Staff Software Engineer, ML Infrastructure in San Francisco, CA vacancy
  • $200k - $400k

     ...as a team. About the Team The Infrastructure team builds and operates the foundations that power Decagon: networking, data, ML serving, developer platform, and realtime...  ...We're hiring a Senior Infrastructure Engineer to design, build, and operate production... 
    Suggested
    Full time
    Work at office
    Local area

    Decagon

    San Francisco, CA
    4 days ago
  •  ...workflow compliance, process efficiency. Every new use case runs through the perception team. We're hiring a Staff Software Engineer to own ML Infrastructure at Voxel. Our applied ML team is shipping vision models into production every week, across thousands of... 
    Suggested
    Work at office
    Flexible hours

    Voxel Labs

    San Francisco, CA
    2 days ago
  • $236k - $290k

     ...we're just getting started. Role Overview As a Staff Software Engineer on the Core Infrastructure team at Harvey, you'll play a critical role in designing...  ...Have Experience building infrastructure for AI/ML workloads or high-throughput inference systems Background... 
    Suggested
    Relocation package

    Harvey

    San Francisco, CA
    4 days ago
  •  ...Learn more at Life as an Engineer at EvenUp Location & Work...  ...intersection of backend engineering, ML systems, and platform...  ...Scientists, ML Researchers, and Software Engineers, all working...  ...operate. By creating shared infrastructure and tooling, the AI Platform... 
    Suggested
    Temporary work
    Work at office
    Local area
    Home office
    Flexible hours
    3 days per week

    EvenUp Inc.

    San Francisco, CA
    5 days ago
  • $279.2k - $390.9k

     ...sources of information. Team: The ML Indexing & Retrieval Platform...  ...and scaling the core infrastructure that powers machine learning...  ...generation ML Indexing & Retrieval engine, integrating capabilities...  ...Be 10+ years of experience in software engineering, specializing in... 
    Suggested
    For contractors
    Work experience placement
    Remote work
    Flexible hours

    Tensec

    San Francisco, CA
    1 day ago
  •  ...Job Description Slack is looking for a Staff Software Engineer to join the Data Infrastructure team within the broader Data Engineering organization. The mission...  ...data infrastructure that powers Slack’s analytics, ML, and data‑driven decision‑making. Serve as Directly... 

    100 Salesforce, Inc.

    San Francisco, CA
    1 day ago
  • $197.3k - $313.7k

    ## Staff Software Engineer, Data InfrastructureApplyremote type: Office Tech-Flexiblelocations: California...  ...Software Engineer to join the **Data Infrastructure** team within the broader Data...  ...infrastructure powering Slack’s analytics, ML, and data-driven decision-making.*... 
    Permanent employment
    Work at office

    Slack Enterprise

    San Francisco, CA
    2 days ago
  • $190k - $260k

     ...world's best data and AI infrastructure platform so our...  ...business. Founded by engineers — and customer obsessed...  ...when necessary. As a Staff Engineer, you will operate...  ...Product, Machine Learning (ML) and Large Language...  ...machine learning and software engineering, coupled with... 
    Local area
    Worldwide

    Databricks

    San Francisco, CA
    1 day ago
  • $192.6k - $288.8k

     ...and growth across our ecosystem. As a Staff Software Engineer on the Feature Platform team, you'll play a critical role in building the infrastructure that powers machine learning,...  ...distributed systems, platform engineering, and ML infrastructure. This role goes beyond... 
    Work at office
    Worldwide
    Relocation package

    UNITY

    San Francisco, CA
    2 days ago
  •  ...computing and make it accessible to software developers of all skill levels...  ...data scientist can scale an ML application from their laptop...  ...Anyscale is seeking a Staff Software Engineer to lead the technical vision for our Infrastructure team. As a Staff Engineer, you... 

    Anyscale, Inc

    San Francisco, CA
    5 days ago
  •  ..., GNSS, IMU, etc. feed sensor fusion and ML models at the edge. Data is automatically...  ...Tech-savvy customers develop and deploy software directly to our dashcams to get realtime...  ...The Map AI Platform is the core data engine that powers our data customers, consumer... 
    Flexible hours

    Hivemapper

    San Francisco, CA
    4 days ago
  • $405k

     ...committed researchers, engineers, policy experts, and...  ...looking for experienced software engineers to join our...  ...across Anthropic, and own infrastructure and systems that teams...  ...Relevant experience: ML training infra,...  ...Currently, we expect all staff to be in one of our offices... 
    Currently hiring
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    5 days ago
  •  ...Staff ML Platform Engineer Grow Therapy is on a mission to serve as the trusted partner for therapists growing their practice, and patients...  ...capabilities and setting the bar for what excellent ML infrastructure looks like at Grow. Matching a client to the right therapist... 
    Full time
    Work at office
    Remote work
    Home office
    Flexible hours
    Day shift
    3 days per week

    Grow Therapy

    San Francisco, CA
    6 days ago
  •  ...Stripe is a financial infrastructure platform for businesses...  ...to innovate in ML Platform at Stripe....  ...services that enable ML engineers and data scientists across...  ...driven products. As a Staff Engineer, you'll make...  ..., and operating great software systems. Who you are... 
    Flexible hours

    Stripe

    San Francisco, CA
    5 days ago
  • $320k - $405k

     ...committed researchers, engineers, policy experts, and...  ...for backend / platform software engineers to join our...  ...critical development infrastructure that powers our AI product...  ...worked with NLP and ML models and understand...  ...Currently, we expect all staff to be in one of our... 
    Full time
    Currently hiring
    Work at office
    Visa sponsorship
    Flexible hours

    Menlo Ventures

    San Francisco, CA
    2 days ago
  •  ...Senior Software Engineer, Platform At David AI, our engineers build the pipelines, platforms...  ...tight-knit team of product engineers, infrastructure specialists, and machine learning experts...  ...boundaries to support a growing set of ML and product features. Build resilient... 
    Work at office

    David AI

    San Francisco, CA
    4 days ago
  • $160k - $250k

     ...automation startup serving US manufacturers. As a Senior/Staff Platform Engineer, you will build the software layer that powers autonomous manufacturing...  ...: Develop and productionize LLM-based agents, AI/ML infrastructure, and RAG patterns for intelligent quote analysis... 
    Visa sponsorship

    Clera

    San Francisco, CA
    3 days ago
  • $300 per month

     ...intelligence. We’re crafting the engine that powers a world...  ..., transformative cloud infrastructure. About This Role: The Crusoe Cloud Software Development team is...  ...and experienced Senior Staff Software Engineer specializing...  ...on optimizing for AI/ML workloads. This includes... 
    Full time
    Temporary work

    Crusoe Energy Systems LLC

    San Francisco, CA
    4 days ago
  • $232k - $313k

     ...and running the world's best data and AI infrastructure platform, so our customers can focus on...  ...growing SaaS companies in the world. Our engineering teams build highly technical products...  ...build the most trusted data analytics and ML platform in the world. Security Engineering... 
    Work at office
    Local area
    Worldwide
    Flexible hours

    I did my part and supported the Regular Toilet

    San Francisco, CA
    1 day ago
  • Staff Software Engineer - Machine Learning Platform (San Francisco) Replicate makes it easy for software...  ...need something custom. We handle the infrastructure, so you can focus on building. Our...  ...operate the latest advancements in the ML and AI space. Designing systems to maximize... 
    Full time
    Work at office
    Shift work
    3 days per week

    Replicate, Inc.

    San Francisco, CA
    5 days ago
  • $170k - $220k

     ...supply chain and enterprise software investors. We're live with manufacturers...  ...the Role You'll build the infrastructure for autonomous manufacturing...  ...LLM-based agents, AI/ML infrastructure, and RAG...  ...You Have • Very deep software engineering experience with a strong focus... 

    Tenkara Labs, Inc.

    San Francisco, CA
    1 day ago
  • $204k - $247k

    A technology company is looking for a Staff Software Engineer to join their Model LifeCycle team in San Francisco. The role requires significant expertise in AI and machine learning, with responsibilities including fine-tuning large models and implementing training pipelines... 

    Crusoe Energy Systems LLC

    San Francisco, CA
    15 hours ago
  • $192k - $260k

     ...and running the world's best data and AI infrastructure platform so our customers can use deep...  ...platform to deploy and manage AI/ML models - from traditional ML to fine-tuned...  ...strong SLAs and cost efficiency. As a Staff Engineer, you'll play a critical role in shaping... 
    Local area
    Worldwide

    Databricks

    San Francisco, CA
    3 days ago
  •  ...our unique combination of proprietary infrastructure and software, we empower over 200,000 businesses...  ...entire company to leverage data, AI, and ML into business impact. We accomplish...  ...Are? As a high level architect (staff engineer), you will oversee the strategy, architecture... 
    Work at office
    Worldwide

    Airwallex

    San Francisco, CA
    4 days ago
  • $140k - $210k

     ...Role We're looking for a Senior/Staff Data Platform Engineer to build and scale the foundation of...  ...the pipelines, transformations, and infrastructure that power every agent, insight, and...  ...ve worked on data systems that power ML models, intelligent workflows, or real... 
    Work at office
    Flexible hours
    Shift work

    Actively AI

    San Francisco, CA
    6 days ago
  •  ...etc. feed sensor fusion and ML models at the edge. Data is...  ...customers develop and deploy software directly to our dashcams to...  ...called Map AI. The Platform Engineering team is responsible for the...  ...distributed systems, ML infrastructure, data model abstractions, and... 
    Flexible hours

    Hivemapper

    San Francisco, CA
    2 days ago
  • $194k - $267k

     ...building the trusted, neutral infrastructure that enables organizations to...  ...user reporting Data and ML platform for Okta to scale...  ...expect great things from our engineers and reward them with stimulating...  ...opportunity for experienced Software Engineers to join our fast growing... 
    Permanent employment
    Work at office
    Local area
    Worldwide
    Flexible hours

    Okta, Inc.

    San Francisco, CA
    4 days ago
  •  ...Staff Software Engineer Plenful is on a mission to transform healthcare operations from the inside...  ...experience building backend or data infrastructure in production ~ Deep expertise in relational...  ...consume structured data, not pure ML research Comfort in customer-facing... 
    Work at office
    Flexible hours
    2 days per week

    Plenful

    San Francisco, CA
    3 days ago
  • $220k - $405k

    Perplexity is seeking an experienced Software Engineer focusing on building the next-gen AI Foundation...  ...data, evaluation and personalization infrastructure and flywheel which powers almost all...  ...closely with AI Product, Applied ML, Post-Training, and Data Science teams... 
    Worldwide

    Perplexity

    San Francisco, CA
    1 day ago
  • $155k - $240k

     ...processing. We build tools, frameworks, infrastructure and services used by every team at...  ...You will... Be a part of a team of engineers that manages Waabi’s data lifecycle from...  ...Waabi, including research scientists, ML and software engineers, system engineers and program... 
    Full time
    Work at office
    Work from home
    Flexible hours

    Waabi Innovation Inc.

    San Francisco, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Software Engineer, ML Infrastructure. Be the first to apply!