Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff + Senior Software Engineer, Inference

$320k

United States Digital Space LLC

About the company the company’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role Our Inference team is responsible for building and maintaining the critical systems that serve Claude to millions of users worldwide. We bring Claude to life by serving our models via the industry's largest compute‑agnostic inference deployments. We are responsible for the entire stack from intelligent request routing to fleet‑wide orchestration across diverse AI accelerators. The team has a dual mandate: maximizing compute efficiency to serve our explosive customer growth, while enabling breakthrough research by giving our scientists the high‑performance inference infrastructure they need to develop next‑generation models. We tackle complex, distributed systems challenges across multiple accelerator families and emerging AI hardware running in multiple cloud platforms. Key responsibilities Design, build, and maintain the distributed systems that serve Claude to millions of users worldwide Develop intelligent request routing, load balancing, and traffic management systems across thousands of accelerators Maximize compute efficiency across the fleet by autoscaling and orchestrating production, research, and experimental workloads Build and operate production‑grade deployment pipelines for releasing new models to users Provide high‑performance inference infrastructure that enables researchers to develop next‑generation models Integrate new AI accelerator platforms and support inference for new model architectures Use observability data to tune and improve performance based on real‑world production workloads Minimum qualifications Significant software engineering experience, particularly with distributed systems Results‑oriented, with a bias towards flexibility and impact Willingness to pick up slack, even if it goes outside your job description Enjoy pair programming (we love to pair!) Desire to learn more about machine learning systems and infrastructure Thrive in environments where technical excellence directly drives both business results and research breakthroughs Care about the societal impacts of your work Preferred qualifications Experience with high‑performance, large‑scale distributed systems Experience implementing and deploying machine learning systems at scale Experience with load balancing, request routing, or traffic management systems Familiarity with LLM inference optimization, batching, and caching strategies Experience with Kubernetes and cloud infrastructure (AWS, GCP, Azure) Proficiency in Python or Rust Representative projects Designing intelligent routing algorithms that optimize request distribution across thousands of accelerators Autoscaling our compute fleet to dynamically match supply with demand across production, research, and experimental workloads Building production‑grade deployment pipelines for releasing new models to millions of users Integrating new AI accelerator platforms to maintain our hardware‑agnostic competitive advantage Contributing to new inference features (e.g., structured sampling, prompt caching) Supporting inference for new model architectures Analyzing observability data to tune performance based on real‑world production workloads Managing multi‑region deployments and geographic routing for global customers Deadline to apply: None. Applications will be reviewed on a rolling basis. The annual compensation range for this role is listed below. For sales roles, the range provided is the role’s On Target Earnings (“OTE”) range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role. Annual Salary: $320,000—$485,000 USD Logistics Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location‑based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. EEO and diversity: We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of your candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. #J-18808-Ljbffr

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Staff + Senior Software Engineer, Inference in San Francisco, CA vacancy
  •  ...Staff+ Software Engineer, Inference Runtime Remote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY About Anthropic...  ...abstractions every accelerator builds on. This is a senior IC role with broad technical ownership. You'll set... 
    Suggested
    Work at office
    Remote work
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    5 days ago
  • $281k - $356k

     ...Senior Staff Software Engineer, TLM Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver....  ...engineers to solve the "technical moat" of high-fidelity ML inference at a petabyte scale Key Responsibilities ~... 
    Senior
    Full time
    Remote work

    Waymo

    San Francisco, CA
    4 days ago
  • $207k - $385k

     ...About the Team Join the engineering teams that bring OpenAI's...  ...the Role We're seeking Software Engineers who can solve...  ...to optimizing how we serve inference in unique, high-stakes environments...  ...title Member of Technical Staff . We use Senior Staff externally to signal... 
    Senior

    OpenAI

    San Francisco, CA
    5 days ago
  •  ...Full time Location Type Hybrid Department Inference Model Serving Who are we? Our mission is...  .... Cohere is a team of researchers, engineers, designers, and more, who are passionate...  ...We are looking for Members of Technical Staff to join the Model Serving team at Cohere... 
    Suggested
    Full time
    Work experience placement
    Work at office
    Remote work
    Flexible hours

    Jaide Health

    San Francisco, CA
    4 days ago
  • $405k

     ...Senior Staff Software Engineer, API San Francisco, CA | New York City, NY About Anthropic Anthropic's mission is to create reliable, interpretable...  ...and execution, partnering closely with Research, Inference, Platform, Infrastructure, and Safeguards to ensure the... 
    Senior
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    4 days ago
  • $180k - $220k

     ...About the Role We're looking for a Senior/Staff Backend Engineer to architect and build large scale...  ...clear, maintainable, and reliable software. Strong backend engineer. Deep Python...  ...driven systems - such as real-time inference pipelines, context and retrieval... 
    Senior
    Full time
    Work at office
    Shift work

    Actively AI

    San Francisco, CA
    5 days ago
  • $237.6k - $318.24k

     ...Senior Staff Software Engineer For Ai Model Lifecycle Team Crusoe is on a mission to accelerate the abundance of energy and intelligence. As...  ...frameworks. Performance optimizations on GPU systems and inference frameworks. Benefits ~ Competitive compensation... 
    Senior
    Temporary work

    Crusoe

    San Francisco, CA
    3 days ago
  • $220k

     ...Perplexity is looking for an engineer to join their team in San Francisco. You will work on building and operating the inference engine, supporting new models, migrating GPU kernels...  ...candidate has 3+ years of experience in software engineering with a focus on ML inference... 
    Senior

    Perplexity

    San Francisco, CA
    4 days ago
  • $219k - $315k

    Zoox is looking for an experienced software engineer to work on large‑scale simulation pipelines used to validate the behavior of the Zoox...  ...Qualifications Exposure to machine learning workloads (training, inference, data generation) from a cost optimization perspective... 
    Senior
    Temporary work
    Relocation package

    Zoox

    San Francisco, CA
    2 days ago
  • A leading cloud infrastructure company is seeking a Senior Engineer 2 to join their AI Inference Optimization team. The role involves leading the technical strategy for performance architecture and addressing complex performance issues ensuring industry-leading service... 
    Senior
    Remote job

    DigitalOcean

    San Francisco, CA
    5 days ago
  • $320k

     ...United States Digital Space LLC is seeking a backend engineer for the Cloud Inference team. This role involves designing and building infrastructure...  ...and cost. The ideal candidate will have significant software engineering experience with a major cloud platform. We offer... 
    Senior

    United States Digital Space LLC

    San Francisco, CA
    1 day ago
  • $320k - $405k

     ...committed researchers, engineers, policy experts, and...  ...beneficial AI systems. Staff Infrastructure...  ...and internal research/inference/product teams to shape...  ...build alignment across senior stakeholders and communicate...  ...qualifications 8+ years of software engineering experience... 
    Senior
    Visa sponsorship

    Menlo Ventures

    San Francisco, CA
    6 days ago
  • $320k - $405k

     ...growing group of committed researchers, engineers, policy experts, and business leaders working...  ...Partner with research, training, and inference to understand workload shapes and turn...  ...failures Minimum qualifications Significant software engineering experience building and... 
    Senior

    Menlo Ventures

    San Francisco, CA
    6 days ago
  • $167.2k - $209k

    A leading cloud service provider is seeking a Senior Engineer 2 for their AI Inference Data Plane team. This remote role focuses on designing and developing high-scale, resilient data plane services that enhance AI-driven applications. The ideal candidate will have strong... 
    Senior
    Remote job

    DigitalOcean

    San Francisco, CA
    9 days ago
  •  ...Gusto is seeking a Senior Staff Software Engineer to lead technical initiatives for its Commerce Platform in San Francisco. You will oversee architecture, collaborate across teams, and ensure system reliability for 500,000+ small businesses. The ideal candidate has over... 
    Senior

    Gusto

    San Francisco, CA
    4 days ago
  • $160k - $250k

     ...Senior Backend Engineer, Inference Platform San Francisco About the Role Together AI is building the Inference Platform that brings the most...  ...is a strong plus. ~ Familiarity with GPU software stacks (CUDA, Triton, NCCL) and HPC technologies (InfiniBand... 
    Senior
    Full time
    Local area

    Together AI

    San Francisco, CA
    2 days ago
  •  ...About the role Slash is, at its core, a technology company and is on a mission to build the best engineering team in the world. We're looking for a Senior/Staff Software Engineer to help build and evolve our core banking product that powers $10b in transaction volume... 
    Senior
    Work at office

    Slash Financial

    San Francisco, CA
    4 days ago
  • $215k - $265k

     ...Data Direct Networks is seeking a Sr Staff Software Engineer to lead the ongoing development of an S3 compliant high-performance file system. The ideal candidate will have over 12 years of experience in system software development, with strong skills in C/C++ and Linux... 
    Senior
    Remote work

    DataDirect Networks Inc

    San Francisco, CA
    4 days ago
  • $2,000 per month

     ...pathologically unfair at worst. Our mission is to reimagine the world of data with you. About The Role As a Principal/Staff Software Engineer , you will help build out the next generation data platform to support decentralized analytical and ML workloads, which... 
    Senior

    Nextdata

    San Francisco, CA
    4 days ago
  • $190k - $230k

     ...Senior Software Engineer (Full Stack / Product Engineering) Location: San Francisco, NYC, Austin, or Remote (North America) Company Stage: Early-Stage (AI-Native / ERP & Commerce Infrastructure) Office Type: Hybrid / Remote-Friendly Salary: Competitive Base + Equity... 
    Senior
    Work at office
    Remote work
    Flexible hours

    Recruiting from Scratch

    San Francisco, CA
    3 days ago
  •  ...Senior Staff Software Engineer Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers... 
    Senior
    Work at office
    Visa sponsorship
    Flexible hours

    Colorwave Inc

    San Francisco, CA
    4 days ago
  •  ...industry-leading organizations across IT, Engineering, Financial Services & Fintech, and...  ...Talent Now Job #ZR 77 We are looking for a Staff Software Engineer, AI/ML with at least 6 years...  ...best practices. Requirements Seniority : 6 - 15 years of experience in a software... 
    Senior

    AI Talent Now

    San Francisco, CA
    17 hours ago
  • $160k - $220k

     ...TCV, First Harmonic, Bain Capital Ventures, First Round Capital, and more. About the Role We're looking for a Senior / Staff Software Engineer - Search & Retrieval to build and scale the systems that power Actively's AI agents to find, rank, and reason over... 
    Senior
    Work at office
    Shift work

    Actively AI

    San Francisco, CA
    16 days ago
  • $141k - $242k

     ...Waabi Senior Or Staff Software Engineer Waabi, founded by AI visionary Raquel Urtasun, is the leader in Physical AI. With a world-class team, we're unlocking the next era of autonomous transportation with technology that's powering commercial autonomous trucks and robotaxis... 
    Senior

    G2 Venture Partners

    San Francisco, CA
    4 days ago
  • $189k - $236k

     ...Senior Staff Software Engineer - Pricing and Packaging San Francisco, CA At Gusto, we're on a mission to grow the small business economy. We handle the hard stuff — payroll, health insurance, 401(k)s, and HR — so owners can focus on their craft and their customers... 
    Senior
    Full time
    Work at office
    Local area
    Remote work
    2 days per week
    3 days per week

    Gusto

    San Francisco, CA
    1 day ago
  •  ...Join us to invest in yourself, your career, and the financial world. The Role: We are looking for an experienced Senior Staff Software Engineer to join our Builder Tools engineering organization with a mission to enable SoFi engineers to elegantly solve problems.... 
    Senior
    Remote work

    SoFi

    San Francisco, CA
    2 days ago
  • $238k - $288k

     ...we're investing deeply in the firmware that underpins fleet reliability, security, and operability — and we're hiring a founding engineer to lead our BMC firmware work. You'll set the technical direction for BMC firmware across Crusoe's server platforms and drive the... 
    Senior
    Temporary work

    G2 Venture Partners

    San Francisco, CA
    1 day ago
  • $237.6k - $288k

     ...Senior Staff Software Engineer Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we own and operate each layer of the stack — from electrons to tokens... 
    Senior
    Temporary work

    G2 Venture Partners

    San Francisco, CA
    4 days ago
  •  ...Abridge Engineering Role Abridge's platform is scaling fast alongside our expanding customer base and product growth. We're building...  ...internal systems and agents that power how we develop and ship software. As an early member of this team, you'll tackle high-impact, high... 
    Senior
    Hourly pay
    Full time
    Work at office
    Local area
    Relocation
    Flexible hours
    3 days per week

    Abridge

    San Francisco, CA
    4 days ago
  • $141k - $242k

     ...Senior Or Staff Software Engineer Waabi, founded by AI visionary Raquel Urtasun, is the leader in Physical AI. With a world-class team, we're unlocking the next era of autonomous transportation with technology that's powering commercial autonomous trucks and robotaxis... 
    Senior
    Full time
    Work at office
    Work from home
    Flexible hours

    G2 Venture Partners

    San Francisco, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff + Senior Software Engineer, Inference. Be the first to apply!