Staff + Senior Software Engineer, Inference
$320kUnited States Digital Space LLC
About the company the company’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role Our Inference team is responsible for building and maintaining the critical systems that serve Claude to millions of users worldwide. We bring Claude to life by serving our models via the industry's largest compute‑agnostic inference deployments. We are responsible for the entire stack from intelligent request routing to fleet‑wide orchestration across diverse AI accelerators. The team has a dual mandate: maximizing compute efficiency to serve our explosive customer growth, while enabling breakthrough research by giving our scientists the high‑performance inference infrastructure they need to develop next‑generation models. We tackle complex, distributed systems challenges across multiple accelerator families and emerging AI hardware running in multiple cloud platforms. Key responsibilities Design, build, and maintain the distributed systems that serve Claude to millions of users worldwide Develop intelligent request routing, load balancing, and traffic management systems across thousands of accelerators Maximize compute efficiency across the fleet by autoscaling and orchestrating production, research, and experimental workloads Build and operate production‑grade deployment pipelines for releasing new models to users Provide high‑performance inference infrastructure that enables researchers to develop next‑generation models Integrate new AI accelerator platforms and support inference for new model architectures Use observability data to tune and improve performance based on real‑world production workloads Minimum qualifications Significant software engineering experience, particularly with distributed systems Results‑oriented, with a bias towards flexibility and impact Willingness to pick up slack, even if it goes outside your job description Enjoy pair programming (we love to pair!) Desire to learn more about machine learning systems and infrastructure Thrive in environments where technical excellence directly drives both business results and research breakthroughs Care about the societal impacts of your work Preferred qualifications Experience with high‑performance, large‑scale distributed systems Experience implementing and deploying machine learning systems at scale Experience with load balancing, request routing, or traffic management systems Familiarity with LLM inference optimization, batching, and caching strategies Experience with Kubernetes and cloud infrastructure (AWS, GCP, Azure) Proficiency in Python or Rust Representative projects Designing intelligent routing algorithms that optimize request distribution across thousands of accelerators Autoscaling our compute fleet to dynamically match supply with demand across production, research, and experimental workloads Building production‑grade deployment pipelines for releasing new models to millions of users Integrating new AI accelerator platforms to maintain our hardware‑agnostic competitive advantage Contributing to new inference features (e.g., structured sampling, prompt caching) Supporting inference for new model architectures Analyzing observability data to tune performance based on real‑world production workloads Managing multi‑region deployments and geographic routing for global customers Deadline to apply: None. Applications will be reviewed on a rolling basis. The annual compensation range for this role is listed below. For sales roles, the range provided is the role’s On Target Earnings (“OTE”) range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role. Annual Salary: $320,000—$485,000 USD Logistics Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location‑based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. EEO and diversity: We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of your candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. #J-18808-Ljbffr
- ...Staff+ Software Engineer, Inference Runtime Remote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY About Anthropic... ...abstractions every accelerator builds on. This is a senior IC role with broad technical ownership. You'll set...SuggestedWork at officeRemote workVisa sponsorshipFlexible hours
$281k - $356k
...Senior Staff Software Engineer, TLM Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver.... ...engineers to solve the "technical moat" of high-fidelity ML inference at a petabyte scale Key Responsibilities ~...SeniorFull timeRemote work$207k - $385k
...About the Team Join the engineering teams that bring OpenAI's... ...the Role We're seeking Software Engineers who can solve... ...to optimizing how we serve inference in unique, high-stakes environments... ...title Member of Technical Staff . We use Senior Staff externally to signal...Senior- ...Full time Location Type Hybrid Department Inference Model Serving Who are we? Our mission is... .... Cohere is a team of researchers, engineers, designers, and more, who are passionate... ...We are looking for Members of Technical Staff to join the Model Serving team at Cohere...SuggestedFull timeWork experience placementWork at officeRemote workFlexible hours
$405k
...Senior Staff Software Engineer, API San Francisco, CA | New York City, NY About Anthropic Anthropic's mission is to create reliable, interpretable... ...and execution, partnering closely with Research, Inference, Platform, Infrastructure, and Safeguards to ensure the...SeniorWork at officeVisa sponsorshipFlexible hours$180k - $220k
...About the Role We're looking for a Senior/Staff Backend Engineer to architect and build large scale... ...clear, maintainable, and reliable software. Strong backend engineer. Deep Python... ...driven systems - such as real-time inference pipelines, context and retrieval...SeniorFull timeWork at officeShift work$237.6k - $318.24k
...Senior Staff Software Engineer For Ai Model Lifecycle Team Crusoe is on a mission to accelerate the abundance of energy and intelligence. As... ...frameworks. Performance optimizations on GPU systems and inference frameworks. Benefits ~ Competitive compensation...SeniorTemporary work$220k
...Perplexity is looking for an engineer to join their team in San Francisco. You will work on building and operating the inference engine, supporting new models, migrating GPU kernels... ...candidate has 3+ years of experience in software engineering with a focus on ML inference...Senior$219k - $315k
Zoox is looking for an experienced software engineer to work on large‑scale simulation pipelines used to validate the behavior of the Zoox... ...Qualifications Exposure to machine learning workloads (training, inference, data generation) from a cost optimization perspective...SeniorTemporary workRelocation package- A leading cloud infrastructure company is seeking a Senior Engineer 2 to join their AI Inference Optimization team. The role involves leading the technical strategy for performance architecture and addressing complex performance issues ensuring industry-leading service...SeniorRemote job
$320k
...United States Digital Space LLC is seeking a backend engineer for the Cloud Inference team. This role involves designing and building infrastructure... ...and cost. The ideal candidate will have significant software engineering experience with a major cloud platform. We offer...Senior$320k - $405k
...committed researchers, engineers, policy experts, and... ...beneficial AI systems. Staff Infrastructure... ...and internal research/inference/product teams to shape... ...build alignment across senior stakeholders and communicate... ...qualifications 8+ years of software engineering experience...SeniorVisa sponsorship$320k - $405k
...growing group of committed researchers, engineers, policy experts, and business leaders working... ...Partner with research, training, and inference to understand workload shapes and turn... ...failures Minimum qualifications Significant software engineering experience building and...Senior$167.2k - $209k
A leading cloud service provider is seeking a Senior Engineer 2 for their AI Inference Data Plane team. This remote role focuses on designing and developing high-scale, resilient data plane services that enhance AI-driven applications. The ideal candidate will have strong...SeniorRemote job- ...Gusto is seeking a Senior Staff Software Engineer to lead technical initiatives for its Commerce Platform in San Francisco. You will oversee architecture, collaborate across teams, and ensure system reliability for 500,000+ small businesses. The ideal candidate has over...Senior
$160k - $250k
...Senior Backend Engineer, Inference Platform San Francisco About the Role Together AI is building the Inference Platform that brings the most... ...is a strong plus. ~ Familiarity with GPU software stacks (CUDA, Triton, NCCL) and HPC technologies (InfiniBand...SeniorFull timeLocal area- ...About the role Slash is, at its core, a technology company and is on a mission to build the best engineering team in the world. We're looking for a Senior/Staff Software Engineer to help build and evolve our core banking product that powers $10b in transaction volume...SeniorWork at office
$215k - $265k
...Data Direct Networks is seeking a Sr Staff Software Engineer to lead the ongoing development of an S3 compliant high-performance file system. The ideal candidate will have over 12 years of experience in system software development, with strong skills in C/C++ and Linux...SeniorRemote work$2,000 per month
...pathologically unfair at worst. Our mission is to reimagine the world of data with you. About The Role As a Principal/Staff Software Engineer , you will help build out the next generation data platform to support decentralized analytical and ML workloads, which...Senior$190k - $230k
...Senior Software Engineer (Full Stack / Product Engineering) Location: San Francisco, NYC, Austin, or Remote (North America) Company Stage: Early-Stage (AI-Native / ERP & Commerce Infrastructure) Office Type: Hybrid / Remote-Friendly Salary: Competitive Base + Equity...SeniorWork at officeRemote workFlexible hours- ...Senior Staff Software Engineer Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers...SeniorWork at officeVisa sponsorshipFlexible hours
- ...industry-leading organizations across IT, Engineering, Financial Services & Fintech, and... ...Talent Now Job #ZR 77 We are looking for a Staff Software Engineer, AI/ML with at least 6 years... ...best practices. Requirements Seniority : 6 - 15 years of experience in a software...Senior
$160k - $220k
...TCV, First Harmonic, Bain Capital Ventures, First Round Capital, and more. About the Role We're looking for a Senior / Staff Software Engineer - Search & Retrieval to build and scale the systems that power Actively's AI agents to find, rank, and reason over...SeniorWork at officeShift work$141k - $242k
...Waabi Senior Or Staff Software Engineer Waabi, founded by AI visionary Raquel Urtasun, is the leader in Physical AI. With a world-class team, we're unlocking the next era of autonomous transportation with technology that's powering commercial autonomous trucks and robotaxis...Senior$189k - $236k
...Senior Staff Software Engineer - Pricing and Packaging San Francisco, CA At Gusto, we're on a mission to grow the small business economy. We handle the hard stuff — payroll, health insurance, 401(k)s, and HR — so owners can focus on their craft and their customers...SeniorFull timeWork at officeLocal areaRemote work2 days per week3 days per week- ...Join us to invest in yourself, your career, and the financial world. The Role: We are looking for an experienced Senior Staff Software Engineer to join our Builder Tools engineering organization with a mission to enable SoFi engineers to elegantly solve problems....SeniorRemote work
$238k - $288k
...we're investing deeply in the firmware that underpins fleet reliability, security, and operability — and we're hiring a founding engineer to lead our BMC firmware work. You'll set the technical direction for BMC firmware across Crusoe's server platforms and drive the...SeniorTemporary work$237.6k - $288k
...Senior Staff Software Engineer Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we own and operate each layer of the stack — from electrons to tokens...SeniorTemporary work- ...Abridge Engineering Role Abridge's platform is scaling fast alongside our expanding customer base and product growth. We're building... ...internal systems and agents that power how we develop and ship software. As an early member of this team, you'll tackle high-impact, high...SeniorHourly payFull timeWork at officeLocal areaRelocationFlexible hours3 days per week
$141k - $242k
...Senior Or Staff Software Engineer Waabi, founded by AI visionary Raquel Urtasun, is the leader in Physical AI. With a world-class team, we're unlocking the next era of autonomous transportation with technology that's powering commercial autonomous trucks and robotaxis...SeniorFull timeWork at officeWork from homeFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff + Senior Software Engineer, Inference. Be the first to apply!
- senior data management analyst San Francisco, CA
- senior app developer San Francisco, CA
- senior manager insurance San Francisco, CA
- senior game producer San Francisco, CA
- senior retail sales associate San Francisco, CA
- senior manager quality engineering San Francisco, CA
- senior software test automation engineer San Francisco, CA
- senior quantitative risk analyst San Francisco, CA
- senior broker San Francisco, CA
- senior compensation manager San Francisco, CA

