Staff + Sr. Software Engineer, Cloud Inference Launch Engineering

$320k

Anthropic

About Anthropic

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the Role

The Cloud Inference team scales and optimizes Claude to serve the massive audiences of developers and enterprise companies across AWS, GCP, Azure, and future cloud service providers (CSPs). We own the end-to-end product of Claude on each cloud platform, from API integration and intelligent request routing to inference execution, capacity management, and day-to-day operations.

Within Cloud Inference, the model & inference launch team owns the validation pipeline for our inference server and load balancer on these platforms. We're responsible for every inference change - model launches, performance improvements, safeguard integrations - landing on cloud platforms with correctness, performance, and reliability intact.

This is high-leverage infrastructure work: validation has to be fast and cheap enough to run on the same accelerators that serve customers, trustworthy enough to replace manual checks, and consistent enough that a change working on Anthropic first-party means it works everywhere. This directly determines how fast frontier models and features ship to every cloud platform, and how quickly performance wins reach production - reclaiming capacity at a time when compute is our scarcest resource.
What You'll Do

Be on the critical path for frontier model launches, bringing up inference for new model architectures and shipping them to cloud platforms in lockstep with our first-party platform
Work with the core inference team to bring new inference features (e.g. structured sampling, prompt caching, and more) to cloud platforms, owning the platform-specific integration that gets them to production
Identify and dive deep on the gaps that make inference behave differently across first-party and CSPs - config drift, observability, deployment patterns, hard cross-platform bugs - and fix them at the source rather than building platform-specific workarounds
Design, build, and own the CI/CD infrastructure for the inference server and load balancer across cloud platforms, with shadow traffic, performance baselines (throughput and latency), and correctness checks that catch regressions before production
Drive down merge-to-production cycle time by making validation faster, more parallel, and cost-effective enough to run on the same constrained accelerator pool that serves customers, without trading away reliability
Analyze observability data across providers to identify performance bottlenecks, cost anomalies, and regressions, and drive remediation based on real-world production workloads

You May Be a Good Fit If You:

Have a strong interest in LLM serving; prior inference or ML experience is not required
Have significant software engineering experience, with a strong background in high-performance, large-scale distributed systems serving millions of users
Have a track record of building automation or test infrastructure that measurably improved release velocity or reliability
Have experience building or operating services on at least one major cloud platform (AWS, GCP, or Azure), with exposure to Kubernetes, Infrastructure as Code, or container orchestration
Thrive in cross-functional collaboration with both internal teams and external partners
Are a fast learner who can quickly ramp up on new technologies, hardware platforms, and provider ecosystems
Are highly autonomous and take ownership of problems end-to-end, including work that falls outside your job description

Strong Candidates May Also Have Experience With:

LLM inference optimization, batching, and caching strategies
Capacity-constrained scheduling or shared-resource test infrastructure
Solid understanding of multi-region deployments, request routing, load balancing, global traffic management
Working with CSP partner teams to scale infrastructure across multiple platforms, navigating differences in networking, security, privacy, and managed service
Proficiency in Python or Rust

The annual compensation range for this role is listed below.

For sales roles, the range provided is the role's On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.

Annual Salary:

$320,000-$485,000 USD

Logistics

Minimum education: Bachelor's degree or an equivalent combination of education, training, and/or experience

Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience

Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position

Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.

Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.

Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links-visit anthropic.com/careers directly for confirmed position openings.
How we're different

We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact - advancing our long-term goals of steerable, trustworthy AI - rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.

The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.
Come work with us!

Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

Apply

Vacancy posted 4 days ago

Similar jobs that could be interesting for youBased on the Staff + Sr. Software Engineer, Cloud Inference Launch Engineering in San Francisco, CA vacancy

Senior Launch Automation Engineer
Senior Launch Automation Engineer Darwin has partnered with a fast-paced startup in the Bay Area to find a Senior Launch Automation Engineer who will own the algorithms and software that automatically load, fuel, and launch orbital rockets. What you’ll do Develop real-...
Senior
Software
Darwin Recruitment
San Francisco, CA
3 days ago
Staff / Sr. Staff Software Engineer (Frontend)
...Staff / Sr. Staff Software Engineer (Frontend) San Francisco Bay Area, California, United States Tessell is a fast-growing company focused on data... ...Familiarity with containerization (e.g., Docker) and cloud services (AWS, Azure). Prior experience working in a startup...
Senior
Software
Tessell
San Francisco, CA
1 day ago
Sr Staff Engineer, Cloud DevOps
$181.4k - $235.8k
...Sr Staff Engineer, Cloud DevOps Full time Two Folsom, San Francisco, CA, US 94105 About Gap Inc. At Gap Inc., we create culture as much... ...in high performing technology. Develop and enhance software to solve relatively complex situations and analyze data to...
Senior
Software
Minimum wage
Full time
Gap Inc.
San Francisco, CA
1 day ago
Senior Software Engineer, Platform
...team of former Scale AI engineers and operators. In less... ...As a Senior Software Engineer, Platform at... ...architecture. ~ Extensive cloud infrastructure and... ...have Experience launching systems from ground up... ...video data. Scaled up inference and train compute for...
Senior
Software
Work at office
David AI
San Francisco, CA
5 days ago
Sr. Data Analyst
...Job Description Launch Your Data Career with Proof, Not Promises... ...Data Science, Analytics, or Engineering , it's time to stop guessing... ..., visualization, statistical inference Dashboard design, KPI... ...Infrastructure-as-code and cloud data platforms (AWS, Azure)...
Senior
SynergisticIT
San Francisco, CA
5 days ago
Senior ML Inference Systems Engineer
...focused on AI workloads is seeking a Member of Technical Staff to design and optimize inference systems. The role involves managing KV cache... ...various components. Ideal candidates should have strong software engineering skills and experience with ML inference systems,...
Senior
Software
Gimlet Labs
San Francisco, CA
2 days ago
Senior Inference Systems Engineer - Scale Production ML
Acceler8 Talent is looking for a Software Engineer in San Francisco to focus on building and optimizing inference systems for next-generation AI at scale. You will design production inference pipelines and improve system performance under real production constraints. The...
Senior
Software
Acceler8 Talent
San Francisco, CA
2 days ago
Senior Cloud/ML Ops Engineer
$250k - $325k
...Engineering at Ivo Engineers at Ivo are inventors. Ivo was first-to... ...Things break. Regions go down. Cloud and LLM providers have "... ...legal drudgery. People love our software - despite high competition, we... ...How far along are we? We launched in early access in 2023. Since...
Senior
Software
Contract work
Work at office
Remote work
IVO Inc
San Francisco, CA
1 day ago
Launch Automation Engineer - Real-Time Systems & Simulink
A fast-paced aerospace startup in the Bay Area seeks a Senior Launch Automation Engineer to develop systems for orbital rocket launches. You will own algorithms and software for propellant loading and monitor live countdowns in a collaborative team environment. Ideal candidates...
Software
Darwin Recruitment
San Francisco, CA
3 days ago
Distributed Systems Software Engineer - Public Cloud (Mid/Senior/Lead/Principal)
...Distributed Systems Software Engineer - Public Cloud (Senior/Lead/Principal) Our Public Cloud engineering teams are responsible for innovating and maintaining a large scale distributed systems engineering platform that ships hundreds of features to production for tens...
Senior
Software
Salesforce, Inc..
San Francisco, CA
3 days ago
Sr Manager, AV Behavior Safety Engineering (GPSSC)
$250.6k - $384.6k
...Sr Manager, AV Behavior Safety Engineering (GPSSC) page is loaded## Sr Manager, AV Behavior... ...Effective Autonomous Driving Software (SAFE‐ADS) department is... ...and directly influence launch decisions for GM’s next generation... ...Trees, Clustering* **Cloud & Big Data Platforms: (...
Senior
Software
Odd job
Remote work
Flexible hours
General Motors
San Francisco, CA
4 days ago
Senior Infra & Tools Engineer — Scalable Multi-Cloud
$166k - $225k
A leading data and AI company in San Francisco seeks a Senior Software Engineer to enhance their infrastructure platform. This role requires building multi-cloud systems and scalable solutions for managing data and AI workloads. Ideal candidates have a strong programming...
Senior
Software
Flexible hours
Databricks Inc.
San Francisco, CA
5 days ago
Senior Engineering Leader, Cloud ERP & Financial Systems
B Capital is seeking a Senior Engineering Manager to lead the Application Engineering team in San Francisco... .... This role requires extensive experience in software engineering and engineering management, with a strong focus on cloud ERP systems and integrations. A collaborative...
Senior
Software
B Capital
San Francisco, CA
5 days ago
Senior Software Engineer — Cloud-Native, Scalable Systems
Terra Quantum is looking for a Senior Software Engineer in San Francisco, California, with strong expertise in designing and implementing scalable systems. This role requires passionate individuals well-versed in Golang, Python, Kubernetes, and containerized applications...
Senior
Software
Terra Quantum
San Francisco, CA
2 days ago
Senior QA Engineer for AI-Driven Cloud Apps
$85k - $130k
Hewlett Packard Enterprise is seeking a Quality Assurance Engineer in San Francisco to lead test strategy and quality initiatives across software and hardware teams. You will design comprehensive test strategies, develop automation frameworks, and collaborate across functions...
Senior
Software
Hewlett Packard Enterprise
San Francisco, CA
3 days ago
Staff/Sr. iOS Engineer - AI, Search & Knowledge Platforms
$181.1k - $318.4k
...Staff/Sr. iOS Engineer - AI, Search & Knowledge Platforms Work Locations (2) Submit Resume Do you want to make Apple products smarter... ...with large codebases and practical solutions ~ Knowledge of software patterns that allow for testing ~ Excellent interpersonal...
Senior
Software
Work experience placement
Relocation
Apple
San Francisco, CA
1 day ago
Senior Backend Software Engineer, AI Observability & Evals Platform (LangSmith)
$175k - $225k
...ubiquitous. We build the foundation for agent engineering in the real world, helping developers... ..., and Deep Agents), and the newly launched LangSmith Engine for autonomous agent improvement... ...(Postgres, Redis, Clickhouse), and cloud platforms (AWS, GCP, Azure) ~ Strong...
Senior
Software
Work at office
Flexible hours
LangChain, Inc
San Francisco, CA
2 days ago
Principal Engineer - eCommerce, Digital, & Product Launch
$230.73k - $302.83k
...looking for a passionate Principal Engineer who will join us in building... ...artist community and product launch platforms. This role is... ...with a deep understanding of software architecture principles, including... ...programming languages, cloud infra, databases, caching, containers...
Software
Full time
Temporary work
Work at office
Local area
Worldwide
Minted
San Francisco, CA
7 days ago
Senior Relativity Systems Engineer — ALS & Cloud Infra
...in San Francisco is seeking a Senior Systems Administrator to support network systems and implement Automated Litigation Support software solutions. The ideal candidate will have extensive experience in implementing litigation support applications, along with a strong...
Senior
Software
Contact Government Services, LLC
San Francisco, CA
5 days ago
Senior Software Engineer - Media Pipeline & Cloud Systems
The Walt Disney Company (Germany) GmbH is hiring a Senior Software Engineer in San Francisco to develop next-generation audio tools. This hybrid position requires 5+ years in audio video workflows and proficiency in languages like Go and Python. Responsibilities include...
Senior
Software
The Walt Disney Company (Germany) GmbH
San Francisco, CA
2 days ago
Senior Software Engineer, Cloud Services
$162k - $225k
...Senior Software Engineer, Cloud Services San Francisco, CA Who We Are HP IQ is HP's new AI innovation lab. Combining startup agility with HP's global scale, we're building intelligent technologies that redefine how the world works, creates, and collaborates....
Senior
Software
Full time
Temporary work
Local area
Flexible hours
HP IQ
San Francisco, CA
1 day ago
Sr Builder Solution Architect, AWS Product Marketing Demo Engineering
$176.6k - $239k
...have the technical skills in cloud software development to support... ...AWS Product Marketing Demo Engineering team as a Builder Solution Architect... ...demonstrations for service launches, events, sales enablement,... ...employees, supervisors, and staff; adhere to standards of excellence...
Senior
Software
Local area
Flexible hours
Amazon
San Francisco, CA
2 days ago
Senior Inference Platform Engineer - End-to-End LLM Serving
...infrastructure company in San Francisco is seeking an experienced engineer for its Inference Platform team. This role involves managing end-to-end... ...orchestration. Candidates should have deep experience in software engineering, particularly with Python or Go, and be...
Senior
Software
Fluidstack
San Francisco, CA
5 days ago
Senior Software Engineer (Platform)
$170k - $210k
...Senior Software Engineer At Trunk, our mission is to help teams create high-quality software quickly. We've helped engineering teams at... ...teams to land code faster and develop happier. Our founders launched Trunk in 2021 after designing, delivering, and scaling...
Senior
Software
Temporary work
Work at office
Shift work
TRUNK LTD
San Francisco, CA
1 day ago
Senior Staff Software Engineer, Gov
$207k - $385k
...Team Join the engineering teams that bring... ...We're seeking Software Engineers who can... ...world impact. From launching net-new capabilities... ...optimizing how we serve inference in unique, high-... ...and in the cloud, for our public sector... ...of Technical Staff . We use Senior Staff...
Senior
Software
OpenAI
San Francisco, CA
4 days ago
Sr/Staff Backend Engineers
...Staff/Senior Backend Software Engineer Full-time Hybrid SF Bay Area About Us We are... ...payment processing. After launching in January last year, we processed... ...and develop agentic workflows Cloud Infrastructure : Help maintain and...
Senior
Software
Full time
Work at office
Remote work
hireVouch
San Francisco, CA
3 days ago
Senior Manager, Software Engineering - Cloud Platform
**Job Title:**Senior Manager, Software Engineering - Cloud Platform **Location:** New York, NY; San Francisco, CA**Customer Focus:** Treating internal developers as our primary customers and prioritizing their velocity and user experience.As a Senior Manager and "Player...
Senior
Software
Work experience placement
Shift work
Salesforce, Inc.
San Francisco, CA
4 days ago
Senior Site Reliability Engineer
...not just building software - we’re building a... ...as both a central engineering function and an embedded... ...across a modern cloud-native stack to... ...new services launch, with the authority... ...engineering leads and staff engineers to... ...services (e.g., LLM inference latency, non-determinism...
Senior
Software
Work at office
Immediate start
Worldwide
Monday to Friday
Flexible hours
Careers at Drata
San Francisco, CA
3 days ago
Senior Site Reliability Engineer
$166.9k - $225.9k
...as both a central engineering function and an embedded... ...intersection of software engineering and... ...across a modern cloud‑native stack to help... ...new services launch, with authority to... ...engineering leads and staff engineers to define... ...(e.g., LLM inference latency, non‑determinism...
Senior
Software
Flexible hours
Drata
San Francisco, CA
3 days ago
Senior Platform Engineering
...About This Role At Strava, the Foundation engineering team safeguards the infrastructure... ...Bring to the Team: Proven foundation in software engineering. Comfortable working with various... ...in a containerized microservices cloud environment (e.g. Kubernetes). Experience...
Senior
Software
Work at office
Flexible hours
3 days per week
Strava
San Francisco, CA
5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff + Sr. Software Engineer, Cloud Inference Launch Engineering. Be the first to apply!