Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff + Sr. Software Engineer, Cloud Inference Launch Engineering

$320k

Anthropic

About Anthropic

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the Role

The Cloud Inference team scales and optimizes Claude to serve the massive audiences of developers and enterprise companies across AWS, GCP, Azure, and future cloud service providers (CSPs). We own the end-to-end product of Claude on each cloud platform, from API integration and intelligent request routing to inference execution, capacity management, and day-to-day operations.

Within Cloud Inference, the model & inference launch team owns the validation pipeline for our inference server and load balancer on these platforms. We're responsible for every inference change - model launches, performance improvements, safeguard integrations - landing on cloud platforms with correctness, performance, and reliability intact.

This is high-leverage infrastructure work: validation has to be fast and cheap enough to run on the same accelerators that serve customers, trustworthy enough to replace manual checks, and consistent enough that a change working on Anthropic first-party means it works everywhere. This directly determines how fast frontier models and features ship to every cloud platform, and how quickly performance wins reach production - reclaiming capacity at a time when compute is our scarcest resource.
What You'll Do
  • Be on the critical path for frontier model launches, bringing up inference for new model architectures and shipping them to cloud platforms in lockstep with our first-party platform
  • Work with the core inference team to bring new inference features (e.g. structured sampling, prompt caching, and more) to cloud platforms, owning the platform-specific integration that gets them to production
  • Identify and dive deep on the gaps that make inference behave differently across first-party and CSPs - config drift, observability, deployment patterns, hard cross-platform bugs - and fix them at the source rather than building platform-specific workarounds
  • Design, build, and own the CI/CD infrastructure for the inference server and load balancer across cloud platforms, with shadow traffic, performance baselines (throughput and latency), and correctness checks that catch regressions before production
  • Drive down merge-to-production cycle time by making validation faster, more parallel, and cost-effective enough to run on the same constrained accelerator pool that serves customers, without trading away reliability
  • Analyze observability data across providers to identify performance bottlenecks, cost anomalies, and regressions, and drive remediation based on real-world production workloads
You May Be a Good Fit If You:
  • Have a strong interest in LLM serving; prior inference or ML experience is not required
  • Have significant software engineering experience, with a strong background in high-performance, large-scale distributed systems serving millions of users
  • Have a track record of building automation or test infrastructure that measurably improved release velocity or reliability
  • Have experience building or operating services on at least one major cloud platform (AWS, GCP, or Azure), with exposure to Kubernetes, Infrastructure as Code, or container orchestration
  • Thrive in cross-functional collaboration with both internal teams and external partners
  • Are a fast learner who can quickly ramp up on new technologies, hardware platforms, and provider ecosystems
  • Are highly autonomous and take ownership of problems end-to-end, including work that falls outside your job description
Strong Candidates May Also Have Experience With:
  • LLM inference optimization, batching, and caching strategies
  • Capacity-constrained scheduling or shared-resource test infrastructure
  • Solid understanding of multi-region deployments, request routing, load balancing, global traffic management
  • Working with CSP partner teams to scale infrastructure across multiple platforms, navigating differences in networking, security, privacy, and managed service
  • Proficiency in Python or Rust

The annual compensation range for this role is listed below.


For sales roles, the range provided is the role's On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.

Annual Salary:

$320,000-$485,000 USD

Logistics

Minimum education: Bachelor's degree or an equivalent combination of education, training, and/or experience

Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience

Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position

Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.

Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.

Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links-visit anthropic.com/careers directly for confirmed position openings.
How we're different

We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact - advancing our long-term goals of steerable, trustworthy AI - rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.

The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.
Come work with us!

Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.
Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Staff + Sr. Software Engineer, Cloud Inference Launch Engineering in San Francisco, CA vacancy
  • Senior Launch Automation Engineer Darwin has partnered with a fast-paced startup in the Bay Area to find a Senior Launch Automation Engineer who will own the algorithms and software that automatically load, fuel, and launch orbital rockets. What you’ll do Develop real-... 
    Senior
    Software

    Darwin Recruitment

    San Francisco, CA
    3 days ago
  •  ...Staff / Sr. Staff Software Engineer (Frontend) San Francisco Bay Area, California, United States Tessell is a fast-growing company focused on data...  ...Familiarity with containerization (e.g., Docker) and cloud services (AWS, Azure). Prior experience working in a startup... 
    Senior
    Software

    Tessell

    San Francisco, CA
    1 day ago
  • $181.4k - $235.8k

     ...Sr Staff Engineer, Cloud DevOps Full time Two Folsom, San Francisco, CA, US 94105 About Gap Inc. At Gap Inc., we create culture as much...  ...in high performing technology. Develop and enhance software to solve relatively complex situations and analyze data to... 
    Senior
    Software
    Minimum wage
    Full time

    Gap Inc.

    San Francisco, CA
    1 day ago
  •  ...team of former Scale AI engineers and operators. In less...  ...As a Senior Software Engineer, Platform at...  ...architecture. ~ Extensive cloud infrastructure and...  ...have Experience launching systems from ground up...  ...video data. Scaled up inference and train compute for... 
    Senior
    Software
    Work at office

    David AI

    San Francisco, CA
    5 days ago
  •  ...Job Description Launch Your Data Career with Proof, Not Promises...  ...Data Science, Analytics, or Engineering , it's time to stop guessing...  ..., visualization, statistical inference Dashboard design, KPI...  ...Infrastructure-as-code and cloud data platforms (AWS, Azure)... 
    Senior

    SynergisticIT

    San Francisco, CA
    5 days ago
  •  ...focused on AI workloads is seeking a Member of Technical Staff to design and optimize inference systems. The role involves managing KV cache...  ...various components. Ideal candidates should have strong software engineering skills and experience with ML inference systems,... 
    Senior
    Software

    Gimlet Labs

    San Francisco, CA
    2 days ago
  • Acceler8 Talent is looking for a Software Engineer in San Francisco to focus on building and optimizing inference systems for next-generation AI at scale. You will design production inference pipelines and improve system performance under real production constraints. The... 
    Senior
    Software

    Acceler8 Talent

    San Francisco, CA
    2 days ago
  • $250k - $325k

     ...Engineering at Ivo Engineers at Ivo are inventors. Ivo was first-to...  ...Things break. Regions go down. Cloud and LLM providers have "...  ...legal drudgery. People love our software - despite high competition, we...  ...How far along are we? We launched in early access in 2023. Since... 
    Senior
    Software
    Contract work
    Work at office
    Remote work

    IVO Inc

    San Francisco, CA
    1 day ago
  • A fast-paced aerospace startup in the Bay Area seeks a Senior Launch Automation Engineer to develop systems for orbital rocket launches. You will own algorithms and software for propellant loading and monitor live countdowns in a collaborative team environment. Ideal candidates... 
    Software

    Darwin Recruitment

    San Francisco, CA
    3 days ago
  •  ...Distributed Systems Software Engineer - Public Cloud (Senior/Lead/Principal) Our Public Cloud engineering teams are responsible for innovating and maintaining a large scale distributed systems engineering platform that ships hundreds of features to production for tens... 
    Senior
    Software

    Salesforce, Inc..

    San Francisco, CA
    3 days ago
  • $250.6k - $384.6k

     ...Sr Manager, AV Behavior Safety Engineering (GPSSC) page is loaded## Sr Manager, AV Behavior...  ...Effective Autonomous Driving Software (SAFE‐ADS) department is...  ...and directly influence launch decisions for GM’s next generation...  ...Trees, Clustering* **Cloud & Big Data Platforms: (... 
    Senior
    Software
    Odd job
    Remote work
    Flexible hours

    General Motors

    San Francisco, CA
    4 days ago
  • $166k - $225k

    A leading data and AI company in San Francisco seeks a Senior Software Engineer to enhance their infrastructure platform. This role requires building multi-cloud systems and scalable solutions for managing data and AI workloads. Ideal candidates have a strong programming... 
    Senior
    Software
    Flexible hours

    Databricks Inc.

    San Francisco, CA
    5 days ago
  • B Capital is seeking a Senior Engineering Manager to lead the Application Engineering team in San Francisco...  .... This role requires extensive experience in software engineering and engineering management, with a strong focus on cloud ERP systems and integrations. A collaborative... 
    Senior
    Software

    B Capital

    San Francisco, CA
    5 days ago
  • Terra Quantum is looking for a Senior Software Engineer in San Francisco, California, with strong expertise in designing and implementing scalable systems. This role requires passionate individuals well-versed in Golang, Python, Kubernetes, and containerized applications... 
    Senior
    Software

    Terra Quantum

    San Francisco, CA
    2 days ago
  • $85k - $130k

    Hewlett Packard Enterprise is seeking a Quality Assurance Engineer in San Francisco to lead test strategy and quality initiatives across software and hardware teams. You will design comprehensive test strategies, develop automation frameworks, and collaborate across functions... 
    Senior
    Software

    Hewlett Packard Enterprise

    San Francisco, CA
    3 days ago
  • $181.1k - $318.4k

     ...Staff/Sr. iOS Engineer - AI, Search & Knowledge Platforms Work Locations (2) Submit Resume Do you want to make Apple products smarter...  ...with large codebases and practical solutions ~ Knowledge of software patterns that allow for testing ~ Excellent interpersonal... 
    Senior
    Software
    Work experience placement
    Relocation

    Apple

    San Francisco, CA
    1 day ago
  • $175k - $225k

     ...ubiquitous. We build the foundation for agent engineering in the real world, helping developers...  ..., and Deep Agents), and the newly launched LangSmith Engine for autonomous agent improvement...  ...(Postgres, Redis, Clickhouse), and cloud platforms (AWS, GCP, Azure) ~ Strong... 
    Senior
    Software
    Work at office
    Flexible hours

    LangChain, Inc

    San Francisco, CA
    2 days ago
  • $230.73k - $302.83k

     ...looking for a passionate Principal Engineer who will join us in building...  ...artist community and product launch platforms. This role is...  ...with a deep understanding of software architecture principles, including...  ...programming languages, cloud infra, databases, caching, containers... 
    Software
    Full time
    Temporary work
    Work at office
    Local area
    Worldwide

    Minted

    San Francisco, CA
    7 days ago
  •  ...in San Francisco is seeking a Senior Systems Administrator to support network systems and implement Automated Litigation Support software solutions. The ideal candidate will have extensive experience in implementing litigation support applications, along with a strong... 
    Senior
    Software

    Contact Government Services, LLC

    San Francisco, CA
    5 days ago
  • The Walt Disney Company (Germany) GmbH is hiring a Senior Software Engineer in San Francisco to develop next-generation audio tools. This hybrid position requires 5+ years in audio video workflows and proficiency in languages like Go and Python. Responsibilities include... 
    Senior
    Software

    The Walt Disney Company (Germany) GmbH

    San Francisco, CA
    2 days ago
  • $162k - $225k

     ...Senior Software Engineer, Cloud Services San Francisco, CA Who We Are HP IQ is HP's new AI innovation lab. Combining startup agility with HP's global scale, we're building intelligent technologies that redefine how the world works, creates, and collaborates.... 
    Senior
    Software
    Full time
    Temporary work
    Local area
    Flexible hours

    HP IQ

    San Francisco, CA
    1 day ago
  • $176.6k - $239k

     ...have the technical skills in cloud software development to support...  ...AWS Product Marketing Demo Engineering team as a Builder Solution Architect...  ...demonstrations for service launches, events, sales enablement,...  ...employees, supervisors, and staff; adhere to standards of excellence... 
    Senior
    Software
    Local area
    Flexible hours

    Amazon

    San Francisco, CA
    2 days ago
  •  ...infrastructure company in San Francisco is seeking an experienced engineer for its Inference Platform team. This role involves managing end-to-end...  ...orchestration. Candidates should have deep experience in software engineering, particularly with Python or Go, and be... 
    Senior
    Software

    Fluidstack

    San Francisco, CA
    5 days ago
  • $170k - $210k

     ...Senior Software Engineer At Trunk, our mission is to help teams create high-quality software quickly. We've helped engineering teams at...  ...teams to land code faster and develop happier. Our founders launched Trunk in 2021 after designing, delivering, and scaling... 
    Senior
    Software
    Temporary work
    Work at office
    Shift work

    TRUNK LTD

    San Francisco, CA
    1 day ago
  • $207k - $385k

     ...Team Join the engineering teams that bring...  ...We're seeking Software Engineers who can...  ...world impact. From launching net-new capabilities...  ...optimizing how we serve inference in unique, high-...  ...and in the cloud, for our public sector...  ...of Technical Staff . We use Senior Staff... 
    Senior
    Software

    OpenAI

    San Francisco, CA
    4 days ago
  •  ...Staff/Senior Backend Software Engineer Full-time Hybrid SF Bay Area About Us We are...  ...payment processing. After launching in January last year, we processed...  ...and develop agentic workflows Cloud Infrastructure : Help maintain and... 
    Senior
    Software
    Full time
    Work at office
    Remote work

    hireVouch

    San Francisco, CA
    3 days ago
  • **Job Title:**Senior Manager, Software Engineering - Cloud Platform **Location:** New York, NY; San Francisco, CA**Customer Focus:** Treating internal developers as our primary customers and prioritizing their velocity and user experience.As a Senior Manager and "Player... 
    Senior
    Software
    Work experience placement
    Shift work

    Salesforce, Inc.

    San Francisco, CA
    4 days ago
  •  ...not just building software - we’re building a...  ...as both a central engineering function and an embedded...  ...across a modern cloud-native stack to...  ...new services launch, with the authority...  ...engineering leads and staff engineers to...  ...services (e.g., LLM inference latency, non-determinism... 
    Senior
    Software
    Work at office
    Immediate start
    Worldwide
    Monday to Friday
    Flexible hours

    Careers at Drata

    San Francisco, CA
    3 days ago
  • $166.9k - $225.9k

     ...as both a central engineering function and an embedded...  ...intersection of software engineering and...  ...across a modern cloud‑native stack to help...  ...new services launch, with authority to...  ...engineering leads and staff engineers to define...  ...(e.g., LLM inference latency, non‑determinism... 
    Senior
    Software
    Flexible hours

    Drata

    San Francisco, CA
    3 days ago
  •  ...About This Role At Strava, the Foundation engineering team safeguards the infrastructure...  ...Bring to the Team: Proven foundation in software engineering. Comfortable working with various...  ...in a containerized microservices cloud environment (e.g. Kubernetes). Experience... 
    Senior
    Software
    Work at office
    Flexible hours
    3 days per week

    Strava

    San Francisco, CA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff + Sr. Software Engineer, Cloud Inference Launch Engineering. Be the first to apply!