Staff + Sr. Software Engineer, Cloud Inference Launch Engineering
$320kAnthropic
About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the Role The Cloud Inference team scales and optimizes Claude to serve the massive audiences of developers and enterprise companies across AWS, GCP, Azure, and future cloud service providers (CSPs). We own the end-to-end product of Claude on each cloud platform, from API integration and intelligent request routing to inference execution, capacity management, and day-to-day operations. Within Cloud Inference, the model & inference launch team owns the validation pipeline for our inference server and load balancer on these platforms. We're responsible for every inference change - model launches, performance improvements, safeguard integrations - landing on cloud platforms with correctness, performance, and reliability intact. This is high-leverage infrastructure work: validation has to be fast and cheap enough to run on the same accelerators that serve customers, trustworthy enough to replace manual checks, and consistent enough that a change working on Anthropic first-party means it works everywhere. This directly determines how fast frontier models and features ship to every cloud platform, and how quickly performance wins reach production - reclaiming capacity at a time when compute is our scarcest resource.
What You'll Do
For sales roles, the range provided is the role's On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role. Annual Salary: $320,000-$485,000 USD Logistics Minimum education: Bachelor's degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links-visit anthropic.com/careers directly for confirmed position openings.
How we're different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact - advancing our long-term goals of steerable, trustworthy AI - rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.
Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.
What You'll Do
- Be on the critical path for frontier model launches, bringing up inference for new model architectures and shipping them to cloud platforms in lockstep with our first-party platform
- Work with the core inference team to bring new inference features (e.g. structured sampling, prompt caching, and more) to cloud platforms, owning the platform-specific integration that gets them to production
- Identify and dive deep on the gaps that make inference behave differently across first-party and CSPs - config drift, observability, deployment patterns, hard cross-platform bugs - and fix them at the source rather than building platform-specific workarounds
- Design, build, and own the CI/CD infrastructure for the inference server and load balancer across cloud platforms, with shadow traffic, performance baselines (throughput and latency), and correctness checks that catch regressions before production
- Drive down merge-to-production cycle time by making validation faster, more parallel, and cost-effective enough to run on the same constrained accelerator pool that serves customers, without trading away reliability
- Analyze observability data across providers to identify performance bottlenecks, cost anomalies, and regressions, and drive remediation based on real-world production workloads
- Have a strong interest in LLM serving; prior inference or ML experience is not required
- Have significant software engineering experience, with a strong background in high-performance, large-scale distributed systems serving millions of users
- Have a track record of building automation or test infrastructure that measurably improved release velocity or reliability
- Have experience building or operating services on at least one major cloud platform (AWS, GCP, or Azure), with exposure to Kubernetes, Infrastructure as Code, or container orchestration
- Thrive in cross-functional collaboration with both internal teams and external partners
- Are a fast learner who can quickly ramp up on new technologies, hardware platforms, and provider ecosystems
- Are highly autonomous and take ownership of problems end-to-end, including work that falls outside your job description
- LLM inference optimization, batching, and caching strategies
- Capacity-constrained scheduling or shared-resource test infrastructure
- Solid understanding of multi-region deployments, request routing, load balancing, global traffic management
- Working with CSP partner teams to scale infrastructure across multiple platforms, navigating differences in networking, security, privacy, and managed service
- Proficiency in Python or Rust
For sales roles, the range provided is the role's On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role. Annual Salary: $320,000-$485,000 USD Logistics Minimum education: Bachelor's degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links-visit anthropic.com/careers directly for confirmed position openings.
How we're different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact - advancing our long-term goals of steerable, trustworthy AI - rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.
Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.
Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Staff + Sr. Software Engineer, Cloud Inference Launch Engineering in San Francisco, CA vacancy
- Senior Launch Automation Engineer Darwin has partnered with a fast-paced startup in the Bay Area to find a Senior Launch Automation Engineer who will own the algorithms and software that automatically load, fuel, and launch orbital rockets. What you’ll do Develop real-...SeniorSoftware
- ...Staff / Sr. Staff Software Engineer (Frontend) San Francisco Bay Area, California, United States Tessell is a fast-growing company focused on data... ...Familiarity with containerization (e.g., Docker) and cloud services (AWS, Azure). Prior experience working in a startup...SeniorSoftware
$181.4k - $235.8k
...Sr Staff Engineer, Cloud DevOps Full time Two Folsom, San Francisco, CA, US 94105 About Gap Inc. At Gap Inc., we create culture as much... ...in high performing technology. Develop and enhance software to solve relatively complex situations and analyze data to...SeniorSoftwareMinimum wageFull time- ...team of former Scale AI engineers and operators. In less... ...As a Senior Software Engineer, Platform at... ...architecture. ~ Extensive cloud infrastructure and... ...have Experience launching systems from ground up... ...video data. Scaled up inference and train compute for...SeniorSoftwareWork at office
- ...Job Description Launch Your Data Career with Proof, Not Promises... ...Data Science, Analytics, or Engineering , it's time to stop guessing... ..., visualization, statistical inference Dashboard design, KPI... ...Infrastructure-as-code and cloud data platforms (AWS, Azure)...Senior
- ...focused on AI workloads is seeking a Member of Technical Staff to design and optimize inference systems. The role involves managing KV cache... ...various components. Ideal candidates should have strong software engineering skills and experience with ML inference systems,...SeniorSoftware
- Acceler8 Talent is looking for a Software Engineer in San Francisco to focus on building and optimizing inference systems for next-generation AI at scale. You will design production inference pipelines and improve system performance under real production constraints. The...SeniorSoftware
$250k - $325k
...Engineering at Ivo Engineers at Ivo are inventors. Ivo was first-to... ...Things break. Regions go down. Cloud and LLM providers have "... ...legal drudgery. People love our software - despite high competition, we... ...How far along are we? We launched in early access in 2023. Since...SeniorSoftwareContract workWork at officeRemote work- A fast-paced aerospace startup in the Bay Area seeks a Senior Launch Automation Engineer to develop systems for orbital rocket launches. You will own algorithms and software for propellant loading and monitor live countdowns in a collaborative team environment. Ideal candidates...Software
- ...Distributed Systems Software Engineer - Public Cloud (Senior/Lead/Principal) Our Public Cloud engineering teams are responsible for innovating and maintaining a large scale distributed systems engineering platform that ships hundreds of features to production for tens...SeniorSoftware
$250.6k - $384.6k
...Sr Manager, AV Behavior Safety Engineering (GPSSC) page is loaded## Sr Manager, AV Behavior... ...Effective Autonomous Driving Software (SAFE‐ADS) department is... ...and directly influence launch decisions for GM’s next generation... ...Trees, Clustering* **Cloud & Big Data Platforms: (...SeniorSoftwareOdd jobRemote workFlexible hours$166k - $225k
A leading data and AI company in San Francisco seeks a Senior Software Engineer to enhance their infrastructure platform. This role requires building multi-cloud systems and scalable solutions for managing data and AI workloads. Ideal candidates have a strong programming...SeniorSoftwareFlexible hours- B Capital is seeking a Senior Engineering Manager to lead the Application Engineering team in San Francisco... .... This role requires extensive experience in software engineering and engineering management, with a strong focus on cloud ERP systems and integrations. A collaborative...SeniorSoftware
- Terra Quantum is looking for a Senior Software Engineer in San Francisco, California, with strong expertise in designing and implementing scalable systems. This role requires passionate individuals well-versed in Golang, Python, Kubernetes, and containerized applications...SeniorSoftware
$85k - $130k
Hewlett Packard Enterprise is seeking a Quality Assurance Engineer in San Francisco to lead test strategy and quality initiatives across software and hardware teams. You will design comprehensive test strategies, develop automation frameworks, and collaborate across functions...SeniorSoftware$181.1k - $318.4k
...Staff/Sr. iOS Engineer - AI, Search & Knowledge Platforms Work Locations (2) Submit Resume Do you want to make Apple products smarter... ...with large codebases and practical solutions ~ Knowledge of software patterns that allow for testing ~ Excellent interpersonal...SeniorSoftwareWork experience placementRelocation$175k - $225k
...ubiquitous. We build the foundation for agent engineering in the real world, helping developers... ..., and Deep Agents), and the newly launched LangSmith Engine for autonomous agent improvement... ...(Postgres, Redis, Clickhouse), and cloud platforms (AWS, GCP, Azure) ~ Strong...SeniorSoftwareWork at officeFlexible hours$230.73k - $302.83k
...looking for a passionate Principal Engineer who will join us in building... ...artist community and product launch platforms. This role is... ...with a deep understanding of software architecture principles, including... ...programming languages, cloud infra, databases, caching, containers...SoftwareFull timeTemporary workWork at officeLocal areaWorldwide- ...in San Francisco is seeking a Senior Systems Administrator to support network systems and implement Automated Litigation Support software solutions. The ideal candidate will have extensive experience in implementing litigation support applications, along with a strong...SeniorSoftware
- The Walt Disney Company (Germany) GmbH is hiring a Senior Software Engineer in San Francisco to develop next-generation audio tools. This hybrid position requires 5+ years in audio video workflows and proficiency in languages like Go and Python. Responsibilities include...SeniorSoftware
$162k - $225k
...Senior Software Engineer, Cloud Services San Francisco, CA Who We Are HP IQ is HP's new AI innovation lab. Combining startup agility with HP's global scale, we're building intelligent technologies that redefine how the world works, creates, and collaborates....SeniorSoftwareFull timeTemporary workLocal areaFlexible hours$176.6k - $239k
...have the technical skills in cloud software development to support... ...AWS Product Marketing Demo Engineering team as a Builder Solution Architect... ...demonstrations for service launches, events, sales enablement,... ...employees, supervisors, and staff; adhere to standards of excellence...SeniorSoftwareLocal areaFlexible hours- ...infrastructure company in San Francisco is seeking an experienced engineer for its Inference Platform team. This role involves managing end-to-end... ...orchestration. Candidates should have deep experience in software engineering, particularly with Python or Go, and be...SeniorSoftware
$170k - $210k
...Senior Software Engineer At Trunk, our mission is to help teams create high-quality software quickly. We've helped engineering teams at... ...teams to land code faster and develop happier. Our founders launched Trunk in 2021 after designing, delivering, and scaling...SeniorSoftwareTemporary workWork at officeShift work$207k - $385k
...Team Join the engineering teams that bring... ...We're seeking Software Engineers who can... ...world impact. From launching net-new capabilities... ...optimizing how we serve inference in unique, high-... ...and in the cloud, for our public sector... ...of Technical Staff . We use Senior Staff...SeniorSoftware- ...Staff/Senior Backend Software Engineer Full-time Hybrid SF Bay Area About Us We are... ...payment processing. After launching in January last year, we processed... ...and develop agentic workflows Cloud Infrastructure : Help maintain and...SeniorSoftwareFull timeWork at officeRemote work
- **Job Title:**Senior Manager, Software Engineering - Cloud Platform **Location:** New York, NY; San Francisco, CA**Customer Focus:** Treating internal developers as our primary customers and prioritizing their velocity and user experience.As a Senior Manager and "Player...SeniorSoftwareWork experience placementShift work
- ...not just building software - we’re building a... ...as both a central engineering function and an embedded... ...across a modern cloud-native stack to... ...new services launch, with the authority... ...engineering leads and staff engineers to... ...services (e.g., LLM inference latency, non-determinism...SeniorSoftwareWork at officeImmediate startWorldwideMonday to FridayFlexible hours
$166.9k - $225.9k
...as both a central engineering function and an embedded... ...intersection of software engineering and... ...across a modern cloud‑native stack to help... ...new services launch, with authority to... ...engineering leads and staff engineers to define... ...(e.g., LLM inference latency, non‑determinism...SeniorSoftwareFlexible hours- ...About This Role At Strava, the Foundation engineering team safeguards the infrastructure... ...Bring to the Team: Proven foundation in software engineering. Comfortable working with various... ...in a containerized microservices cloud environment (e.g. Kubernetes). Experience...SeniorSoftwareWork at officeFlexible hours3 days per week
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Staff + Sr. Software Engineer, Cloud Inference Launch Engineering. Be the first to apply!
Related searches
- graduate software developer San Francisco, CA
- rust software engineer San Francisco, CA
- senior software design engineer San Francisco, CA
- software engineer student San Francisco, CA
- software engineer amazon San Francisco, CA
- software developer positions San Francisco, CA
- software engineer full time San Francisco, CA
- software qa engineer San Francisco, CA
- new graduate software engineer San Francisco, CA
- junior software developer San Francisco, CA


