Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Software Engineer, Node Infra

$320k - $405k

anthropic

Staff Infrastructure Engineer, Node Infra

Anthropic's Infrastructure organization is foundational to our mission of developing AI systems that are reliable, interpretable, and steerable. The systems we build determine how quickly we can train new models, how reliably we can run safety experiments, and how effectively we can scale Claude to millions of users — demonstrating that safe, reliable infrastructure and frontier capabilities can go hand in hand.

Node Infra owns the full lifecycle of accelerator capacity at Anthropic. We ingest and provision compute from all major CSPs and our own datacenters, stand up and scale clusters from thousands to hundreds of thousands of hosts, and build the health, diagnostics and repair automation that keep every GPU, TPU and Trainium node in the fleet usable and ready to power Anthropic's frontier AI research.

Key Responsibilities
  • Own the technical strategy and roadmap for node lifecycle management - ingestion, bring-up, health checking, and automated repair
  • Drive cross-team initiatives to build and scale AI clusters across multiple clouds and accelerator families
  • Design and operate the systems that detect, isolate, and remediate unhealthy hardware automatically, driving up fleet MTBI and minimizing stranded capacity
  • Define infrastructure architecture, ensuring the hardest problems get solved - whether by you directly or by working through others
  • Work closely with cloud providers and internal research/inference/product teams to shape long-term compute, data, and infrastructure strategy
  • Establish and evolve operational excellence practices (incident response, postmortem culture, on-call)
  • Support the growth of engineers around you through technical mentorship and coaching
Minimum Qualifications
  • Deep expertise in distributed systems, reliability, and cloud platforms (e.g., Kubernetes, IaC, AWS/GCP/Azure)
  • Strong proficiency in at least one systems language (e.g., Rust, Go, or Python), IaC proficiency with Terraform.
  • Hands-on experience with machine learning accelerators (GPUs, TPUs, or Trainium)
  • Track record of leading complex, multi-quarter technical initiatives that span multiple teams or systems
  • Ability to build alignment across senior stakeholders and communicate effectively at all levels
Preferred Qualifications
  • 8+ years of software engineering experience, including time as a technical lead setting direction for a team
  • Experience managing large scale compute infrastructure at hyperscale (10K+ nodes), including capacity management and efficiency
  • Depth in one or more of: Kubernetes internals (scheduler, autoscaler, kubelet, Karpenter), cluster orchestration systems (Mesos, Borg-like), or node provisioning pipelines
  • Low-level systems experience: kernel, virtualization, device drivers, firmware, or hardware health/diagnostics daemons
  • Familiarity with high-performance networking (EFA, RDMA, InfiniBand) for distributed ML workloads.
  • Demonstrated ownership of production reliability for high-throughput, latency-sensitive systems
  • Contributions to relevant open-source projects (Kubernetes, Linux kernel, container runtimes, etc.)
  • Skill in quickly understanding systems design tradeoffs and keeping track of rapidly evolving software systems

The annual compensation range for this role is listed below. For sales roles, the range provided is the role's On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.

Annual Salary:

$320,000 - $405,000 USD

Logistics

Minimum education: Bachelor's degree or an equivalent combination of education, training, and/or experience

Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience

Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position

Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.

Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings.

How We're Different

We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.

The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.

Come Work With Us!

Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Staff Software Engineer, Node Infra in San Francisco, CA vacancy
  • $208.45k - $364.8k

     ...our recruiting process here. The Cache Infra team manages one of Pinterest's most vital...  ...-edge technologies. Maintain a high engineering standard for cache infrastructure, focusing...  .... 8+ years of hands-on backend software engineering experience in large-scale distributed... 
    Suggested
    Full time
    Work at office
    Local area
    Relocation
    Relocation package

    Pinterest

    San Francisco, CA
    2 days ago
  • A high-growth fintech company in San Francisco seeks a knowledgeable Staff Software Engineer to design and maintain scalable systems. The role involves leading technical initiatives and mentoring engineers while collaborating with cross-functional teams. Candidates should... 
    Suggested

    Jack & Jill/External ATS

    San Francisco, CA
    2 days ago
  • $210k - $275k

    Insider, Inc. is seeking experienced software engineers to join a dynamic team in developing an AI-native operating system for regulated finance. The position involves designing robust infrastructure while ensuring regulatory compliance, and ideal candidates will have... 
    Suggested
    Remote work

    Insider, Inc.

    San Francisco, CA
    4 days ago
  • $209k - $253k

    A leading AI infrastructure company in San Francisco seeks a Staff Software Engineer to design and develop control systems for GPU node management. The candidate will be critical in building foundational cloud infrastructure and achieving business goals. This role requires... 
    Suggested

    Crusoe Energy Systems LLC

    San Francisco, CA
    1 day ago
  • $200k - $275k

     ...company. The Role As one of the foundational members of our Engineering team, you will architect and develop petabyte-scale data...  ...RL systems at scale What You'll Bring ~7+ years of software development experience ~ Strong technical foundation and breadth... 
    Suggested

    Watney Robotics Inc

    San Francisco, CA
    2 days ago
  • $350k

     ...Francisco is looking for a Member of Technical Staff specializing in distributed systems. You...  ...execution across thousands of nodes while ensuring reliability and fault-tolerance. Ideal candidates have strong software engineering fundamentals and experience in production... 

    Acceler8 Talent

    San Francisco, CA
    4 days ago
  • $180k - $300k

     ...Join to apply for the Software Engineer (Infra) role at Numeral . This range is provided by Numeral. Your actual pay will be based on your skills...  ...infrastructure systems in high-growth environments. Proficiency in Node.js, PostgreSQL, Redis, and AWS (or equivalent cloud... 
    Full time
    Immediate start
    Remote work
    Flexible hours

    Numeral

    San Francisco, CA
    23 hours ago
  •  ...to get your help as we're hiring several extremely talented software engineers across the stack. In this role, you will... Build...  ...platforms that power Pylon's AI features - prompt executions, search infra, and more! Improve LLM observability - AI evals (online... 
    Work at office
    Relocation

    Pylon Labs

    San Francisco, CA
    3 days ago
  •  ...about the mission (and each other), we'd love to meet you. About the Role We're looking for our first engineer focused on infrastructure to start and lead Infra at Amperos. You'll get to own dev ops, dev experience, compliance, observability, monitoring for our AWS... 
    Work at office
    Flexible hours

    Amperos Health, Inc

    San Francisco, CA
    2 days ago
  • $170k - $250k

     ...Senior Infra Software Engineer Title of Role: Senior Infra Software Engineer Location: San Francisco, onsite Company Stage of Funding: Seed - Software Development, Devtools, AI Office Type: Onsite Salary: $170K-$250K Company Description We're... 
    Work at office

    Recruiting from Scratch

    San Francisco, CA
    2 days ago
  • $225k - $300k

     ...About the job Staff Software Engineer, Full Stack Staff Software Engineer, Full Stack Hybrid | North America | IC-Only Read This...  ...Engineering managers looking for an IC reset Backend-only or infra-only profiles Fully remote-only candidates Engineers... 
    Work at office
    Remote work
    3 days per week

    Transparent Search Group

    San Francisco, CA
    23 hours ago
  • $150k - $230k

     ...Staff Software Engineer, Forward Deployed fal is the generative media ecosystem powering the next generation of AI products. We build the infrastructure...  ...customer issues end-to-end across frontend, backend, and infra layers Translate customer feedback into clear product... 
    Currently hiring
    Relocation package

    fal

    San Francisco, CA
    5 days ago
  •  ...About Flow Flow Engineering is an AI-native requirements platform...  ...role Flow is hiring a staff frontend software engineer to own core user...  ...with design, product, infra, and AI engineers to make complex...  ...built in TypeScript/Node.js. How we work & values... 
    Flexible hours

    Flow Engineering

    San Francisco, CA
    2 days ago
  • $2,000 per month

     ...world of data with you. About The Role As a Principal/Staff Software Engineer , you will help build out the next generation data platform...  ...to big data / analytics OSS projects or internal data infra products Experience working in digital-native scale-up data... 

    Nextdata

    San Francisco, CA
    23 hours ago
  • $240k - $270k

     ...Staff Software Engineer - EC Lifecycle Redwood City, CA (Hybrid); San Francisco, CA (Hybrid) About Snorkel At Snorkel, we believe meaningful...  ...-functional teams to design and deliver improvements to dev-infra, release processes, and internal tooling. Engage with... 
    Work at office
    Local area
    3 days per week

    Snorkel AI

    San Francisco, CA
    4 days ago
  •  ...spans: Benefits Platform – the core engine powering eligibility, recommendations,...  ...real engagement. Be the connector: Align infra, security, compliance, and product...  ...Deep experience: 10+ years of professional software engineering, with a track record of building... 
    Work at office
    Worldwide
    3 days per week

    MyHealthTeam

    San Francisco, CA
    2 days ago
  •  ...Infrastructure Team Lead Serve as the lead of our infrastructure team Ownership of our entire infra stack Analyze, design, develop, maintain, and improve observability & monitoring stack Push the boundaries of what is possible. Skills: Kubernetes, Helm,... 

    Reflex USA

    San Francisco, CA
    2 days ago
  • $207k - $362.25k

     ...the systems that allow 1,000+ engineers to build and ship products at...  ...workflows, the way we build software is fundamentally changing. Rippling...  ...We are looking for a Senior Staff Engineer to lead the...  ...Work cross-functionally with Infra, Platform, and Product engineering... 
    Work at office
    Local area
    3 days per week

    Rippling

    San Francisco, CA
    3 days ago
  • $200k - $400k

     ...this entire capability end-to-end. About the Role As a Staff Software Engineer focused on Voice Agent, you will lead the architecture and...  ...to integrate the next generation of speech models, with Infra to push the boundaries of latency and performance, and with... 
    Full time
    Work at office
    Local area

    Decagon

    San Francisco, CA
    4 days ago
  •  ...use case runs through the perception team. We're hiring a Staff Software Engineer to own ML Infrastructure at Voxel. Our applied ML team is...  ...You can explain complex tradeoffs clearly to ML researchers, infra peers, and leadership Nice to Have Production experience... 
    Work at office
    Flexible hours

    Voxel Labs

    San Francisco, CA
    23 hours ago
  • $200k - $400k

     ...is highly experimental, frontier-style engineering. The team continuously analyzes real-world...  .... About the Role As a Staff Software Engineer on the Agent Orchestration team...  ...'ll collaborate closely with Research, Infra, and Product teams to ship improvements... 
    Full time
    Work at office
    Local area

    Decagon

    San Francisco, CA
    4 days ago
  • $200k - $400k

     ...ergonomics. We organize around five focus areas: Core Infra: The foundational cloud stack-networking, compute, storage, security...  .... About the Role We're hiring a Senior Infrastructure Engineer to design, build, and operate production infrastructure for... 
    Full time
    Work at office
    Local area

    Decagon

    San Francisco, CA
    2 days ago
  • $281k - $356k

     ...Senior Staff Software Engineer, Model Post Training Waymo is an autonomous driving technology company with the mission to be the world's most...  ...leadership to influence senior engineers and researchers across ML, infra, and data teams. Raise the technical bar for how Waymo... 
    Full time
    Remote work

    Waymo

    San Francisco, CA
    4 days ago
  • $200k - $275k

     ...every morning. You'll work alongside our data scientists and ML engineers to build and operate the infrastructure that makes this...  ...networking Background at an early-stage startup where you owned infra end-to-end This Role Is Not For You If Startup unpredictability... 
    Home office
    Day shift

    Healthleap AI

    San Francisco, CA
    2 days ago
  •  ...looking for a member of the AI technical staff to join the founding team. Someone technically...  ..., etc. Responsibilities: Scale infra for post-training of multimodal LLMs (CPT...  ...web-agent Work closely with product engineers to translate cutting-edge AI capabilities... 
    Work at office
    Relocation
    Visa sponsorship

    Yutori

    San Francisco, CA
    12 days ago
  • $240k

     ...fundamentally change how software is built on the...  ...has assembled a team of engineers who have built and designed...  ...looking for exceptional staff or principal-level engineers...  ...operating large-scale infra, we'd love to talk!...  ...that might disappear if a node restarts, or setting... 
    Full time
    Work at office
    Remote work
    Shift work
    Night shift

    Convex

    San Francisco, CA
    7 hours ago
  •  ...who want to have outsized impact early. Our core engineering team is looking for a builder, a Staff Software Engineer who thrives in early-stage (ambiguous) environments...  ...orchestration systems Know your way around cloud infra Have experience building integrations or data... 

    Saris AI

    San Francisco, CA
    3 days ago
  • $192k - $260k

     ...growing SaaS companies in the world. Our engineering teams build highly technical products...  ...and operate one of the largest scale software platforms. The fleet consists of millions...  ...different clouds and environments. Core Infra: Build the core infrastructure that... 
    Work at office
    Local area
    Worldwide
    Flexible hours

    Databricks

    San Francisco, CA
    4 days ago
  • $240k - $300k

     ...About Sentry Software runs the world and the pace is faster than ever. Sentry helps developers...  ...About the Role This isn't a typical engineering role. You won't be embedded in a single...  ...agents work. This could be CI/CD or dev infra experience, or it could be AI harness... 
    Remote work
    Work from home
    Shift work

    Sentry

    San Francisco, CA
    3 days ago
  •  ...responsible for designing, building and operating the software and solutions that connect all Airbnb users and...  ...Difference You Will Make: As a member of the network infra team, you will be working with talented engineers on cutting edge technologies of cloud native... 
    Casual work
    Live in
    Work at office
    Remote work

    GrabJobs

    San Francisco, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Software Engineer, Node Infra. Be the first to apply!