Senior Staff+ Software Engineer, Kubernetes Platform

$320k - $405k

Menlo Ventures

About Anthropic

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the role

Anthropic runs some of the largest Kubernetes clusters in the industry. We have fleets of hundreds of thousands of nodes across multiple cloud providers and datacenters to train, research, and serve frontier AI models. The Kubernetes Platform team owns the Kubernetes control plane that makes those clusters work.

We are operating at a scale where the defaults stop working. We own the scheduler and extend it to place topology-sensitive ML workloads across thousands of accelerators at once. We scale the control plane itself (apiserver, etcd, controllers) so it stays responsive as object counts and node counts grow by orders of magnitude. And we build the core cluster services every workload depends on, like service discovery, so they hold up under the same pressure.

We make sure the control plane is fast, correct, and always available. Your work will directly determine whether Anthropic can keep reliably and safely training frontier models as our compute footprint continues to grow.

Key responsibilities
  • Own, operate, and extend the Kubernetes scheduler for Anthropic's accelerator fleets, including custom scheduling plugins and policies for gang scheduling, topology awareness, and preemption
  • Scale the Kubernetes control plane (apiserver, etcd, controller-manager) to support clusters far beyond typical limits, and find the next bottleneck before it finds us
  • Design, build, and operate core cluster services such as service discovery that every workload in the fleet depends on
  • Build and maintain custom controllers, operators, and CRDs
  • Partner with research, training, and inference to understand workload shapes and turn their requirements into platform capabilities
  • Collaborate with cloud providers on required features and escalations
  • Participate in on-call, lead incident response, and design processes (postmortems, runbooks, SLOs) that help the team avoid repeating failures

Minimum qualifications
  • Significant software engineering experience building and operating production distributed systems
  • Proficiency in at least one systems-appropriate language (e.g., Go, Python, Rust, or C++)
  • Deep, hands-on Kubernetes experience, well beyond "user of": work on the scheduler, controllers, or apiserver, or operating large multi-tenant clusters
  • Demonstrated ability to debug complex issues across the stack, from API behavior down to node and network-level root causes
  • A track record of designing for reliability, correctness, and clear failure semantics in systems other engineers depend on
  • Strong written and verbal communication; comfort building consensus with internal stakeholders

Preferred qualifications
  • Experience with Kubernetes internals or contributions: kube-scheduler / scheduling framework, apiserver, etcd, client-go, controller-runtime, or similar
  • Experience building or operating cluster schedulers or batch systems (e.g., Kueue, Volcano, Slurm, or in-house equivalents)
  • Background scaling control planes or coordination systems (etcd, ZooKeeper, Consul, or large DNS/service-mesh deployments)
  • Familiarity with ML infrastructure: GPUs, TPUs, or Trainium; gang scheduling; topology-aware placement; collective networking such as NCCL
  • Experience with GCP and/or AWS, including GKE/EKS internals and Infrastructure as Code
  • Low-level systems experience such as Linux kernel tuning, cgroups, or eBPF
  • 8+ years of relevant industry experience, including time leading large, ambiguous infrastructure projects

Annual Salary: $320,000 - $405,000 USD

Vacancy posted 5 days ago
