Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Director, Site Reliability Engineering

Clover Health

Senior Manager, Site Reliability Engineering Remote - USA At Counterpart Health, we are transforming healthcare and improving patient care with our innovative primary care tool, Counterpart Assistant. By supporting Primary Care Physicians (PCPs), we deliver improved outcomes at lower cost through early diagnosis and longitudinal care management of chronic conditions. We're looking for a Senior Manager of Site Reliability Engineering to join our team. You'll lead a team of ~10 SREs across North America, UK, HK, and New Zealand — owning both the day-to-day operations and the long-term technical direction of the SRE organization. This role sits at the intersection of people leadership, technical depth, and strategic partnership: you're here to make Counterpart’s infrastructure reliable, scalable, and cost‑efficient — and to transform the SRE team's engagement model from reactive support to proactive collaboration with our product engineering pillars. Responsibilities Lead and grow our SRE team of ~10 engineers, including hiring, retention, career development, and performance management across multiple time zones (US, HK, NZ). Build strategic partnerships with product engineering pillars — shifting SRE from reactive, ticket‑based support to proactive co‑ownership of reliability outcomes. Scale our multi‑tenant infrastructure to support new customer onboarding and growing patient populations. Own cloud cost management and FinOps practices, building frameworks that balance cost control with reliability and performance. Champion developer self‑service and platform engineering. Build self‑service capabilities so product teams can manage routine operations without filing SRE tickets. Establish SLOs/SLIs for critical services and improve alert quality so every page is meaningful. Ensure the SRE team is fully leveraging AI tooling in their workflows — using tools like Claude Code for IaC generation, log analysis, root cause investigation, and automating repetitive work — at the same level as the rest of engineering. Qualifications You have 6+ years managing an SRE team and 10+ years of hands‑on SRE or infrastructure engineering experience. You're deeply comfortable with our core stack: Kubernetes, GCP (GKE, Cloud SQL, Pub/Sub, GCS), Terraform, Helm, ArgoCD, PostgreSQL, and Prometheus/Grafana. You have strong programming skills in Python and/or Go, and you're comfortable writing and reviewing infrastructure tooling code — including using AI coding tools to do so. You have experience with CI/CD pipelines (GitHub Actions) and a track record of building or improving developer tooling and automation. You have sound build vs. buy judgment — you default to the right answer, not the easiest one, and you're comfortable building internal tooling when existing solutions don't fit. You have experience leading teams across multiple time zones and a track record of developing engineers into strong technical contributors. Benefits Overview Financial Well‑Being : Our commitment to attracting and retaining top talent begins with a competitive base salary and equity opportunities. Additionally, we offer a performance‑based bonus program, 401k matching, and regular compensation reviews to recognize and reward exceptional contributions. Physical Well‑Being : We prioritize the health and well‑being of our employees and their families by providing comprehensive medical, dental, and vision coverage. Your health matters to us, and we invest in ensuring you have access to quality healthcare. Mental Well‑Being : We understand the importance of mental health in fostering productivity and maintaining work‑life balance. To support this, we offer initiatives such as No‑Meeting Fridays, monthly company holidays, access to mental‑health resources, and a generous flexible time‑off policy. Additionally, we embrace a remote‑first culture that supports collaboration and flexibility, allowing our team members to thrive from any location. Professional Development : Developing internal talent is a priority for Clover. We offer learning programs, mentorship, professional development funding, and regular performance feedback and reviews. Employee Stock Purchase Plan (ESPP) offering discounted equity opportunities Reimbursement for office setup expenses Monthly cell phone & internet stipend Remote‑first culture, enabling collaboration with global teams Paid parental leave for all new parents And much more! About Counterpart Health: In 2018, Clover Health set out to do something unprecedented: build a clinically intuitive, AI‑enabled solution that fits within physicians' workflows to help support the earlier diagnosis and management of chronic conditions. Years later, that vision is a reality, with thousands of practitioners using Counterpart Assistant during patient visits to improve disease management, reduce medical expenses, and drive success in value‑based care. With an exceptional team of value‑based care and technology experts, Counterpart Health is driving value‑based care at the speed of software. Counterpart Health is a subsidiary of Clover Health. From Clover’s inception, Diversity & Inclusion have always been key to our success. We are an Equal Opportunity Employer and our employees are people with different strengths, experiences, perspectives, opinions, and backgrounds, who share a passion for improving people's lives. Diversity not only includes race and gender identity, but also age, disability status, veteran status, sexual orientation, religion and many other parts of one’s identity. All of our employee’s points of view are key to our success, and inclusion is everyone's responsibility. #LI-REMOTE #J-18808-Ljbffr

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Director, Site Reliability Engineering in New York, NY vacancy
  • $126k - $248k

     ...As a Senior TPM for SRE, you will partner with SRE leaders and engineers to scale the platform that underpins all of MongoDB’s cloud...  ...products. You will drive program execution, strengthen production reliability practices, and coordinate cross-functional efforts across US... 
    Suggested
    Work at office
    Local area

    INSIDER

    New York, NY
    1 day ago
  •  ...Applications Deployment Responsible for reliability and support of Container Platform on-...  ...Perform blameless RCA, partner with engineering and operation teams across the...  ...Additional Skills : Automation Process Engineer,Site Reliability Engineer,Full Stack DeveloperThis... 
    Suggested

    Kaav Inc.

    New York, NY
    3 days ago
  • $175k - $225k

     ...Site Reliability Engineer Chicago, IL or New York, NY Old Mission is a global proprietary trading firm that leverages state-of-the-art technology and research to identify and execute profitable trading strategies across multiple asset classes around the world. Our... 
    Suggested
    Full time
    Work at office
    Remote work
    Monday to Friday
    Flexible hours
    Rotating shift

    Old Mission Capital

    New York, NY
    2 days ago
  • $100k - $250k

     ...financial markets. Role Roadmap As a member of Kalshi's engineering team, you'll help build the next-generation financial...  ..., and evolve. What You'll Do Improve observability, reliability, and service availability by defining and measuring key metrics... 
    Suggested
    Local area

    Kalshi

    New York, NY
    4 days ago
  •  ...Site Reliability Engineer II Our engineering fleet is a horizontal set of teams providing engineering services across the organization. Our specific team provides reliability engineering and operational support to backend service development teams. Technology is... 
    Suggested

    Disney France

    New York, NY
    1 day ago
  • $176.75k - $209.1k

     ...development and learning. It allows us to scale easily, enabling our engineers to maximize attention on new features and capabilities. A...  ...all over the world. Peloton is looking for a Site Reliability Engineer with an operations focus to work with teams across... 
    Temporary work
    Local area

    Peloton

    New York, NY
    1 day ago
  •  ...Site Reliability Engineer (SRE) We are seeking a highly skilled Site Reliability Engineer (SRE) to ensure the reliability, scalability, performance, and availability of mission-critical applications and infrastructure. The ideal candidate will combine software engineering... 
    Full time
    Remote work

    Ova Technologies

    New York, NY
    1 day ago
  • $150k - $175k

     ...Site Reliability Engineer At ASAPP, our mission is simple: deliver the best AI-powered customer experience—faster than anyone else. To achieve that, we're guided by principles that shape how we think, build, and execute. We value customer obsession, purposeful speed... 
    Remote work

    ASAPP

    New York, NY
    14 hours ago
  •  ...About the job Senior Site Reliability Engineer About the Company Stellar is a decentralized, public blockchain that gives developers the tools to create experiences that are more like cash than crypto. The network is faster, cheaper, and far more energy-efficient... 

    TechChain Talent

    New York, NY
    14 hours ago
  •  ...self-healing, deployment/rollback automation). Establish reliability standards: SLOs/SLIs, error budgets, production readiness reviews...  ..., and release risk controls. Performance and reliability engineering: capacity planning, load/performance analysis, resilience... 

    Bahwan CyberTek

    New York, NY
    2 days ago
  • $182.3k - $220k

     ...healthcare by putting patients first - and that mission depends on reliable, secure, and scalable systems. As a Senior SRE on the...  ...hardening infrastructure and building tools that empower our engineers to ship safely and confidently. You will work across teams... 
    Local area
    Flexible hours

    Ro

    New York, NY
    2 days ago
  • $160k - $250k

     ...to transform how enterprises manage and engage with their IT ecosystems. About the Role We're looking for a Senior Site Reliability Engineer (SRE) to own the reliability, performance, and scalability of our AI-native platform. You'll operate at the intersection... 
    Work at office
    Local area

    Standard Template Labs

    New York, NY
    4 days ago
  •  ...Site Reliability Engineer Visa: USC,GC only Rate: DOE Position is remote to start, then after conversion to W2, moves into one of three offices: Nashville, Los Angeles or New York Job Description: Strong problem solving/triage skills Strong cloud/infrastructure... 
    Remote work

    ShiftCode Analytics

    New York, NY
    1 day ago
  •  ...DevOps Engineer DevOps teams in our Infrastructure Engineering group enable Company to continually disrupt the Insure tech space. Our teams build, maintain and deliver infrastructure that enables Company Life product teams to ship industry leading and innovative systems... 

    MRINetwork

    New York, NY
    4 days ago
  •  ...Senior Site Reliability Engineer (SRE) Our client is a global technology consulting and digital solutions company that enables enterprises across industries to reimagine business models, accelerate innovation, and maximize growth by harnessing digital technologies.... 
    Local area

    E-Solutions

    New York, NY
    14 hours ago
  • $165k - $225k

     ...operationalize AI and run it as a true business performance engine delivering measurable value. For more, visit the Dataiku blog...  ...taking a look here. How you'll make an impact: As a Site Reliability Engineer (SRE) with advanced expertise in networking and security... 
    Work at office
    Flexible hours

    Dataiku

    New York, NY
    14 hours ago
  • $133k - $185k

     ...applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems. As a Site Reliability Engineer III at JPMorgan Chase within the Chief Data & Analytics Office (CDAO) AI/ML & Data Platforms team,, you will solve complex... 
    Work at office

    JPMorgan Chase Bank, N.A.

    Jersey City, NJ
    2 days ago
  • $189k - $283.6k

     ...the SRE team, you will proactively and reactively improve the reliability of Block's platform and critical infrastructure. You are metrics...  ...~ A strong desire to perform and grow as an engineer ~5+ years of software development experience Technologies... 
    Full time
    Local area
    Remote work
    Relocation package
    Flexible hours
    Shift work

    Block USA

    New York, NY
    4 days ago
  • $89k - $178k

     ...Sr. Site Reliability Engineer I NYC Global HQ Hybrid (3 days per week in office) DV is the leader in digital performance solutions, helping our advertiser and agency partners verify the quality of their digital campaigns, optimize to improve performance and prove... 
    Work at office
    3 days per week

    DoubleVerify

    New York, NY
    1 day ago
  • $125k - $350k

     ...Site Reliability Engineer New York, Miami, Gurugram, London, Singapore, Sydney Job Description Opportunities may be available from time to time in any location in which the business is based for suitable candidates. If you are interested in a career with Citadel... 

    Citadel Securities

    New York, NY
    14 hours ago
  • $127k - $249k

     ...The Team Platform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational...  ...fleet, alongside the critical components that ensure cluster reliability and security (e.g., CoreDNS, cert-manager, and Gatekeeper).... 
    Work at office
    Local area
    Remote work
    Worldwide
    Flexible hours

    MongoDB

    New York, NY
    6 days ago
  •  ...Site Reliability Engineer I, Abhishek, would like to share a job opportunity as Site Reliability Engineer in Jacksonville, FL, Cary, NC or New York, NY (Onsite) location for a Fulltime position. In case, if you are not comfortable with this location, please share your... 
    Full time
    Work visa

    Syntricate Technologies

    New York, NY
    4 days ago
  •  ...they are shifting towards Linux – (70% Windows, 30% Linux) Remote access technology protocols are a plus Job Description: Site Reliability Engineer Periodic updates and maintenance of Windows-based golden image for ESX & AWS. Patching of software, systems, appliances etc... 
    Remote work
    Shift work

    TechDigital Group

    New York, NY
    1 day ago
  •  ...Job Description Job Description Forhyre is looking for engineers who can bring unique perspectives and innovative ideas to all areas...  ...evangelize cloud best practices while building a culture of reliability and observability Engage in and improve the end to end lifecycle... 

    Forhyre

    New York, NY
    14 days ago
  • $65 - $75 per hour

     ...virtualization technologies. Knowledge of ITIL frameworks, Jira, Confluence, and IT Service Management tools. Description: As an Engineer 2, you will collaborate with management, departments, and customers to identify end-user requirements for infrastructure monitoring... 
    Contract work
    Remote work

    SBS Creatix

    New York, NY
    14 hours ago
  • $130k - $165k

     ...Job Title: Senior Software Engineer Company: Snapsheet Job Location: USA, Remote Job Type: Full-time, direct hire Job Department: Technology Team: Site Reliability Engineering About Snapsheet Snapsheet exists to simplify claims. We leverage... 
    Full time
    Temporary work
    Local area
    Remote work
    Visa sponsorship
    Work visa
    Flexible hours

    Snapsheet

    New York, NY
    4 days ago
  • $160k - $230k

     ...We are currently looking to add Platform Engineers to our team, with at least 5 years of experience...  .... You’ll ensure our platform is reliable, secure, and performant from day one. Responsibilities...  ...collaborative setting. Our team works on-site five days a week, growing and building... 
    Work at office
    Local area

    Standard Template Labs

    New York, NY
    1 day ago
  •  ...Curated careers, resources, tips and trends from the DevOps World. The Site Reliability Engineer position at Remotive revolves around ensuring the reliability, availability, and performance of services. This role requires a combination of software engineering and system... 
    Remote work

    DevOpsChat

    New York, NY
    14 hours ago
  •  ...Participate in an oncall rotation. Work with teams across the company to ensure we achieve the right balance of developer velocity, reliability and performance, and cost efficiency. What You’ll Bring 5+ years of experience Experience with containerization and orchestration... 

    SwiftCruit

    New York, NY
    14 hours ago
  •  ...Site Reliability Engineer OXIO is the first NeoTelco. We are building the world’s largest, most accessible, and insightful Telecom network. Our platform empowers anyone to spin up their own carrier from a browser, scaling and supporting you as you scale your network to... 

    MoneyLion

    New York, NY
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Director, Site Reliability Engineering. Be the first to apply!