Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Site Reliability Engineer

$150k - $195k
Full-time

Careerscape

Our client is a vertical B2B SaaS company serving the property management industry, with more than 12,000 customers, 200 employees, and a six-year track record of profitable growth. They're hiring a remote Senior SRE.

Responsibilities

  • Own the reliability of one product domain end-to-end including SLOs, error budgets, and quarterly reliability reviews with that domain's product engineering team
  • Participate in a 1-week-in-6 production on-call rotation with full pay differential
  • Lead incident response for severity-1 and severity-2 events; write or co-author post-incident reviews
  • Maintain and extend the Terraform modules used across the platform (50+ AWS accounts)
  • Improve EKS / Kubernetes platform: cluster upgrades, node pool optimization, autoscaling, cost tuning
  • Build observability for your domain: Datadog dashboards, alerts, and traces tuned to actual user journeys
  • Lead capacity planning for known seasonal load (month-end, quarter-end) and run game-day exercises
  • Mentor product engineers on production excellence: runbooks, alert design, dashboard hygiene, and rollback patterns

Requirements

  • 5+ years in SRE, DevOps, or platform engineering at companies running production systems for paying customers
  • Hands-on production AWS experience across compute (EKS, ECS), data (RDS, Aurora), networking (VPC, ALB), and security (IAM, KMS, GuardDuty)
  • Strong Terraform skills including module design, state management, and CI integration
  • Production Kubernetes experience including upgrades, RBAC, networking (CNI, ingress), and resource management
  • Comfort with at least one scripting language (Python or Go preferred) for tooling and automation
  • Production experience with an observability platform (Datadog, Grafana stack, New Relic, or Honeycomb)
  • Track record leading incidents and writing high-quality post-incident reviews
  • Strong written communication; comfort working async with a distributed team

Benefits

  • Base salary $150,000-$195,000 plus on-call differential ($150 per on-call shift) and 10-15% target annual bonus
  • Equity grant with standard 4-year vesting and 1-year cliff
  • Comprehensive medical, dental, and vision (employer pays 100% of employee premium)
  • 401(k) with 6% match, immediate vesting
  • $3,000 annual learning and conference stipend (AWS re:Invent and KubeCon are pre-approved)
  • $1,500 home office setup stipend plus $100/month ongoing internet reimbursement
  • Unlimited PTO with a 15-day minimum and a paid sabbatical at the 5-year mark

Job Type: Full-Time | Work Type: Remote | Industry: Technology | Experience: Senior

Vacancy posted a month ago
Similar jobs that could be interesting for youBased on the Site Reliability Engineer in Washington DC vacancy
  • Senior Site Reliability Engineer Job Description Overview CoStar Group (NASDAQ: CSGP) is a leading global provider of commercial and residential real estate information, analytics, and online marketplaces. Included in the S&P 500 Index and the NASDAQ 100, CoStar Group... 
    Suggested
    Full time
    Work at office
    Work from home
    Monday to Thursday

    Visual Lease

    Arlington, VA
    1 day ago
  • $166k - $220k

    ABOUT THE JOB As a site reliability engineer in Platform Discovery, you will solve a wide variety of problems involving networking, autonomy, systems integration, robotics, and more, while making pragmatic engineering tradeoffs along the way. Your efforts will ensure that... 
    Suggested
    Full time
    Work experience placement
    Relocation package

    Slope

    Washington DC
    1 day ago
  • Role Overview We are seeking a high-caliber Site Reliability Engineer (SRE) to join our Forward Engineering team. You will be the guardian of our production ecosystems, ensuring that our complex, data-driven AI platforms remain resilient, scalable, and highly performant... 
    Suggested
    Local area

    Tiger Analytics Inc.

    Washington DC
    1 day ago
  • $96k - $151.8k

     ...Remotely? Y Position Type Management Bonus Eligible: Y Expiration Date: 06/22/2026 JOB SUMMARY: The Systems Engineer - Site Reliability Engineering (SRE) is responsible for the reliability, scalability, and performance of mission-critical cloud and on-prem... 
    Suggested
    Full time
    Remote work
    Flexible hours

    Marriott

    Bethesda, MD
    1 day ago
  • Description Onsite in Washington, DC Our client seeks a Sr. Site Reliability Engineer III to design, automate, and operate mission-critical systems for federal environments. The role focuses on Kubernetes or VMWare platforms, CI/CD enablement, observability, and developer... 
    Suggested
    Permanent employment
    Full time
    Immediate start

    Eliassen Group

    Washington DC
    9 hours ago
  • $51.9 per hour

     ...OVERVIEW: This job is responsible for the reliability, availability, and performance of...  ...operational efficiency. This role blends software engineering, clinical engineering, and security...  .... Works cross-functionally with AHN site leaders and teams to navigate and to... 
    For contractors
    Local area

    Highmark Health

    Washington DC
    4 days ago
  • $160k - $200k

     ...Engineering Leader Filevine is a Legal AI company delivering Legal Operating Intelligence for the future of legal work. Grounded in...  ...looking for an experienced engineering leader to spearhead system reliability, drive platform project execution, and work in close... 
    Full time
    Temporary work
    Work experience placement

    Filevine

    Washington DC
    3 days ago
  • $60 per hour

     ...including front-end, back-end, full-stack, machine learning, and other engineers — who are driving real-world impact in AI development.Our...  .... Those located outside of these countries will not see work or assessments available on our site at this time.J-18808-Ljbffr... 
    Hourly pay
    Full time
    Remote work
    Flexible hours

    DataAnnotation

    Washington DC
    3 days ago
  •  ...Python, and PowerShell, integrating systems, and managing Microsoft Entra services. A minimum of 5 years of experience in systems engineering is required along with a Bachelor's degree in Computer Science. The position offers a hybrid work model as employees must be... 
    Local area

    Highlighttech

    Washington DC
    3 days ago
  • $126k - $248k

    As a TPM for SRE, you will partner with SRE leaders and engineers to scale the platform that underpins all of MongoDB’s cloud products. You will drive program execution, strengthen production reliability practices, and coordinate cross-functional efforts across US and EMEA... 
    Local area
    Remote work
    Worldwide
    Flexible hours

    MongoDB

    Washington DC
    1 day ago
  • Geico is seeking a Staff Engineer to innovate and enhance systems while mentoring engineers and collaborating across teams. This position involves utilizing programming languages like Go and Python, working with Azure services, Docker, and Kubernetes, and requires 6+ years... 

    Geico

    Bethesda, MD
    9 hours ago
  • Salesforce is seeking a Site Reliability Engineer in Washington, DC to ensure cloud services availability. This role involves monitoring services, incident management, and driving automation for resilient systems. Candidates should have a Bachelor's in Computer Science... 

    Salesforce

    Washington DC
    2 days ago
  • $110k - $230k

     ...Pledge: Great Company, Great Culture, Great Rewards and Great Careers. GEICO's Cyber Security Engineering & Analytics, Automation (SEA) team is seeking a Staff Cyber Site Reliability Engineer (SRE) — a hands-on, engineering-minded practitioner who is passionate about... 
    Hourly pay
    Full time
    Work experience placement
    Local area
    Flexible hours

    GEICO

    Bethesda, MD
    2 days ago
  • $147.4k - $221.2k

    Senior Site Reliability Engineer page is loaded## Senior Site Reliability Engineerremote type: Flexlocations: USA, VA, McLean: USA.VA.Restontime type: Full Timeposted on: Posted Yesterdayjob requisition id: JR-0104084**Your work days are brighter here.**We’re obsessed with... 
    Work experience placement
    Work at office
    Remote work
    Home office
    Flexible hours

    Workday, Inc.

    Mc Lean, VA
    1 day ago
  • $125k - $200k

    Overview As a Site Reliability Engineer (SRE) , you will help design, build, and operate reliable, secure, and observable cloud‑native systems that support mission‑critical applications and services. You will blend software engineering, DevOps practices, and infrastructure... 
    Local area
    2 days per week

    Steampunk

    Mc Lean, VA
    4 days ago
  •  ...’s safety and security. Make an impact by using your expertise to protect our country from threats. Job Description SITE RELIABILITY ENGINEER (SRE) Own your opportunity. Make your impact As a Site Reliability Engineer (SRE) supporting the CIO Infrastructure... 

    General Dynamics Information Technology

    Washington DC
    23 days ago
  • $120k - $252k

     ...mission critical capabilities to our customers. System Deployment Engineers work in complex environments with shared environmental...  ...Participate in customer demonstrations and exercises Work with site reliability engineers to provide and refine requirements for tooling and... 
    Full time
    Temporary work
    Work experience placement
    Local area
    Relocation package

    Anduril Industries

    Washington DC
    more than 2 months ago
  • A leading technology company is seeking a Senior Site Reliability Engineer in Virginia. The role involves maintaining a Kubernetes-based platform, ensuring high availability, and automating infrastructure processes with tools like Terraform. The ideal candidate will have... 
    Remote job
    Flexible hours

    Workday, Inc.

    Mc Lean, VA
    1 day ago
  • $55.2k - $126k

     ...what to expect during your journey as a candidate with us. Engineering to make a system more resilient and efficient frees up time...  ...have a passion for making systems better, we need you! As a site reliability engineer on our team, you’ll help our Platform Engineering team... 
    Full time
    Contract work
    Part time
    Local area
    Remote work

    Phase2 Technology

    Mc Lean, VA
    2 days ago
  • Overview of the Role: Join our Site Reliability Engineering (SRE) team, where you'll work alongside Infrastructure and Research & Development (R&D) partners to keep Salesforce cloud services available for customers around the clock. In this role, you'll detect and resolve... 
    Work experience placement

    Relha LLC

    Washington DC
    5 days ago
  • Job Category: Software Engineering About Salesforce Salesforce is the #1 AI CRM, where humans with agents drive customer success together...  ...at the heart of it all. Overview Of The Role Join our Site Reliability Engineering (SRE) team, where you’ll work alongside... 
    Work experience placement

    Salesforce

    Washington DC
    4 days ago
  • $188k - $235k

     ...(TechOps) team, we live this mission by building the most reliable and performant systems on the planet. We empower organizations...  ...The Role We are looking for an experienced Senior Site Reliability Engineer (SRE) who thrives on the challenge of managing large-scale... 
    Permanent employment
    Full time
    Work at office
    Local area
    Flexible hours

    Okta

    Washington DC
    more than 2 months ago
  • $194k - $267k

     ...something more than once, automate it” and who can rapidly self-educate on new concepts and tools. Position Overview: The Site Reliability Engineer (SRE) will play a key role in building and managing Kubernetes platforms that support cloud-native applications and... 
    Permanent employment
    Work at office
    Local area
    Worldwide
    Flexible hours

    Okta

    Washington DC
    more than 2 months ago
  • $194k - $267k

     ...Join our team! We’re building a world where Identity belongs to you. We are seeking a highly technical Observability Site Reliability Engineer with a specialty in Google Cloud, to own and expand our Observability ecosystem into GCP. In this role, you will move beyond... 
    Permanent employment
    Full time
    Work at office
    Local area
    Flexible hours

    Okta

    Washington DC
    more than 2 months ago
  • $207k - $284.9k

     ...is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk. Senior Manager, Site Reliability Engineering District of Columbia Area Secure Every Identity, from AI to Human Identity is the key to unlocking the potential... 
    Permanent employment
    Local area
    Worldwide
    Flexible hours

    Okta

    Washington DC
    8 days ago
  •  ...RiVidium is seeking a Site Reliability Engineer / Platform Reliability Engineer to support our planned MODES III team supporting Military Community and Family Policy (MC&FP). This role supports IT Cybersecurity and Data Operations - Modernization & Innovation and helps... 
    Full time

    Rividium

    Alexandria, VA
    7 days ago
  • $188k - $258.5k

     ...build, deliver, and maintain Okta’s legendary resiliency and reliability. We’re the SME’s for a vast number of synchronous and...  ...secure, and Always On. The Role We are seeking a Staff Site Reliability Engineer (TS/SCI) to join our high-stakes National Security team.... 
    Permanent employment
    Full time
    Work at office
    Flexible hours

    Okta

    Washington DC
    more than 2 months ago
  • Relha LLC is seeking a Site Reliability Engineer to join their team in Washington, DC. The role involves monitoring customer-facing services, managing incidents, and automating production issue resolutions. Candidates should possess a Bachelor's degree in Computer Science... 

    Relha LLC

    Washington DC
    5 days ago
  • $55.2k - $126k

    A leading consulting firm in McLean, Virginia, is seeking a Site Reliability Engineer to enhance system resilience and efficiency. Key duties include developing robust infrastructure, implementing automation, and reducing manual tasks. The role requires experience with... 
    Remote job

    Booz Allen Hamilton

    Mc Lean, VA
    1 day ago
  • Salesforce.com, inc. is looking for a Site Reliability Engineer in Washington, DC. In this role, you will monitor customer-facing services, respond to critical incidents, and drive automation to enhance service resiliency. Required qualifications include a Bachelor's degree... 

    salesforce.com, inc.

    Washington DC
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Site Reliability Engineer. Be the first to apply!