Site Reliability Engineer

$125k - $200k

Steampunk

Overview

As a Site Reliability Engineer (SRE), you will help design, build, and operate reliable, secure, and observable cloud‑native systems that support mission‑critical applications and services. You will blend software engineering, DevOps practices, and infrastructure expertise to improve system reliability, performance, and operational excellence across the platform.

Contributions

Responsibilities
  • Establish development tools and infrastructure for automation.
  • Understand the needs of stakeholders and convey them to developers.
  • Automate and improve development, testing, deployment, and release processes.
  • Test and examine code written by others and analyze the results.
  • Own and improve the reliability, availability, and performance of production systems and services.
  • Define, implement, and maintain Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets.
  • Perform capacity planning, scalability analysis, and performance tuning for applications and infrastructure.
  • Participate in on‑call rotations, incident response, and post‑incident reviews to drive long‑term improvements.
  • Design and implement infrastructure‑as‑code (IaC) to provision and manage cloud resources (e.g., AWS, Azure, GCP).
  • Build and maintain CI/CD pipelines to ensure reliable, repeatable delivery of application and infrastructure changes.
  • Engineer resilient architectures using concepts such as auto‑scaling, blue/green deployments, canary releases, and self‑healing patterns.
  • Collaborate with security and platform teams to ensure infrastructure adheres to compliance, security, and governance requirements.
  • Collaborate with application development teams to design reliable, observable, and operable services from the outset.
  • Contribute to application code, tooling, and frameworks that enhance reliability, resilience, and performance.
  • Act as an individual contributor and mentor more junior team members.
  • Present regular status updates and provide cross‑training to other DevOps team members.

Qualifications

Required
  • Ability to obtain a U.S. government Security Clearance.
  • BS Degree in an IT field with 10 years of experience OR BS in a non‑IT field and 12 years of related IT experience.
  • 3 years of experience with one or more clouds (i.e. AWS, Azure, or GCP).
  • 3 years of experience with Git SCM providers such as GitHub, GitLab, Bitbucket.
  • 3 years of experience with at least one programming language (e.g., Python, Go, Java, or JavaScript) for tooling, automation, or application development.
  • Hands‑on experience working with AWS in production environments.
  • Hands‑on experience designing, deploying, and operating Kubernetes‑based systems (e.g., EKS, AKS, GKE).
  • Experience with DevOps practices and tools, including CI/CD pipelines (e.g., GitHub Actions, GitLab CI, Jenkins, Azure DevOps).
  • Hands‑on experience with infrastructure‑as‑code tools (e.g., Terraform, CloudFormation, Pulumi) to manage cloud resources.
  • Experience configuring and managing containerization and orchestration platforms.
  • Experience implementing monitoring, logging, and tracing solutions (e.g., CloudWatch, Prometheus, Grafana, Datadog, New Relic, Elastic, OpenTelemetry).
  • Familiarity with networking fundamentals (DNS, load balancing, routing, TLS) and their impact on reliability and performance.
  • Experience with incident management, on‑call operations, and production support practices.
  • Certification(s) such as:
      • Cloud certifications (e.g., AWS DevOps Engineer, AWS SysOps Administrator, Azure Administrator/DevOps Engineer, GCP Professional Cloud DevOps Engineer).
      • Kubernetes certifications (e.g., CKA, CKAD).

Preferred
  • Hands‑on experience with Drupal and Azure.
  • Experience implementing Automated Testing frameworks including Selenium.
  • Excellent written and verbal communication skills, interpersonal and collaborative skills.
  • Experience documenting an as‑is state of the environment, performing a gap analysis, and producing artifacts that articulate options and recommendations.
  • Experience designing and implementing SLOs, SLIs, and error budgets in production environments.
  • Experience with chaos engineering, game days, and resilience testing.
  • Local to Washington, DC metro area and available to be onsite 2 days a week.
  • NIH experience.

About Steampunk

Steampunk relies on several factors to determine salary, including but not limited to geographic location, contractual requirements, education, knowledge, skills, competencies, and experience. The projected compensation range for this position is $125,000 to $200,000. The estimate displayed represents a typical annual salary range for this position. Annual salary is just one aspect of Steampunk’s total compensation package for employees. Learn more about additional Steampunk benefits here.

We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, or any other characteristic protected by law. Steampunk participates in the E‑Verify program.

Vacancy posted 1 day ago
Similar jobs that could be interesting for you
Based on the Site Reliability Engineer in Mc Lean, VA vacancy
  • $147.4k - $221.2k

    Senior Site Reliability Engineer. Remote type: Flex. Locations: USA, VA, McLean; USA, VA, Reston. Time type: Full Time. Posted on: Posted Yesterday. Job requisition ID: JR-0104084. Your work days are brighter here. We’re obsessed with... 
    Suggested
    Work experience placement
    Work at office
    Remote work
    Home office
    Flexible hours

    Workday, Inc.

    Mc Lean, VA
    3 days ago
  • $55.2k - $126k

     ...what to expect during your journey as a candidate with us. Engineering to make a system more resilient and efficient frees up time...  ...have a passion for making systems better, we need you! As a site reliability engineer on our team, you’ll help our Platform Engineering team... 
    Suggested
    Full time
    Contract work
    Part time
    Local area
    Remote work

    Phase2 Technology

    Mc Lean, VA
    4 days ago
  • A leading technology company is seeking a Senior Site Reliability Engineer in Virginia. The role involves maintaining a Kubernetes-based platform, ensuring high availability, and automating infrastructure processes with tools like Terraform. The ideal candidate will have... 
    Suggested
    Remote job
    Flexible hours

    Workday, Inc.

    Mc Lean, VA
    3 days ago
  • $55.2k - $126k

    A leading consulting firm in McLean, Virginia, is seeking a Site Reliability Engineer to enhance system resilience and efficiency. Key duties include developing robust infrastructure, implementing automation, and reducing manual tasks. The role requires experience with... 
    Suggested
    Remote job

    Booz Allen Hamilton

    Mc Lean, VA
    3 days ago
  • $135.8k - $183.8k

     ...dynamic and flexible work environment with competitive benefits and the ability to grow your career. We are looking for a Site Reliability Engineer to support our team responsible for building, managing, maintaining, deploying, and securing mission-critical services to... 
    Suggested
    Work at office
    Flexible hours

    Verisign

    Reston, VA
    6 days ago
  •  ..., and Onsite Notice: This role requires regularly working on-site at customer locations in Arlington, VA. If you are not currently...  ...obtain SCI eligibility. About The Role We are hiring a Site Reliability Engineer to join our Infrastructure & Security team. You’ll work... 
    Relocation
    Relocation package

    Onebrief, Inc.

    Arlington, VA
    4 days ago
  • Description Onsite in Washington, DC Our client seeks a Sr. Site Reliability Engineer III to design, automate, and operate mission-critical systems for federal environments. The role focuses on Kubernetes or VMWare platforms, CI/CD enablement, observability, and developer... 
    Permanent employment
    Full time
    Immediate start

    Eliassen Group

    Washington DC
    2 days ago
  • Senior Site Reliability Engineer Job Description Overview CoStar Group (NASDAQ: CSGP) is a leading global provider of commercial and residential real estate information, analytics, and online marketplaces. Included in the S&P 500 Index and the NASDAQ 100, CoStar Group... 
    Full time
    Work at office
    Work from home
    Monday to Thursday

    Visual Lease

    Arlington, VA
    3 days ago
  • $96k - $151.8k

     ...Remotely? Y Position Type Management Bonus Eligible: Y Expiration Date: 06/22/2026 JOB SUMMARY: The Systems Engineer - Site Reliability Engineering (SRE) is responsible for the reliability, scalability, and performance of mission-critical cloud and on-prem... 
    Full time
    Remote work
    Flexible hours

    Marriott

    Bethesda, MD
    3 days ago
  • Role Overview We are seeking a high-caliber Site Reliability Engineer (SRE) to join our Forward Engineering team. You will be the guardian of our production ecosystems, ensuring that our complex, data-driven AI platforms remain resilient, scalable, and highly performant... 
    Local area

    Tiger Analytics Inc.

    Washington DC
    3 days ago
  • $166k - $220k

    ABOUT THE JOB As a site reliability engineer in Platform Discovery, you will solve a wide variety of problems involving networking, autonomy, systems integration, robotics, and more, while making pragmatic engineering tradeoffs along the way. Your efforts will ensure that... 
    Full time
    Work experience placement
    Relocation package

    Slope

    Washington DC
    3 days ago
  • $100k - $160k

     ...all bring to the team; and empower our employees to create innovative and trusted results. We are looking for a dynamic Site Reliability Engineer (SRE) with a Top Secret clearance to join our team! The Site Reliability Engineer (SRE) will manage, monitor, and... 
    Full time
    Temporary work

    CATHEXIS

    Tysons, VA
    a month ago
  •  ...Job Description Site Reliability Engineer LOCATION: Reston, VA SUMMARY OF POSITION The Site Reliability Engineer (i.e., “SRE”) role is responsible for the optimization and reliability of core technical platforms and platform services,... 
    Work at office

    Cirrus Group Consulting

    Reston, VA
    6 days ago
  •  ...Job Description Apply now: Site Reliability Engineer (DevOps/SRE), location is Remote. The start date is Targeting June 29 for this 12 month contract position. Job Title: Site Reliability Engineer (DevOps/SRE) Location-Type: 100% Remote Start... 
    Contract work
    Remote work

    Mondo

    Herndon, VA
    10 days ago
  • $160k - $200k

     ...Engineering Leader Filevine is a Legal AI company delivering Legal Operating Intelligence for the future of legal work. Grounded in...  ...looking for an experienced engineering leader to spearhead system reliability, drive platform project execution, and work in close... 
    Full time
    Temporary work
    Work experience placement

    Filevine

    Washington DC
    5 days ago
  • $51.9 per hour

     ...OVERVIEW: This job is responsible for the reliability, availability, and performance of...  ...operational efficiency. This role blends software engineering, clinical engineering, and security...  .... Works cross-functionally with AHN site leaders and teams to navigate and to... 
    For contractors
    Local area

    Highmark Health

    Washington DC
    2 days ago
  • $84.24k - $142.48k

    Overview Join us to work collaboratively with our talented team of dynamic and passionate engineers to deliver capabilities that enable our customers to make a difference. You'll deploy and operate ArcGIS Velocity and ArcGIS Workflow Manager SaaS solutions. You will also... 
    Worldwide
    Flexible hours

    Esri

    Vienna, VA
    1 day ago
  •  ...grow, adapt and use your skills consistently. Our customers rely on us in the moments that matter. Engineering delivers on that promise. The Senior Site Reliability Engineer is responsible for ensuring our SaaS products are fast, stable and optimized for our customers... 
    Work experience placement
    Remote work
    Flexible hours

    Donnelley Financial, LLC

    Rockville, MD
    11 days ago
  • Job Summary The Support Lead (SRE) is responsible for overseeing the support operations and site reliability engineering tasks, ensuring the effective functioning of systems and applications. The primary goal is to enhance system performance, availability,... 

    TechDigital Group

    Fairfax, VA
    4 days ago
  • $60 per hour

     ...including front-end, back-end, full-stack, machine learning, and other engineers, who are driving real-world impact in AI development. Our...  .... Those located outside of these countries will not see work or assessments available on our site at this time... 
    Hourly pay
    Full time
    Remote work
    Flexible hours

    DataAnnotation

    Washington DC
    5 days ago
  •  ...Job Description Required U.S. Citizenship / No clearance needed / 100% remote within the US. Staff Site Reliability Engineer / Cloud SME Location: 100% remote in the continental US. Type: Long-term contract (3+ years) Role Summary As... 
    Long term contract
    Remote work

    ASCENDING

    Fairfax, VA
    a month ago
  •  ...Python, and PowerShell, integrating systems, and managing Microsoft Entra services. A minimum of 5 years of experience in systems engineering is required along with a Bachelor's degree in Computer Science. The position offers a hybrid work model as employees must be... 
    Local area

    Highlighttech

    Washington DC
    5 days ago
  • $126k - $248k

    As a TPM for SRE, you will partner with SRE leaders and engineers to scale the platform that underpins all of MongoDB’s cloud products. You will drive program execution, strengthen production reliability practices, and coordinate cross-functional efforts across US and EMEA... 
    Local area
    Remote work
    Worldwide
    Flexible hours

    MongoDB

    Washington DC
    2 days ago
  • Geico is seeking a Staff Engineer to innovate and enhance systems while mentoring engineers and collaborating across teams. This position involves utilizing programming languages like Go and Python, working with Azure services, Docker, and Kubernetes, and requires 6+ years... 

    Geico

    Bethesda, MD
    2 days ago
  • $150k - $170k

     ...PostgreSQL Database / Senior Database Reliability Engineer Department: Technology Employment Type: Full Time Location: Tysons Corner Compensation: $150,000 - $170,000 / year Description Nodal Exchange is a derivatives exchange providing price, credit... 
    Full time
    Local area

    NODAL EXCHANGE

    Vienna, VA
    1 day ago
  •  ...Terrestrial Software Systems Engineer LOCATION Tysons, VA 22182 CLEARANCE TS/SCI Full Poly (Please note this position...  ...based systems, ensuring seamless integration, performance, and reliability. The ideal candidate is adept at solving complex technical... 
    Temporary work
    For contractors
    Immediate start
    Flexible hours

    Cymertek

    Vienna, VA
    1 day ago
  • $110k - $230k

     ...Pledge: Great Company, Great Culture, Great Rewards and Great Careers. GEICO's Cyber Security Engineering & Analytics, Automation (SEA) team is seeking a Staff Cyber Site Reliability Engineer (SRE) — a hands-on, engineering-minded practitioner who is passionate about... 
    Hourly pay
    Full time
    Work experience placement
    Local area
    Flexible hours

    GEICO

    Bethesda, MD
    4 days ago
  •  ...enable national security missions worldwide. Job Description OUSW (R&E) is seeking a highly skilled senior software systems engineer to be the technical lead for a complex project involving the design, development, integration and assessment of complex System of... 
    Full time
    Work at office
    Worldwide
    Night shift

    SOS International LLC

    Falls Church, VA
    3 days ago
  • $155k - $185k

     ...professional growth. Discover your future with us. We are seeking a highly skilled and innovative Senior Software Systems Engineer to join our team and lead critical aspects of system design, integration, and validation. This role requires a strong technical... 
    Full time
    Work experience placement
    Work at office
    Local area
    Flexible hours

    Arete Associates

    Falls Church, VA
    5 days ago
  • Salesforce is seeking a Site Reliability Engineer in Washington, DC to ensure cloud services availability. This role involves monitoring services, incident management, and driving automation for resilient systems. Candidates should have a Bachelor's in Computer Science... 

    Salesforce

    Washington DC
    4 days ago
