Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Software Engineer- Site Reliability Engineering (SRE)

$106.5k - $177.5k
Full-time

Noctua Technology

The Site Reliability Engineering discipline at Noctua Technology, LLC is a strategic force driving digital transformation. We treat operations as a software engineering challenge, focusing on the seamless integration, scalability, and long-term reliability of cloud native systems. Our SREs don’t just manage infrastructure; they build it using Infrastructure as Code (IaC), monitor it through advanced observability stacks, and protect it by engineering for failure. We work closely with clients to bridge the gap between development and operations. We are seeking a motivated Site Reliability Engineer (SRE) to join our dynamic team. As a key contributor, you will apply software engineering principles to operations, focusing on the reliability, scalability, and performance of production systems. You will play a crucial role in reducing toil through automation, defining and monitoring Service Level Objectives (SLOs), and implementing best practices for system stability and incident response. This role requires working with modern cloud technologies to ensure the high availability and efficiency of applications and infrastructure. Location: Primarily Remote. Candidates must be based in CA or DC Metro Area for proximity to project and client teams. Security Clearance Requirement: Applicants must be US citizens and eligible to obtain and maintain an active Secret security clearance or above. Key Responsibilities Site Reliability Engineering Define, measure, and report on Service Level Indicators (SLIs) and Service Level Objectives (SLOs) to ensure system reliability and uptime. Develop and deploy Infrastructure as Code (IaC) using Terraform, CloudFormation, or similar tools, with an emphasis on repeatability and change management. Implement and manage containerized and serverless architectures using Docker, Kubernetes, and cloud-native services, focusing on performance and error budgets. Build and maintain reliable and self-healing CI/CD pipelines to automate deployments and improve development workflows. Toil Reduction and Incident Management Implement and refine comprehensive monitoring, alerting, and logging to detect and address performance and availability issues proactively. Eliminate toil by extensively automating operational tasks, including provisioning, patching, and deployments, using scripting and configuration management tools such as Python, Bash, or Ansible. Conduct post-incident reviews (blameless postmortems) to drive continuous improvement in system reliability and operational processes. Testing and Service Resiliency Implement cloud security best practices, including identity and access management (IAM), encryption, and compliance controls. Proactively identify and address system weaknesses and ensure performance under stress. Support disaster recovery and high availability strategies through backup and failover planning. Collaboration and Knowledge Sharing Collaborate with development teams to improve the operability and production readiness of applications from design through deployment. Create and maintain documentation for cloud architectures, deployment processes, and best practices. Contribute to internal knowledge-sharing initiatives, ensuring continuous learning within the team. Stakeholder Communication Provide technical guidance and support to clients and internal teams on cloud infrastructure and reliability best practices, with a focus on defining Service Level Agreements (SLAs). Act on client feedback to refine and enhance cloud solutions. Conduct training and knowledge-sharing sessions to help clients manage their cloud environments effectively. Continuous Learning and Innovation Stay updated on the latest developments in cloud infrastructure and technology trends. Drive innovation by proposing and implementing new techniques and technologies. Qualifications 1-5 years of experience in site reliability engineering, cloud engineering, or related fields. Strong software engineering skills with an emphasis on writing clean, modular, and maintainable code, specifically for automation and system management. Proficiency in Infrastructure as Code (IaC) tools like Terraform or CloudFormation. Experience with containerization and orchestration tools like Docker and Kubernetes. Knowledge of networking concepts, cloud security best practices, and identity management. Experience with programming or scripting languages such as Python, Bash, or Go. Familiarity with CI/CD pipelines and DevOps methodologies. Strong problem-solving skills and the ability to troubleshoot complex cloud environments. Effective communication skills and a willingness to learn and collaborate. Preferred qualifications: Bachelor's or advanced degree in Computer Science or a related field. Any of the below cloud certifications: Google Cloud Professional Cloud Architect Google Cloud Professional Cloud DevOps Engineer AWS Certified Solutions Architect AWS Certified Developer AWS Certified SysOps Administrator Azure Solutions Architect Expert CompTIA Security+ certification or an equivalent DoD 8140/8570 IAT Level II baseline certification. Salary Range: $106,500 - $177,500

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Software Engineer- Site Reliability Engineering (SRE) in Virginia vacancy
  •  ...Site Reliability Engineer (SRE) Remote No sponsorship available. Must be able to obtain a Public Trust clearance. What You Will Do We are seeking a Site Reliability Engineer (SRE) to support the SBA Disaster Lending Platform modernization effort in a remote... 
    Suggested
    Contract work
    Local area
    Remote work

    System One

    McLean, VA
    6 days ago
  • $62k - $141k

     ...Job Number: R0228604 Site Reliability Engineer The Opportunity Engineering to make a system more resilient...  ..., systems administration, or software development, if you have a passion for...  ...you! As a Site Reliability Engineer (SRE) on our team, you'll help the Intelligence... 
    Suggested
    Full time
    Contract work
    Part time
    Local area
    Remote work

    Phase2 Technology

    Chantilly, Loudoun County, VA
    12 hours ago
  • $103.5k - $150k

     ...self. The Role and Team The Site Reliability Engineering organization at Medallia brings together...  ...global SaaS platform. As an SRE II, you will help operate and improve...  ...infrastructure. You will work closely with software engineering teams to build automation,... 
    Suggested
    Temporary work
    Work experience placement
    Local area
    3 days per week

    Medallia

    McLean, VA
    2 days ago
  • £65k - £95k per year

     ...founders were working as engineers solving complex cross...  ...teams working both on-site with clients and remotely...  ...an experienced Site Reliability Engineer to help satisfy that demand. As an SRE you will be responsible...  ...Engineer Collaborate with Software Engineers to improve... 
    Suggested
    Remote work
    Work from home
    Flexible hours
    Rotating shift

    TwinStream

    Bristol, Washington County, VA
    12 hours ago
  • $112.5k - $187.5k

     ...TransUnion, this role will report to a DevOps Director. The Site Reliability Engineering team drives reliability strategy, elevates engineering...  ...serve as a senior technical leader and force multiplier on the SRE team. Operating with full autonomy, you will drive reliability... 
    Suggested
    Full time
    Temporary work
    Work experience placement
    Work at office
    Flexible hours
    2 days per week

    TransUnion

    Reston, VA
    1 day ago
  • $180k - $200k

    Zachary Piper Solutions is seeking an Elastic Site Reliability Engineer (SRE) to support a mission-focused organization delivering secure, scalable observability and reliability solutions across Department of Defense environments. This position is on-site at Hanscom AFB... 

    Zachary Piper Solutions

    Hampton, VA
    1 day ago
  • $125k - $200k

    Overview As a Site Reliability Engineer (SRE) , you will help design, build, and operate reliable, secure, and observable cloud‑native systems that...  ...‑critical applications and services. You will blend software engineering, DevOps practices, and infrastructure expertise... 
    Local area
    2 days per week

    Steampunk

    Mc Lean, VA
    2 days ago
  •  ...stores. Building and operating u‑Slicer reporting tool. As a Site Reliability Engineer in the Platform Core group at Criteo, you’ll play a key role...  ...experience. You have 5+ years’ experience in back‑end or SRE roles, with strong coding skills in Python or Go (C# or Java... 

    Centaur Labs

    Woodbridge, VA
    12 hours ago
  • Job Summary Job Summary The Support Lead (SRE) is responsible for overseeing the support operations and site reliability engineering tasks, ensuring the effective functioning of systems and applications. The primary goal is to enhance system performance, availability,... 

    TechDigital Group

    Fairfax, VA
    12 hours ago
  • Communications Training Analysis Corporation is seeking a DevOps Engineer or Site Reliability Engineer for a hybrid role primarily in Northern Virginia. The role focuses on ensuring system integrity and creating automations to streamline operations, working within a cross... 

    Communications Training Analysis Corporation

    Falls Church, VA
    2 days ago
  • $86.8k - $198k

     ...your journey as a candidate with us. Engineering to make a system more resilient and efficient...  ..., systems administration, or software development, if you have a passion for...  ...making systems better, we need you! As a Site Reliability Engineer on our team, you’ll lead the... 
    Full time
    Part time
    Casual work
    Work at office
    Local area
    Remote work

    Booz Allen Hamilton

    McLean, VA
    1 day ago
  • Communications Training Analysis Corporation (CTAC) is looking for DevOps Engineers and Site Reliability Engineers to join their team in Falls Church, Virginia. This hybrid role involves ensuring operational integrity and availability of applications, focusing on automation... 

    Communications Training Analysis Corporation

    Falls Church, VA
    2 days ago
  • $164.3k - $222.3k

     ...offers a hybrid work schedule. Verisign is hiring a Senior Site Reliability Engineer to help lead a team responsible for building, managing,...  ...Coordination with other technical staff to implement systems and software Performance of operations support functions, including... 
    Work at office
    Flexible hours

    Accreditation Council For Graduate Medical Education

    Reston, VA
    12 hours ago
  • A leading technology company is seeking a Senior Site Reliability Engineer in Virginia. The role involves maintaining a Kubernetes-based platform, ensuring high availability, and automating infrastructure processes with tools like Terraform. The ideal candidate will have... 
    Remote job
    Flexible hours

    Workday, Inc.

    Mc Lean, VA
    4 days ago
  •  ..., and Onsite Notice: This role requires regularly working on-site at customer locations in Arlington, VA. If you are not currently...  ...obtain SCI eligibility. About The Role We are hiring a Site Reliability Engineer to join our Infrastructure & Security team. You’ll work... 
    Relocation
    Relocation package

    Onebrief, Inc.

    Arlington, VA
    12 hours ago
  • $77.6k - $176k

    DevOps and Site Reliability Engineer The Opportunity Everyone is trying to “harness the cloud,” but not everyone knows how. As a DevOps engineer...  ...capabilities. We need you to develop container management software to solve some of our clients’ toughest challenges. As a... 
    Full time
    Part time
    Work at office
    Local area
    Remote work

    Booz Allen Hamilton

    Mc Lean, VA
    1 day ago
  • $84.24k - $142.48k

     ...talented team of dynamic and passionate engineers to deliver capabilities that enable our customers...  ...-generation real-time and big data GIS software-as-a-service (SaaS) capabilities for...  ...Responsibilities Collaborate with a team of SRE engineers to operate SaaS capabilities... 
    Worldwide
    Flexible hours

    Esri

    Vienna, VA
    12 hours ago
  • $62k - $141k

     ...Site Reliability Engineer The Opportunity:   Engineering to make a system more resilient and efficient...  ..., systems administration, or software development—if you have a passion for...  ...you!  As a site reliability engineer (SRE) on our team, you’ll help the Intelligence... 
    Full time
    Contract work
    Part time
    Local area
    Remote work

    Booz Allen Hamilton

    Chantilly, Loudoun County, VA
    more than 2 months ago
  •  ...Job Description Job Description Site Reliability Engineer LOCATION: Reston, VA SUMMARY OF POSITION The Site Reliability Engineer (i.e., “SRE”) role is responsible for the optimization and reliability of core technical platforms and platform services,... 
    Work at office

    Cirrus Group Consulting

    Reston, VA
    27 days ago
  •  ...better care. Requirements: As a Senior Site Reliability Engineer at Commence, you will own the...  ...they become habits. Collaborate with software engineers to establish reliability-first...  ...Qualifications ~7+ years of experience in SRE, platform engineering, or DevOps roles... 
    Remote work

    Commence

    Leesburg, VA
    23 days ago
  •  ...special at Virtru. We hope you consider joining our team and helping us create a brighter future for data privacy. As a Site Reliability Engineer (SRE) at Virtru, you will play a pivotal role in driving continuous improvements in observability, performance, and... 
    Work at office
    Local area
    Home office
    Flexible hours
    Shift work

    Virtru

    Reston, VA
    more than 2 months ago
  •  ...100% remote within the US  Staff Site Reliability Engineer / Cloud SME Location: 100% remote in...  ...years) Role Summary As the Staff SRE/Cloud SME, you will be a critical...  ...principles and practices throughout the software development lifecycle, ensuring seamless... 
    Long term contract
    Remote work

    ASCENDING

    Fairfax, VA
    22 days ago
  • $77.6k - $176k

    Booz Allen Hamilton in McLean, Virginia is seeking a DevOps and Site Reliability Engineer. You will develop and manage a container platform to enhance cloud capabilities and address clients’ challenges. The position requires expertise in automation and DevOps practices... 
    Remote job

    Booz Allen Hamilton

    Mc Lean, VA
    1 day ago
  • $159k - $230k

    C3 AI (NYSE: AI), is the Enterprise AI application software company. C3 AI delivers a family of fully integrated products including...  ...Learn more at: C3 AI [ C3 AI is seeking a Senior/Lead Site Reliability Engineer - Federal to join our team in Tysons, VA or Redwood City,... 
    Full time

    C3 AI

    Tysons, VA
    1 day ago
  • $77.6k - $176k

    DevOps and Site Reliability Engineer The Opportunity: Everyone is trying to “harness the cloud,” but not everyone knows how. As a DevOps engineer...  ...capabilities. We need you to develop container management software to solve some of our clients’ toughest challenges. As a... 
    Full time
    Contract work
    Part time
    Work at office
    Local area
    Remote work

    Booz Allen Hamilton

    McLean, VA
    3 days ago
  •  ...Please review the job details below. Vantor is seeking a motivated Engineer to support the development, maintenance, and enhancement of...  ...years of experience in systems engineering, automation, DevOps, software development or related technical roles. Experience working with... 
    Full time

    Vantor

    Herndon, VA
    12 hours ago
  • TryApplyNow is looking for a mid-level DevOps Engineer. This full-time hybrid position is based in Falls Church, Virginia. Your primary responsibility will be to support operations and maintenance of applications, developing automated tooling, and configuring monitoring... 
    Full time

    TryApplyNow

    Falls Church, VA
    2 days ago
  • $286.2k - $326.7k

     ...Overview Sr. Distinguished Engineer, Acquisitions Platform & SRE Foundations As a Sr....  ...9 years of experience in Software engineering and solution...  ..., incident response, and reliability engineering ~...  ...information available through this site. Capital One... 
    Full time
    Part time
    Local area

    Capital One

    McLean, VA
    more than 2 months ago
  •  ...role involves designing sustainable cloud environments, primarily focused on AWS, ensuring software performance aligns with quality standards, and providing training to engineers. Candidates must have a Bachelor’s in Computer Science or a related field, with 9 years of... 
    Remote job

    Trader Interactive

    Virginia Beach, VA
    4 days ago
  •  ...Overview The Virtual Server Engineering (VSE) team provides infrastructure design...  ...environment. Virtual Server Engineering uses Site Reliability Engineering (SRE) principles and engineering...  ...using the combination of software and systems engineering practices.... 
    Internship
    Monday to Friday

    Navy Federal Credit Union

    Vienna, VA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Software Engineer- Site Reliability Engineering (SRE). Be the first to apply!