Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Site Reliability Engineer

$125k - $135k

Ad Hoc LLC

Site Reliability Engineer

Job number: 880

This is a remote position.

Ad Hoc is a technology company that empowers organizations to deliver scalable, impactful digital services. Using modern, agile methods, our team creates products that meet people's needs and transform their experience of government.

Work on things that matter

Our collaborations have shaped some of the defining moments in public-sector service delivery. We've helped build products that connect Veterans to tailored services, help millions access affordable health care, and support important programs like Head Start. As we work with agencies to deliver critical services, we're also changing how the government approaches technology.

Built for a remote life

Our culture, communications, and tools are built for remote work, enabling us to bring together top talent nationwide. At Ad Hoc, remote life empowers our teams to design work environments that fit their lives and that foster flexibility and collaboration to achieve positive outcomes for our customers.

Committed to high expectations and a welcoming culture

Ad Hoc values acceptance, accountability, and humility. We aren't heroes. We learn from our mistakes and improve the process for the next time. We build small, inclusive teams to collaborate closely with our partners to solve the right problems and deliver software that works.

The Veterans Affairs business unit helps transform the VA into a modern digital services organization where Veteran outcomes are at the center of every effort. We partner with the VA to design and deliver seamless user experiences for Veterans, their families and caregivers, and VA employees. By applying better practices in service design, product management, and technology, we enable the VA to increase the use, quality, and reliability of services and decrease the time Veterans spend waiting for outcomes.

Primary Responsibilities:

As a Site Reliability Engineer, you will help ensure the availability, performance, and reliability of a large federal enterprise cloud platform that operates around the clock. With the support and guidance of senior engineers, you will help meet scope, schedule, and delivery requirements while improving the platform's reliability practices. Primary expectations of a Site Reliability Engineer include:

  • Monitoring platform health and supporting service level objectives (SLOs), service level indicators, and error budgets
  • Building and maintaining observability tooling, including metrics, logging, alerting, and dashboards
  • Participating in on-call rotations and incident response, helping restore service and reduce time to recovery
  • Contributing to blameless postmortems and driving follow-up actions
  • Automating repetitive operational tasks to reduce toil
  • Supporting capacity planning and performance tuning across cloud infrastructure (AWS) and Kubernetes (Amazon EKS)
  • Implementing reliability improvements as infrastructure as code (Terraform)
  • Working with government partners and application teams to meet security, SLA, and performance requirements
  • Supporting recruiting efforts by evaluating exercises and assisting with interviews





Basic Qualifications :

  • Bachelor's and 5+ years of experience; relevant experience may be substituted for education
  • Experience with monitoring and observability tooling and on-call operations
  • Proficient with at least one infrastructure-as-code tool (Terraform preferred)
  • Background in key DevOps concepts: containerization, networking, and cloud infrastructure
  • Must be able to obtain and maintain a U.S. Public Trust / suitability determination





Preferred Qualifications:

  • Prior experience with the Department of Veterans Affairs
  • Experience with Kubernetes (Amazon EKS) and AWS in production
  • Familiarity with SLO-based reliability practices and error budgets
  • Relevant certifications (e.g., AWS, Certified Kubernetes Administrator)


To learn more about working at Ad Hoc, please visit:

Benefits:

  • Company-subsidized health, dental, and vision insurance
  • Flexible PTO
  • 401K with employer match
  • Paid parental leave after one year of service
  • Employee Assistance Program


Ad Hoc LLC is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, national origin, ancestry, sex, sexual orientation, gender identity or expression, religion, age, pregnancy, disability, work-related injury, covered veteran status, political ideology, marital status, or any other factor that the law protects from employment discrimination.

We value the unique skills gained through military service and encourage veterans and transitioning service members to apply.

In support of various state and city equal pay transparency laws, Ad Hoc job descriptions feature the starting range we reasonably expect to pay to candidates who would join our team with little to no need for training on the responsibilities we've outlined above. Actual compensation is influenced by a wide range of factors including but not limited to skill set, level of experience, and responsibility. The range of starting pay for this role is $125,000-$135,000. Our recruiters will be happy to answer any questions you may have, and we look forward to learning more about your salary requirements.

job reference:

Vacancy posted 5 hours ago
Similar jobs that could be interesting for youBased on the Site Reliability Engineer in McLean, VA vacancy
  •  ...Detail Description: The AWS Site Reliability Engineer (SRE) is responsible for the operational health, availability, and performance of the AWS and Databricks environments built by the Platform Engineering team. You prepare and take ownership of "day two" operations... 
    Suggested

    InstantServe LLC

    Vienna, VA
    4 days ago
  • $160k - $180k

     ...Site Reliability Engineer Location: Hybrid – Washington DC/Virginia/Maryland metro with the ability to travel to Patuxent River, MD, as needed (up to 20% of the time). Compensation: $160,000 - 180,000 per year, depending on experience and qualifications. Employment... 
    Suggested
    Full time
    Temporary work
    Local area
    Remote work
    Flexible hours

    Fortress Information Security

    Washington DC
    4 days ago
  • $104.9k - $174.7k

     ...scale, 24x7, distributed and fault-tolerant systems within agreed reliability objectives, whilst enabling the fast flow of feature and...  ...strong automation skills. About team; This diverse team of Engineers in assisting multiple product teams as we continue to innovate... 
    Suggested
    Local area
    Immediate start
    Worldwide

    RELX

    Alexandria, VA
    1 day ago
  •  ...MANTECH seeks a motivated, career and customer-oriented Site Reliability Engineer to join our team. The Site Reliability Engineer (SRE) will leverage their strong technical background and knowledge to support the Sponsor's system accreditation efforts, to include... 
    Suggested
    Work at office
    Local area

    ManTech

    Herndon, VA
    1 day ago
  • $121.4k - $218.6k

     ...will be responsible for ensuring best-in-class uptime and reliability of our AI hardware infrastructure offerings. **Partner with...  ...and defend them when they are breached. As a Senior Site Reliability Engineer, you will be responsible for: + Developing and scaling robust... 
    Suggested
    Work experience placement
    Work at office

    Akamai

    Washington DC
    1 day ago
  •  ...Site Reliability Engineer Mc Lean, VA Long Term Client's Enterprise Data Machine Learning (EDML) employs innovative minds like yourself to design and develop software-systems that can meet the demand of our ever-growing customer base. Like a... 
    Immediate start

    Maintec Technologies

    McLean, VA
    5 days ago
  •  ...Senior Site Reliability Engineer United States About OfficeSpace: OfficeSpace Software provides the leading AI operating system for the built world, that helps teams plan, connect, and perform in the workplace. As a performance-based, PE-backed company, we hire... 
    Shift work

    OfficeSpace Software

    Washington DC
    4 days ago
  • $86.8k - $198k

     ...Job Number: R0243370 Site Reliability Engineer The Opportunity: At Booz Allen, our Global Defense Sector (GDS) supports the Department of War (DoW) in delivering resilient, mission-critical digital capabilities. We are seeking a Site Reliability Engineer to help... 
    Full time
    Contract work
    Part time
    Work at office
    Local area
    Remote work

    Booz Allen Hamilton

    Arlington, VA
    2 days ago
  • $112k - $179k

     ...system, network, software, and security solutions. About The Role Peraton is seeking a self-driven and resourceful Site Reliability Engineer to join our dynamic of Network and UC engineers in Washington, DC. This position combines software engineering and systems... 
    Contract work
    Worldwide
    Shift work

    Peraton

    Washington DC
    2 days ago
  •  ...and competitive offerings to customers in the intelligence community, defense, civil, and commercial markets. Job Title: Site Reliability Engineer Location: Sterling, VA Clearance: TS/SCI Poly This position is CONTINGENT upon contract award The Site... 
    Contract work

    Nightwing

    Sterling, VA
    6 days ago
  •  ...Site Reliability Engineer II Join the leader in providing smarter solutions for a safer world. The property technology space is growing rapidly, and Kastle Systems is leading the way. Kastle Systems is the leader in managed security, with a track record of introducing... 
    Remote work

    Kastle Systems

    Falls Church, VA
    5 days ago
  • $131k - $227.13k

     ...Description: The 1LMX MES COE is seeking an engineer who will own infrastructure‑as‑code, cloud platform, and reliability for the Apriso environment on AWS. This role blends full‑stack development, DevOps, and Site Reliability Engineering (SRE) practices to deliver a... 
    Full time
    Temporary work
    Work experience placement
    Work at office
    Remote work
    Relocation
    Flexible hours
    Shift work
    3 days per week

    Lockheed Martin Corporation

    Bethesda, MD
    2 days ago
  • $86.8k - $198k

     ...Job Number: R0238722 Site Reliability Engineer The Opportunity: Engineering to make a system more resilient and efficient frees up time and money to build more capabilities. Whether you come from a background in network engineering, systems administration, or software... 
    Full time
    Contract work
    Part time
    Work at office
    Local area
    Remote work

    Booz Allen Hamilton

    Herndon, VA
    1 day ago
  •  ...Sr. Site Reliability Engineer (SRE) III As a Sr. Site Reliability Engineer (SRE) III, you'll work as part of a collaborative and high-performing team providing your expertise to deliver technical solutions within the highest levels of the federal government. We believe... 
    Immediate start

    Mount Indie

    Washington DC
    5 days ago
  •  ...Site Reliability Engineer Location: Occasional onsite visits to Reston VA (Zip code 20190). Duration-1 year plus Interview process: The final interview is a mandatory, face-to-face interview in Reston VA. Zip code: 20190 Strong... 
    Long term contract
    Temporary work
    H1b
    Immediate start
    Relocation

    3B Staffing LLC

    Reston, VA
    2 days ago
  •  ...Site Reliability Engineer Qualifications: 10+ years of overall experience in IT including, with hands-on Development and Systems engineering background 3-5 years of experience in a Site Reliability Engineering role Experience with Enterprise Cloud transformation... 
    Temporary work
    Immediate start

    Samprasoft

    Washington DC
    2 days ago
  • $175k - $195k

     ...Filevine Sr. Observability Engineer Filevine is a Legal AI company delivering Legal Operating...  .... # Define and manage SLIs, SLOs, and reliability metrics. # Lead incident response,...  ..., or operations. #5+ years of Site Reliability Engineering experience. #... 
    Full time
    Temporary work

    Filevine

    Washington DC
    2 days ago
  • $106.3k - $221.1k

     ...more. Join us to drive positive, lasting change that moves missions and the government forward! Job Description The Site Reliability Engineer will ensure the reliability, performance, and scalability of the Client System. The engineer will define and track Key... 
    Live in
    Work at office
    Local area

    Accenture

    Arlington, VA
    3 days ago
  • $135k - $150k

     ...Mission Focused Expertise: From veteran leadership to cleared engineers, our people understand both the technology and the mission. Summary Bridge Defense seeks a highly qualified Site Reliability Engineer to build and lead the company's deployment engineering... 
    Relocation
    Flexible hours

    Bridge Defense

    Washington DC
    4 days ago
  • $165k - $230k

     ...actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SR. SITE RELIABILITY ENGINEER (STARSHIELD) Starshield leverages SpaceX’s Starlink technology and launch capability to support national security efforts... 
    Permanent employment
    Temporary work
    Immediate start
    Weekend work

    SpaceX

    Washington DC
    3 days ago
  • $95k - $171k

     .... Opportunities exist to focus on GPU infrastructure, Kubernetes, and ensuring reliability for AI workloads within Akamai's serverless inference platform. As an Site Reliability Engineer II, you will be responsible for: Building and maintaining dashboards, alerts... 
    Permanent employment
    Work experience placement
    Work at office
    Remote work
    Work from home
    Worldwide
    Flexible hours

    Akamai

    Washington DC
    7 days ago
  • $103.5k - $150k

     ...exceptional people to create extraordinary experiences together. Bring your whole self. The Role and Team The Site Reliability Engineering organization at Medallia brings together the infrastructure and applications that power a highly reliable global SaaS... 
    Temporary work
    Work experience placement
    Local area
    3 days per week

    Medallia

    McLean, VA
    4 days ago
  • $131k - $164k

     ...Staff Site Reliability Engineer New York, New York, United States Position Overview We are seeking a highly skilled Staff Site Reliability Engineer with deep technical expertise across VMware, Linux, and automation frameworks, to join our global Infrastructure... 
    Work at office
    Local area
    Flexible hours

    Diligent

    Washington DC
    4 days ago
  • $112.5k - $187.5k

     ...We Collect Your Privacy Choices Team Overview At TransUnion, this role will report to a DevOps Director. The Site Reliability Engineering team drives reliability strategy, elevates engineering standards, and owns some of the most complex and consequential work... 
    Full time
    Work experience placement
    Work at office
    Flexible hours
    2 days per week

    TransUnion

    Reston, VA
    5 days ago
  • $84.9k - $209.5k

     ...Designs and architects infrastructure and service to ensure reliability and functionality. Forecasts demands and responds to capacity needs...  ...new tools and develops and maintains advanced knowledge of site reliability trends. #LI-E2 Responsibilities Key Responsibilities... 
    Temporary work
    Immediate start
    Flexible hours
    Shift work

    Oracle

    Washington DC
    3 days ago
  •  ...Required U.S. Citizenship / No clearance needed / 100% remote within the US  Staff Site Reliability Engineer / Cloud SME Location: 100% remote in the continental US  Type: Long-term contract (3+ years) Role Summary As the Staff SRE/Cloud SME, you will be... 
    Long term contract
    Remote work

    ASCENDING LLC

    Fairfax, VA
    1 day ago
  • $121.5k - $306.4k

     ...infrastructure and service and provides input on best practices for reliability and functionality. Establishes direction to ensure accurate...  ...with new technology, executing improvements, building site reliability knowledge, and providing clear data. #LI-ES2 Responsibilities... 
    Temporary work
    Flexible hours

    Oracle

    Washington DC
    5 days ago
  •  ...hiring talent AI can't replace to help us shape the future of information management. Join us. YOUR IMPACT As a Lead Site Reliability Engineer at OpenText, you will play a vital role in keeping our cloud infrastructure reliable, scalable, and high-performing.... 
    Work experience placement

    OpenText

    Gaithersburg, MD
    1 day ago
  • $112k - $150k

     ...containerized workloads (Docker) for repeatable, reliable deployments. Define and track the...  ...bachelor’s degree in computer science, engineering, or a related field (or equivalent hands...  ...). 3-5 years of experience in site reliability, systems, or cloud engineering... 
    For contractors
    Remote work
    Flexible hours

    Skyward IT Solutions, LLC

    Rockville, MD
    1 day ago
  • Role Overview We are seeking a high-caliber Site Reliability Engineer (SRE) to join our Forward Engineering team. You will be the guardian of our production ecosystems, ensuring that our complex, data-driven AI platforms remain resilient, scalable, and highly performant... 
    Local area

    Tiger Analytics, LLC

    Washington DC
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Site Reliability Engineer. Be the first to apply!