Site Reliability Engineer
$115.9k - $205.1kThomas Jefferson National Accelerator Facility
At Jefferson Lab,you'llchampioncutting-edgescience and operational excellence while shaping the future of discovery. Join us and make your mark - where excellence meets purpose, andgreat mindstruly matter.
Salary Range: $115,900 - $205,100 (SCS-III)
What your job will be like:
You embed within the HPDF architecture team to make reliability, resilience, and observability first-class features of the facility's scientific data lifecycle systems - not afterthoughts. You define the initial Service Level Objectives (SLOs) and Service Level Indicators (SLIs), establish monitoring and alerting foundations, influence technology selections across compute, storage, and networking, and build the automation tooling that eliminates manual operations risk. When the facility transitions to operations, you lead the HPDF SRE team, owning availability metrics, incident response, and the continuous improvement processes that keep the facility performing to its design parameters.
In this job you will:- Work closely with the rest of the architecture team to review and influence technology choices to establish reliability, and resilience parameters (e.g., meeting expected availability, failure domain isolation, disaster recovery)
- Ensure the selected software and hardware systems meet those parameters, while also meeting performance expectations and security requirements.
- Evaluate vendor and open-source solutions against established reliability and resilience parameters, develop comparative assessments, and provide technically grounded recommendations to inform architecture decisions and support acquisitions.
- Metrics & Observability: Establish the foundation for system observability, defining initial SLOs/SLIs, architecting, prototyping and then implementing comprehensive monitoring, logging, and alerting solutions.
- Lead the design, prototyping and implementation of these solutions including custom automation to eliminate manual operations and further improve facility resilience.
- Performance Engineering: Participate in testing and performance analysis to validate reliability and resilience design decisions, to identify bottlenecks and alternative approaches.
- Establish SRE Team Framework: Define the operational framework, on-call structures, incident response, other operational processes, and staffing plans for the future SRE team, bridging the design-to-operations transition.
- Required: 10 or more years SRE (Site Reliability Engineering), DevOps, or Systems Engineering roles
- Required: Bachelor's Degree Computer Science or related field
- Preferred: Master's Degree Computer Science or related field
Education above the minimum may be substituted for experience.
Knowledge, Skills, and Abilities- High: Deep experience and understanding of distributed systems principles, failure modes, consensus protocols and self-healing architectures.
- High: Expertise in defining and implementing SLOs and SLIs and comprehensive monitoring stacks and experience architecting observability frameworks in greenfield environments (e.g. Prometheus, ELK, OpenTelemetry)
- High: Strong scripting and automation skills (Go, Python, Shell).
- Medium: Deep experience with public cloud environments (AWS, Azure, GCP) and container orchestration (Kubernetes).
- Medium: Experience with configuration management and IaC tools (e.g., Terraform, Puppet, Ansible).
- Medium: Experience with IPv4 and IPv6 networking, high-speed interconnects and data transfer protocols, familiarity with network reliability patterns and software-defined networking (pref)
- Low: Experience with HPC infrastructure and environments (pref)
- Low: Experience leading or mentoring small teams (pref)
About Jefferson Lab
Join a community with a common purpose of solving the most challenging scientific and engineering problems of our time. The Jefferson Lab campusis located insoutheasternVirginiaamidst a vibrant and growing technology community.
A career at Jefferson Lab is more than a job. You will be part of "big science" and work alongside top scientists and engineers from around the world unlocking the secrets of our visible universe. Managed by SURATech, LLC, Thomas Jefferson National Accelerator Facility is entering an exciting period of mission growth and is seeking new team members ready to apply their skills and passion to have an impact. You could call it work, or you could call it a mission. We call it a challenge. We do things that will change the world.
Total Rewards at Jefferson LabAt Jefferson Lab, we believe that a comprehensive employee benefits program is an important and meaningful part of the compensation employees receive. Our benefits program includes, but is not limited to:
* Medical, Dental, and Vision Care Plans * Flexible Spending Accounts
* Paid Time-off and Leave Programs (Paid Parental, vacation, holidays, and sick leave)
* 401(k) Plan - 9% Lab Contribution; 100% vested * Flexible Work Arrangements
(Remote & Alternate Work Schedules available)
* Tuition Assistance, Training and Professional Development Programs
* Live near the waterways of the Chesapeake Bay region with access to nearby beaches,
mountains, and all major metropolitan centers on the East Coast
SURATech, LLC manages and operates the Thomas Jefferson National Accelerator Facility (Jefferson Lab). SURATech is an Equal Opportunity Employer.
SURATech is committed to providing reasonable accommodation for people with disabilities (unless doing so will result in an undue hardship). If you need a reasonable accommodation for any part of the employment process, please send an e-mail to View email address on click.appcast.io or contact Human Resources by calling View phone number on click.appcast.io and selecting option 1 between 8 am - 5 pm EST to provide the nature of your request.
Employment with SURATech is conditional upon DOE approval if at any time during your employment you are participating in a Foreign Government Talent Recruitment Program or Affiliated activity. Generally, such programs/activities include any foreign-state-sponsored attempt to acquire U.S.-funded scientific research through programs run or funded by the government that target scientists, engineers, students, academics, researchers, and entrepreneurs of all nationalities working or educated in the United States. This includes positions or appointments, both domestic and foreign, titled academic, professional, or institutional appointments whether or not remuneration is received and whether full-time, part-time or voluntary.
$180k - $200k
Zachary Piper Solutions is seeking an Elastic Site Reliability Engineer (SRE) to support a mission-focused organization delivering secure, scalable observability and reliability solutions across Department of Defense environments. This position is on-site at Hanscom AFB...Suggested$180k - $200k
...observability, and telemetry operations. Ensure platform reliability, uptime, scalability, and performance across production mission... .... Qualifications 5+ years of experience supporting Site Reliability Engineering, DevOps, or infrastructure operations environments. Strong...SuggestedRemote job$85.39k - $116.98k
...Syms Strategic Group (SSG) is seeking a talented Senior Systems Engineer (Angular) Location: Remote Department: Veterans Affairs... ...services to deliver live and historical EDI transaction data reliably and performantly Support and contribute to an Angular-based...SuggestedFull timeRemote work- ...testing. If you are passionate about pushing technological boundaries in a dynamic environment, we invite you to join us. The Lead Engineer – Digital Integrated Systems is a key technical leadership position that reports to the Office of the Chief Engineer and drives...SuggestedWork at office
- ...Senior IT Systems Engineer Company: Swisslog Logistics Inc. Location: Newport News, VA, US Additional posting countries (for remote jobs only): Workplace: Onsite - Company Location Address Customer Location: Where do people love what they do, and being great at...SuggestedLocal areaRemote workWorldwide
$103.71k - $138.28k
...demonstrated knowledge and experience in system architecture and engineering disciplines. Specific technical knowledge of enterprise level... ...Amazon Web Services. •Supports due diligence activities including site surveys, design, design review, bill of materials creation,...Full timeTemporary workRemote work- ..., Canon Virginia, Inc. serves as Canon's only manufacturing, engineering, recycling and technical support center in the Americas region... ...of products. Analyzes data to verify efficiency / reliability of prototypes and to determine feasibility/ manufacturability...Contract workWork at officeLocal area
- About the job MICROSOFT DYNAMICS SYSTEM ADMINISTRATOR/ DEVELOPER Job Summary: The Microsoft Dynamics 365 System Administrator primary responsibility will be to provide support for the system configuration, upgrade, security administration, change management...Flexible hoursWeekend work
- ...MGR ENGINEERING 3 (PLATFORM WORK CONTROL) Location: Newport News, Virginia, United States Date: Jun 4, 2026 Req ID: 47592 Team: E25 TEST... ...with base procedures. Ensure sufficient manning for NNS on-site work and off-site work, training and qualification on all work...Full timeTemporary workFor contractorsWork at officeLocal areaRemote workRelocationShift work
- ...role may require collaboration with distributed teams and participation in scheduled project meetings. Reliable transportation may be required if supporting on-site client locations. Work schedules may include extended hours or adjusted schedules based on project deadlines...Contract workWork at officeRemote work
- ...This position opportunity is 100% On-Site or Hybrid per HRT Telecommuting Policy. Hampton Roads Transit is looking for dynamic, customer service oriented, and energetic people to become part of a committed team providing excellent and effective public transportation...Remote workFlexible hoursWeekend work
$85.39k - $116.98k
...Syms Strategic Group (SSG) is seeking a talented Senior Systems Engineer (Amazon Web Services (AWS) Cloud Applications) Location:... ...architectures to process, transform, and deliver healthcare data reliably and securely Design, build, and maintain Representational State...Full timeRemote work- Cruz Associates Inc. of Yorktown, VA ( has an open position and is looking for highly motivated and uniquely qualified candidates who specializes in Future Vertical Lift (FVL) and can perform in a very high OPTEMPO workenvironment as a Mission Equipment Program Integrator...Interim roleWork at office
- ...highly skilled Maximo Application Suite (MAS) Solution Design Engineer to support NASA’s Office of Strategic Infrastructure (OSI) through... .... • Analyze requirement sets to ensure compliance with reliability, maintainability, and availability standards. • Install...Remote jobTemporary workWork at officeFlexible hours
- Who We Are: Headquartered in Washington, DC, Versar Global Solutions provides full mission lifecycle solutions for challenges faced by our government and commercial Customers in the natural, built, and digital environments. With nearly 2,000 team members around the...Work at officeLocal area
- Introduction This team innovated a classified data service to support data-intensive assessment of Intelligence Surveillance Reconnaissance (ISR) system performance and effectiveness. We are seeking a software developer to improve existing features and innovate new ...Immediate start
- Sierra Lobo is seeking an Electrical Engineer to join the Maintenance Operations team at NASA’s Langley Research Center. This role focuses on enhancing the reliability of electrical systems and supporting maintenance teams with technical guidance. The ideal candidate must...
- ...SYMVIONICS has a Current Opening for a Jr. Software Engineer at NASA Langley Research Center ON-SITE POSITION SYMVIONICS at the NASA Langley Research Center (LaRC) inHampton, Virginia is seeking a junior level software engineer to join our team. Our Simulation...For contractorsWork at office
- ...the team of the Air Operations Center Weapons System (AOC WS) Falconer Program. The ideal candidate will serve as a member of the Engineering and Sustainment software development team, provide technical and design aspects, and aid in the innovation and creation of...For contractors
- ...ENGINEER SOFTWARE 4 - EMBEDDED CONTROLS AND HMI Location: Newport News, Virginia, United States Date: May 21, 2026 Req ID: 46... ...medical, prescription drug, dental and vision plan choices, on-site health centers, tele-medicine, wellness resources, employee assistance...Full timeLocal areaRemote workRelocationRelocation packageShift work
$120k - $125k
...Software Engineer Newport News, VA 23606 Industry: Engineering & Design Job Description CTR Group is seeking a SOFTWARE... ...and clarify requirements. Ensure ongoing functionality and reliability of software through testing and maintenance. Document code,...Temporary work- ...across the software stack. Our current technologies include Ruby on Rails, React, TypeScript, MySQL, and Elasticsearch. We look for engineers who love finding efficient, thoughtful, and highly usable solutions to a variety of technical and product challenges. You should...Permanent employmentFull timeWork experience placementWork at officeRemote work
- ...ENGINEER SOFTWARE 3 Location: Newport News, Virginia, United States Date: Jun 8, 2026 Req ID: 47757 Team: E44 NTWK/COMM/AUTO... ...medical, prescription drug, dental and vision plan choices, on-site health centers, tele-medicine, wellness resources, employee assistance...Full timeWork at officeLocal areaRemote workRelocationRelocation packageShift work
- ...Software Engineer II Job Location US Job ID 2026-4445 Fill Type Vacancy Overview At ITA International, we're a tech-enabled professional services company. Headquartered in Newport News, Virginia, we leverage subject matter expertise...Contract workTemporary work
- ...Inc., headquartered in Newport News, Virginia, seeks a Software Engineer to work at unanticipated location(s) in the U.S. Will develop... ...software training for Mühlbauer's semiconductor systems at customer's sites; Provide software support service for customers; and...Remote workWorldwideRelocation
- ...an experienced Senior Ovation DCS Programmer / Control Systems Engineer to support the design, implementation, commissioning, and long-term... ..., Commissioning & Support Lead and support FAT, SAT, and on-site commissioning activities. Provide operational troubleshooting...Temporary workWork at officeImmediate startRemote workFlexible hours
$120k - $132k
...inclusive of health benefits. We encourage you to learn more about our Total Rewards Program by visiting the Resource page on our Careers site. Salary at MAG Aerospace is determined by various factors including but not limited to location, the particular combination of...Full timeContract workPart timeLocal area$139.4k - $220.3k
...Salary: $139,400.00 - $220,300.00 (SSE) What your job will be like: The candidate will supervise a group of Mechanical Engineers. Will work on multiple major cryogenic projects of the highest complexity, which include a sub-atmospheric 2 Kelvin temperature...Full timePart timeSummer workRemote workFlexible hours- ...PPM Pro, AgilePlace, and Roadmaps. The role involves configuring and optimizing workflows, enhancing data accuracy, and ensuring reliable reporting across projects. Candidates should have a Bachelor's degree in a related field and 3-5 years of experience with Planview...
- ...LaRC) in Hampton, VA is seeking a hands-on Simulation Systems Engineer with an electrical engineering background to join our team. Our... ...ensure that created module meets all performance, quality, reliability and functional requirements by analyzing and troubleshooting issues...Permanent employmentContract workWork experience placementShift work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Site Reliability Engineer. Be the first to apply!
- site safety Newport News, VA
- on-site clinical research associate (traveling/remote) Newport News, VA
- junior website developer Newport News, VA
- lead site reliability engineer
- site reliability engineer remote
- site reliability engineer sre
- site reliability engineer
- site reliability engineering manager
- junior site reliability engineer
- website auditor



