Senior Software Engineer- Site Reliability Engineering (SRE)
$149.4k - $202kNoctua Technology
Senior Software Engineer- Site Reliability Engineering (SRE)
DC, MD, VA, CA
The Site Reliability Engineering discipline at Noctua Technology, LLC is a strategic force driving digital transformation. We treat operations as a software engineering challenge, focusing on the seamless integration, scalability, and long-term reliability of cloud native systems. Our SREs don’t just manage infrastructure; they build it using Infrastructure as Code (IaC), monitor it through advanced observability stacks, and protect it by engineering for failure. We work closely with clients to bridge the gap between development and operations. We are seeking a highly experienced and autonomous Senior Site Reliability Engineer (SRE) to join our dynamic team. As a technical leader, you will define the strategy and apply advanced software engineering principles to operations, focusing on the architecture, reliability, and long-term performance of large-scale production systems. You will play a crucial role in reducing toil through automation, defining and monitoring Service Level Objectives (SLOs), and implementing best practices for system stability and incident response. This role requires working with modern cloud technologies to ensure the high availability and efficiency of applications and infrastructure. Location : Primarily Remote. Candidates must be based in CA or DC Metro Area for proximity to project and client teams. Security Clearance Requirement : Applicants must be US citizens and eligible to obtain and maintain an active Secret security clearance or above. Key Responsibilities Site Reliability Engineering Drive the definition and adoption of SLIs and SLOs across multiple services or entire platforms, ensuring alignment with business goals. Design and architect Infrastructure as Code (IaC) solutions for large-scale, complex environments, establishing standards and best practices. Implement and manage containerized and serverless architectures using Docker, Kubernetes, and cloud-native services, focusing on performance and error budgets. Build and maintain reliable and self-healing CI/CD pipelines to automate deployments and improve development workflows. Toil Reduction and Incident Management Implement and refine comprehensive monitoring, alerting, and logging to detect and address performance and availability issues proactively. Lead the strategic effort to eliminate toil, identifying and championing major automation projects that deliver significant organizational efficiency. Lead high-severity incident response and coordinate blameless postmortems for major outages, driving the resulting remediation and systemic improvements. Testing and Service Resiliency Implement cloud security best practices, including identity and access management (IAM), encryption, and compliance controls. Proactively identify and address system weaknesses and ensure performance under stress. Support disaster recovery and high availability strategies through backup and failover planning. Collaboration and Knowledge Sharing Serve as a primary SRE liaison for development teams, influencing application architecture and design to meet reliability and scalability targets from inception. Create and maintain documentation for cloud architectures, deployment processes, and best practices. Contribute to internal knowledge-sharing initiatives, ensuring continuous learning within the team. Stakeholder Communication Act as a subject matter expert and trusted advisor to clients and internal leadership on cloud infrastructure, reliability strategy, and Service Level Agreement (SLA) negotiations. Act on client feedback to refine and enhance cloud solutions. Conduct training and knowledge-sharing sessions to help clients manage their cloud environments effectively. Continuous Learning and Innovation Stay updated on the latest developments in cloud infrastructure and technology trends. Drive innovation by proposing and implementing new techniques and technologies. Qualifications 5+ years of experience in site reliability engineering, cloud engineering, or related fields. Strong software engineering skills with an emphasis on writing clean, modular, and maintainable code, specifically for automation and system management. Deep experience with Infrastructure as Code (IaC) tools like Terraform or CloudFormation. Deep experience with containerization and orchestration tools like Docker and Kubernetes. Deep knowledge of networking concepts, cloud security best practices, and identity management. Experience with programming or scripting languages such as Python, Bash, or Go. Experience with CI/CD pipelines and DevOps methodologies. Strong problem-solving skills and the ability to troubleshoot complex cloud environments. Demonstrated ability to influence technical decision-making across organizational boundaries. Preferred qualifications Bachelor's or advanced degree in Computer Science or a related field. Any of the below cloud certifications: Google Cloud Professional Cloud Architect Google Cloud Professional Cloud DevOps Engineer AWS Certified Solutions Architect AWS Certified Developer AWS Certified SysOps Administrator CompTIA Security+ certification or an equivalent DoD 8140/8570 IAT Level II baseline certification. Salary Range : $149,400 - $202,000 #J-18808-Ljbffr Noctua TechnologyVacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Senior Software Engineer- Site Reliability Engineering (SRE) in California, MO vacancy
$140k - $180k
...automate, simplify, and accelerate revenue. We are looking for a Senior Site Reliability Engineer to lead the strategic evolution of our cloud... ...You Are Experienced Architect: 5+ years of experience in SRE, DevOps, or Systems Engineering, with a proven track record...SeniorFull timeWork at officeFlexible hours2 days per week$152k - $241.5k
Senior Site Reliability Engineer - Compute Farm Team What you’ll be doing: Own SRE solutions end‑to‑end, from design and implementation to operation and continuous improvement, ensuring they integrate cleanly with HPC schedulers, storage, and network fabrics. Use IaC...Senior$200k - $322k
...and make a profound global impact. NVIDIA is seeking a Senior Manager of Site Reliability Engineering to lead and reshape how IT operations function at... ...leader will apply strong operational execution with an SRE attitude, facilitating the move from reactive processes...Senior$152k - $241.5k
Dormont Manufacturing Co in California is looking for a Senior Site Reliability Engineer to manage SRE solutions from design to operations within a multi-cloud hybrid environment. The ideal candidate will have experience in HPC clusters and a strong background in Infrastructure...Senior$148.7k - $297.3k
Abbott Laboratories is seeking a Senior Manager for Platform Engineering to lead and develop a team responsible for the reliability, performance, and scalability of their cloud environment... ...and hands-on management in DevOps and SRE practices. Responsibilities include...Senior$168k - $270.25k
At NVIDIA, Site Reliability Engineering provides a rare chance to define, develop, and support large-scale... .... This demanding position merges software and systems engineering efforts to guarantee... ...reliability and uptime. As an SRE here, you will be part of a welcoming...$184k - $287.5k
...application is built. We are seeking a Senior Software Engineer focused on container and cloud... ...and deployment. You will help improve reliability, performance, and scale across thousands... ...Collaborate across research, backend, SRE, and product teams to ensure day‑0 availability...Senior$152k - $241.5k
...We are looking for highly motivated Senior Software Engineers to join our Fabric Networking team with... ...NVLink Rack-Scale Systems Stability & Reliability. In this role, you will partner closely... ...and reliability. Build and maintain SRE-style validation infrastructure, including...Senior- A leading technology firm is seeking an Enterprise Account Executive to drive sales of their AI-powered SRE platform to enterprise organizations. This role involves prospecting, closing new business, and collaborating with internal teams to ensure customer success. The...SeniorRemote job
$170k - $277k
...Dormont Manufacturing Co is seeking a Senior Principal Software Engineer to lead the development of next-generation Layer 7 security capabilities. The ideal candidate will have extensive experience in designing scalable security technologies and will drive innovation...Senior- ...Irvine, California, with growing sites in Denver, Colorado and San... ...our products, enable our engineers, and keep our platform infrastructure reliable as we grow. As a Senior Software Engineer on the Platform... ...in production Solid DevOps/SRE fundamentals (CI/CD, observability...Senior
$148k - $235.75k
...Join our team of innovative engineers who are building an AI Data Center... ..., high-volume telemetry into reliable, job‑centric insights and... ...operators depend on. You’ll partner Software Engineering and Systems... ...distributed systems as SRE/DevOps/Platform Ops. Proven...Senior$148.7k - $297.3k
...partners worldwide, with the reliability and scalability of cloud‑... .... We are seeking a Senior Manager, Platform Engineering. You are a platform engineer... ...Engineering, DevOps, and SRE, and will be accountable for... ...environment. Experience with software delivery and release...SeniorWorldwide- Ring Inc is seeking a Tech Lead for Mobile Core Network Engineering in California. This role involves taking charge of the health and performance... .... The ideal candidate will have over 7 years of experience in SRE or infrastructure roles, strong networking skills, and a proven...SeniorRemote work
- ...Insider, Inc. is seeking an experienced engineering leader for the Application Modernization Platform (AMP). The role involves defining... ...application modernization. Candidates should have over 10 years in software development, leading teams effectively while driving...Senior
$184k - $287.5k
...highly skilled and experienced Senior DevOps Engineer to join NVIDIA’s Robotics... ...supporting robotics software, including ROS 2-based systems... ...engineering teams, and ensure the reliability, scalability, and... ...years of experience in DevOps, SRE, or infrastructure engineering...SeniorNight shift- ...This role involves upgrading the server and implementing best practices for optimal performance. Candidates should have over 8 years of SRE experience, recent expertise in ETL processes, and strong communication skills for collaboration across teams. The position requires...Senior
- ...responsible for: Cloud Platform Software Development Design, develop,... ...code following software engineering best practices (CI/CD, code review... ..., and ensuring system reliability, the role directly applies engineering... .... Collaborate with SRE and infrastructure teams to ensure...SeniorLocal area
- Cryoport Systems, LLC is seeking a Senior Software Engineer to lead technical initiatives and optimize team delivery. This role involves implementing scalable and maintainable software systems to meet business goals while collaborating closely with product managers and...Senior
$170k - $277k
...drives great outcomes. The Team Engineering - Our engineering team is at the... ...Layer 7 security team is seeking a Senior Principal Software Engineer to lead the design and development... ...components, ensuring scalability, reliability, and security Partner with...SeniorFull timeWork at officeWorldwideVisa sponsorshipWork visa- Dormont Manufacturing Co is looking for a committed Release Engineer in California. In this role, you will collaborate across teams to improve software release processes and ensure high-quality execution. The ideal candidate has over 5 years of experience in programming...Senior
- ...A healthcare technology startup is seeking a Senior DevOps Engineer to create and optimize infrastructure for their AI Care platform. The... ...deployment, and collaboration with multiple teams to ensure system reliability and performance. Ideal candidates should have strong...SeniorRemote workFlexible hours
- ...About the Role Join the NVIDIA Developer Tools team and contribute to our mission of accelerating computing. As a Senior Staff Software Engineer, you will be instrumental in developing high-performance software and libraries that leverage NVIDIA's cutting-edge GPU technologies...Senior
$152k - $287.5k
Dormont Manufacturing Co in California is seeking a talented systems software engineer to architect and maintain their IaaS product. The ideal candidate will have expertise in Rust, C++, and Go, along with extensive experience in distributed systems. Competitive salary...Senior$100.32k - $125.4k
...others. Join us in our mission to create a more connected world, where every voice is heard and every story matters. Senior Engineer, Embedded Software Senior Engineer, Embedded Software (Job ID 164042) The Senior Engineer, Embedded Software is responsible for designing...SeniorTemporary workRemote workFlexible hours$158.41k - $224.1k
Penumbra, Inc. seeks a Manager of SAP BASIS and Security in California, MO. This role involves overseeing the SAP technical landscape, ensuring compliance with regulations, and managing a team of professionals. The ideal candidate will have 8+ years of SAP BASIS experience...Senior$184k - $357k
Senior Systems Software Engineer - GPU Software Mar 11, 2026 $184K - $357K NVIDIA is searching for a creative and highly motivated engineer with expertise in systems software to join the GPU Software team. You will design key aspects of our production GPU kernel drivers...Senior$110k - $130k
A vehicle manufacturing company in Riverside, CA seeks a Sr. Engineer, Vehicle Applications to lead the design and integration of customer-specific vehicle configurations. The ideal candidate will have 5-7 years of engineering experience in a transportation environment...Senior- Dormont Manufacturing Co is seeking a Senior Software Engineer to join our storage management team. This role involves maintaining and developing Kubernetes operators as well as creating web-based solutions for managing distributed storage. Candidates should have extensive...Senior
- ...Role Summary Oracle Health Platform Engineering builds core platform capabilities that... ...documentation, and operations. We are seeking a Senior Software Developer (IC3) to design, develop,... ...and strengthen platform security and reliability. Location / Work Authorization /...SeniorVisa sponsorship
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Software Engineer- Site Reliability Engineering (SRE). Be the first to apply!
Related searches
- software engineer amazon California, MO
- experienced software developer California, MO
- federal - software developer California, MO
- software developer internship California, MO
- senior software engineer California, MO
- software developer fintech California, MO
- part time software developer remote California, MO
- software developer intern California, MO
- software data engineer California, MO
- software engineer California, MO


