Principal Site Reliability Engineer
Palo Alto Networks
Principal Site Reliability Engineer
The Cortex team builds and delivers the industry's most advanced SecOps platform, consisting of XDR, XSIAM, XSOAR, and XPANSE.
As a Principal Site Reliability Engineer within the Cortex DevOps team, you will serve as a technical leader responsible for driving the reliability, scalability, observability, and operational excellence strategy across the Cortex platform. You will partner closely with engineering, product, and infrastructure teams to influence architecture decisions, establish reliability standards, and build innovative solutions that improve service availability, performance, and operational efficiency at global scale.
This role requires deep expertise in cloud infrastructure, observability, distributed systems, automation, and incident management. You will help shape the future direction of our observability and reliability platforms while mentoring engineers and driving best practices across the organization.
Key Responsibilities
- Define and drive reliability, observability, and operational excellence standards across Cortex services and infrastructure.
- Design and evolve large-scale observability platforms using technologies such as Prometheus, Thanos, Grafana, OpenTelemetry, and cloud-native monitoring solutions.
- Partner with engineering teams to ensure services are designed, instrumented, and operated with reliability and scalability in mind.
- Drive improvements in monitoring, alerting, incident management, and service health to proactively identify and prevent customer-impacting issues.
- Lead initiatives focused on automation, self-healing systems, operational efficiency, and reduction of operational toil.
- Influence architectural decisions and technology adoption to improve platform reliability, performance, and cost efficiency.
- Mentor engineers and provide technical leadership across multiple teams and organizations.
- Stay current with emerging technologies and industry trends, evaluating and implementing solutions that advance Cortex's operational capabilities.
- Provide leadership during major incidents and drive post-incident reviews focused on systemic improvements.
Qualifications
Required Qualifications
- 10+ years of experience in Site Reliability Engineering, DevOps, Cloud Engineering, or related disciplines.
- Deep expertise with Prometheus, Thanos, Grafana, OpenTelemetry, and modern observability platforms.
- Strong understanding of SRE principles including SLIs, SLOs, error budgets, incident management, and operational excellence.
- Expert knowledge of Google Cloud Platform (GCP), Amazon Web Services (AWS), or similar cloud platforms.
- Expert-level experience with Kubernetes, Docker, and cloud-native architectures.
- Strong software engineering and automation skills using Python, Linux, Terraform, Ansible, and GitOps practices.
- Proven ability to influence technical direction and drive cross-functional initiatives across multiple engineering teams.
Preferred Qualifications
- Experience building and operating observability platforms at large scale.
- Experience implementing AI-driven operational tooling, automation, or AIOps solutions.
- Strong communication and leadership skills with experience mentoring senior engineers and leading complex technical initiatives.
- Ability to operate independently, influence stakeholders, and drive outcomes across organizational boundaries.
Compensation Disclosure
The compensation offered for this position will depend on qualifications, experience, and work location. For candidates who receive an offer at the posted level, the starting base salary (for non-sales roles) or base salary + commission target (for sales/com-missioned roles) is expected to be the annual range listed below. The offered compensation may also include restricted stock units and a bonus. A description of our employee benefits may be found here.
Our Commitment
We're trailblazers that dream big, take risks, and challenge cybersecurity's status quo. It's simple: we can't accomplish our mission without diverse teams innovating, together.
We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at View email address on click.appcast.io.
Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.
All your information will be kept confidential according to EEO guidelines.
Is role eligible for Immigration Sponsorship? No. Please note that we will not sponsor applicants for work visas for this position.
$153k - $185k
...Principal Site Reliability Engineer El Segundo, California, United States About Varda Low Earth orbit is open for business. Varda is accelerating the development of commercial space infrastructure, from in-orbit pharmaceutical processing to reliable and economical...PrincipalPermanent employment$250.5k - $335.9k
...Sr Principal Site Reliability Engineer P5/P6: SRE Lead, Content Distribution Engineering Media Engineering. SF CA / LA CA / NYC Team Intro On any given day at Disney Entertainment & ESPN Technology, we're reimagining ways to create magical viewing experiences...PrincipalWorldwide$147k - $237.5k
...Principal SRE Palo Alto Networks is disrupting the Cyber Security industry! We're definitely not business-as-usual and that goes for... ...~ Must be a US Citizen. ~ BS/MS in Computer Science/Engineering or equivalent training, education, and experience in information...PrincipalVisa sponsorshipWork visa$153k - $185k
...Senior Site Reliability Engineer El Segundo, California, United States About Varda Low Earth orbit is open for business. Varda is accelerating the development of commercial space infrastructure, from in-orbit pharmaceutical processing to reliable and economical...SuggestedPermanent employmentWeekend work- A leading aerospace company in Hawthorne, CA is seeking a Senior Site Reliability Engineer to enhance Starlink’s satellite internet infrastructure. The role involves upgrading systems for geo-redundancy, collaborating closely with engineers, and ensuring optimal performance...Suggested
$183k - $235k
...Senior Staff Site Reliability Engineer El Segundo, CA HiveWatch is a tech-forward, inclusive organization fostering the evolution of the physical security industry. We are a diverse team of forward thinkers who empower each other to find creative and collaborative...Flexible hours$129k - $193.5k
...innovation and transformation . The DID Mission IT pillar supports engineering teams across Aerospace by delivering top-tier IT... ...community. Mission IT Operations is seeking a skilled Site Reliability Engineer with deep expertise in Kubernetes, Linux,...Full timeImmediate startRemote workRelocation packageFlexible hours$160k - $220k
Sr. Site Reliability Engineer (Starlink) Hawthorne, CA SpaceX is developing Starlink, the world’s largest satellite constellation and is providing fast, reliable internet to 9M+ users worldwide. We design, build, test, and operate all parts of the system - thousands of...Temporary workWorldwideWeekend work$129k - $193.5k
Job Summary Mission IT Operations is seeking a skilled Site Reliability Engineer with deep expertise in Kubernetes, Linux, programming, and automation. The role involves developing and maintaining both on‑premises and cloud‑based Kubernetes clusters that form the core of...Immediate startRemote workRelocation packageFlexible hours- SPACE EXPLORATION TECHNOLOGIES CORP is seeking a Site Reliability Engineer in Hawthorne, California, to manage mission-critical products for Guidance, Navigation, and Control (GNC) teams. The ideal candidate possesses a degree in a relevant field or equivalent experience...
$125k - $175k
SpaceX is seeking a Site Reliability Engineer in Hawthorne, California. The role involves deploying, maintaining, and scaling mission-critical software infrastructure for vehicle operations, ensuring software quality and reliability. The ideal candidate will have a strong...- ...A leading engineering solutions firm in El Segundo, California, is seeking a Principal Software Engineer to join a systems engineering team focused on developing next-generation space communication systems for national security. The ideal candidate will have at least...Principal
$125k - $145k
Site Reliability Engineer - Top Secret Clearance Hawthorne, CA SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to...Permanent employmentTemporary workWeekend work$125k - $145k
...SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. Site Reliability Engineer, GNC SpaceX’s mission is to make humanity multiplanetary by developing fully and rapidly reusable launch systems capable...Permanent employmentTemporary workFlexible hoursWeekend work$125k - $175k
Software Engineer, Site Reliability Engineering (application Software) Design, deploy, and scale SpaceX mission‑critical software infrastructure for vehicle operations. Location: Hawthorne, California, United States Compensation: $125,000 - 175,000 USD / year Job Tags...Permanent employmentTemporary workWeekend work- ...A leading SpaceTech organization seeks a Principal Flight Software Engineer to lead the design and development of mission-critical flight software. This role involves architecting delivery of real-time flight software and mentoring engineers in a fast-paced environment...Principal
- Northrop Grumman Corp. (AU) is hiring a Principal/Senior Principal Systems Engineer specializing in High Performance Computing in Redondo Beach, California. This role involves managing HPC systems lifecycle and collaborating with IT and Cybersecurity to optimize Low Observable...Principal
$180k - $250k
...Principal Flight Software Engineer This range is provided by InnoForge. Your actual pay will be based on your skills and experience — talk with... ...with hardware and GNC teams. Develop high‑reliability embedded software in C/C++ (and/or Rust) within RTOS...PrincipalPermanent employmentFull time- ...A leading aerospace manufacturer in Hawthorne, CA is seeking a Principal Security Software Engineer to automate security systems using AI. The role involves designing and implementing security solutions for Starshield, a project supporting national security efforts. Ideal...Principal
- ...Radiant-Industries in El Segundo, CA is seeking a Principal DevOps Engineer to lead high-performance computing (HPC) infrastructure projects. You... ...of cloud architectures. This position requires on-site work and contributes significantly to innovative nuclear microreactor...Principal
$170k - $277k
...Principal Software Engineer At Palo Alto Networks®, we're united by a shared mission—to protect our digital way of life. We thrive at the intersection of innovation and impact, solving real-world problems with cutting-edge technology and bold thinking. Here, everyone...PrincipalFull timeWork at officeVisa sponsorshipWork visa$175k - $215k
...contested environments. Designed by military veterans and engineers from leading technology companies, CHAOS operates at the intersection... ...accelerates. The Opportunity CHAOS is looking for a Principal Mechanical Engineer to lead the mechanical integration of...Principal$200k - $285k
A leading aerospace manufacturer in Hawthorne, CA is seeking a Principal Software Engineer for their Platform Team. The role involves defining the long-term technical vision for AI platforms, leading large initiatives, and optimizing systems for security and efficiency....Principal- ...on applications and provide technical support Follow the true agile principles What you bring ~10+ years of software engineering experience ~ Expert-level ability utilizing technologies such as Java, Spring Framework ~ REST and Microservices ~ Strong...Principal
$141k - $175k
...CesiumAstro is seeking a Principal DevOps Engineer I in El Segundo, California. The incumbent will own and scale infrastructure supporting mission-critical software for satellites and UAVs. Key responsibilities include managing CI/CD pipelines, optimizing embedded builds...Principal- A leading aerospace company in Hawthorne, CA, is seeking a Principal RF Software Engineer to support national security projects. This role involves designing and building RF test benches, collaborating with engineers, and applying advanced software skills in a fast-paced...Principal
- ...Principal Software Engineer with GPS Experience - El Segundo, CA Our dynamic and diverse engineers develop demanding, trusted, superior solutions to make the world a safer place. We are looking for a proven Senior Principal Software Engineer to join our Software Engineering...PrincipalWork experience placement
$180k - $260k
...Redondo Beach Description As a Senior Embedded Software Engineer at Impulse focused on Actuation & Control Systems, you will be... ...high‑performance, fault‑tolerant firmware capable of operating reliably in harsh environments such as launch and space. Responsibilities...PrincipalPermanent employmentFull time$200k - $270k
SpaceX is looking for a Principal Software Engineer, Flight Software in Hawthorne, California. The role involves designing and developing software for Starship’s flight systems. Candidates must have over 8 years of experience in software development, with expertise in C...Principal$205k - $245k
...Senior Principal Mechanical Engineer, Platform Integration CHAOS Industries is redefining modern defense with omniscient systems purpose-built... ...disciplinary engineering efforts. This is a full-time, on-site position in Hawthorne, CA. Responsibilities:...PrincipalFull timeWork experience placementCasual workRelocation package
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Principal Site Reliability Engineer. Be the first to apply!
- engineering director Hawthorne, CA
- chief engineer Hawthorne, CA
- data center chief engineer Hawthorne, CA
- hotel chief engineer Hawthorne, CA
- principal developer Hawthorne, CA
- general engineer Hawthorne, CA
- principal engineer Hawthorne, CA
- principal Hawthorne, CA
- senior principal scientist Hawthorne, CA
- senior principal cloud computing engineer Hawthorne, CA

