Principal Site Reliability Engineer
$202k - $247kFortinet Inc
Job Category Site Reliability Engineering Posting Date 11/18/2025, 12:24 AM Locations Santa Clara, CA, United States Job Schedule Full time Job Description At Fortinet, we strive to provide a supportive, collaborative environment where people are empowered to do the best work of their careers. Our team members enjoy solving complex problems, and obsess over getting the details right. We love what we do and are proud of our work to secure clouds and container environments for thousands of b2b customers worldwide. Our team is growing, and we are looking for engineers with passion for automation. You will help support the FortiCNAPP platform and play a key role in building, operating, and improving the FortiCNAPP Cloud Security Platform, the world's best real-time cloud-native threat detection system. Our team develops and supports the infrastructure layers spanning our cloud accounts, network/connectivity, workload management, observability, and storage services. We build tooling to perform automated operations in order to scale the FortiCNAPP infrastructure and service. To be successful you will design, define, develop, deploy and operate internal tooling, APIs, and frameworks which streamline our workflows and automate our infrastructure. About this role: As a Principal Site Reliability Engineer at FortiCNAPP, you will lead the design, implementation, and optimization of our highly scalable, resilient, and efficient platform infrastructure. You will drive strategic initiatives to enhance operational excellence, mentor teams, and set the standard for reliability and automation across the organization. Your expertise will shape the future of FortiCNAPP’s infrastructure, ensuring it meets the demands of our customers and supports rapid growth. Responsibilities Architect and implement advanced automation strategies to maximize operational efficiency and minimize toil across the FortiCNAPP platform. Lead the design, development, and enhancement of infrastructure systems to ensure world-class scalability, resiliency, and performance. Proactively identify and resolve complex, systemic issues through innovative automation, tooling, and architectural solutions, preventing customer-facing incidents. Drive the evolution of monitoring, instrumentation, and observability systems to anticipate and mitigate scalability and reliability risks before they impact customers. Champion company-wide adoption of reliability best practices, establishing key metrics, SLAs, and milestones to embed scalability and resiliency into all engineering processes. Collaborate with cross-functional teams to define and implement industry-leading practices for infrastructure, deployment, and operational workflows. Provide technical leadership and mentorship to engineering and operations teams, fostering a culture of reliability, automation, and continuous improvement. Lead incident response and post-mortem processes, driving root cause analysis and implementing preventive measures. Participate in an on-call rotation, serving as an escalation point for complex issues and guiding the team through critical incidents. Influence strategic technology decisions, evaluating and integrating cutting-edge tools, services, and methodologies to enhance platform reliability. Minimum Qualifications 10+ years of DevOps/SRE experience, with at least 5 years in a senior or lead role managing production systems at scale. Expert-level development and automation skills, with a proven track record of building sophisticated tools and workflows. Deep expertise in Infrastructure as Code (e.g., Terraform) and supporting tools (e.g., Atlantis, ArgoCD, Flux). Advanced experience with Kubernetes and its ecosystem (e.g., Helm, operators, Kustomize), including managing large-scale, production-grade clusters. Extensive experience with multiple cloud providers and managed services (e.g., AWS: EKS, EC2, S3, RDS, Secrets Manager; GCP, Azure). Proven ability to architect and operate highly reliable, fault-tolerant cloud infrastructure that supports rapid microservice deployment with robust monitoring and high availability. Exceptional cross-team communication and leadership skills, with experience driving alignment across engineering, product, and operations teams. Deep knowledge of large-scale system building blocks, including load balancing, distributed/cloud computing, container orchestration, and advanced monitoring/observability. Expert understanding of cloud networking, including VPC configuration, cross-cloud connectivity, and hybrid cloud architectures. Proficiency in one or more programming languages (e.g., Python, Go, Rust) for building tools and automation frameworks. Preferred Qualifications Extensive experience designing and implementing advanced monitoring and observability systems (e.g., Prometheus, Grafana, New Relic, Datadog, OpenTelemetry). Strong advocate for “everything as code” principles, with experience institutionalizing IaC and GitOps practices across teams. Deep expertise in Java application servers, JVM tuning, and performance optimization for high-throughput systems. Experience leading cross-functional initiatives to improve system reliability, such as chaos engineering, disaster recovery planning, or zero-downtime deployments. Educational Requirements - Bachelor or Masters degree in Computer Science, Computer Engineering or related fields. The US base salary range for this full-time position is $202,000-$247,000. Fortinet offers employees a variety of benefits, including medical, dental, vision, life and disability insurance, 401(k), 11 paid holidays, vacation time, and sick time as well as a comprehensive leave program. Wage ranges are based on various factors including the labor market, job type, and job level. Exact salary offers will be determined by factors such as the candidate's subject knowledge, skill level, qualifications, experience, and geographic location. All roles are eligible to participate in the Fortinet equity program, Bonus eligibility is reviewed at time of hire and annually at the Company’s discretion. Why Join Us We encourage candidates from all backgrounds and identities to apply. We offer a supportive work environment and a competitive Total Rewards package to support you with your overall health and financial well-being. Embark on a challenging, enjoyable, and rewarding career journey with Fortinet. Join us in bringing solutions that make a meaningful and lasting impact to our 660,000+ customers around the globe. About Us Fortinet (NASDAQ: FTNT) secures the largest enterprise, service provider, and government organizations around the world. Fortinet empowers its customers with intelligent, seamless protection across the expanding attack surface and the power to take on ever-increasing performance requirements of the borderless network - today and into the future. Only the Fortinet Security Fabric architecture can deliver security without compromise to address the most critical security challenges, whether in networked, application, cloud or mobile environments. Fortinet ranks number one in the most security appliances shipped worldwide and more than 500,000 customers trust Fortinet to protect their businesses. We are committed to providing reasonable accommodations for all qualified individuals with disabilities. If you require assistance or accommodation due to a disability, please contact us at View email address on click.appcast.io. Fortinet is an equal opportunity employer. We value diversity in our company, and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, age, military/veteran status or any other applicable legally protected characteristics in the location in which the candidate is applying. #J-18808-Ljbffr Fortinet, Inc.
- Job Summary Note: This role requires US Citizenship. Your Career As a Principal Site Reliability Engineer, you will serve as the technical authority for our cloud-native infrastructure. You’re responsible for architecting the reliability, scalability, and security of a...PrincipalVisa sponsorshipWork visaShift work
$151.6k - $245.3k
...outcomes. Job Summary Palo Alto Networks runs a large hybrid infrastructure and is one of the largest GCP customers. As a Site Reliability Engineer, you will be part of a team supporting the services running on this infrastructure. This includes automation, architecture...PrincipalFull timeWork at officeVisa sponsorshipWork visa- Palo Alto Networks, Inc. is seeking a Principal Site Reliability Engineer in Santa Clara, CA. This role involves supporting a large infrastructure and ensuring applications are production-ready, scalable, and reliable. You'll work closely with developers and researchers...Principal
$147k - $237.5k
...usual and that goes for the talent we hire. We’re looking for a Principal SRE to join our InfoSec SRE team that owns the process of... ...Qualifications Must be a US Citizen. BS/MS in Computer Science/Engineering or equivalent training, education, and experience in information...PrincipalFull timeWork at officeVisa sponsorshipWork visa$180k - $200k
...Holmdel, NJ. Join us and be part of a team that's shaping the future of payments—one experience at a time. As our Site Reliability Engineer, you will design, build, and maintain the systems and infrastructure that power our applications, ensuring their...SuggestedFor contractorsWork at officeWork from homeFlexible hours$200k - $260k
...Position Title: Senior Principal Engineer, Software/Firmware - Coherent Optical Module Firmware/SoC-Based Embedded Platforms/CPO (Confidential Client) Location: Santa Clara, CA | Onsite Employment Type: Permanent Compensation ~ Salary: $200,000 - $2...PrincipalPermanent employment- ...Job Description Forhyre is looking for engineers who can bring unique perspectives and... ...practices while building a culture of reliability and observability Engage in and improve... ...& Skills We are looking for Principal SRE with proven experience in running distributed...
- ...design by customizing MES tool per business needs Education Requirements, Ideal Experience: Associate’s degree in Industrial Engineering or IT related field Minimum of 0-3 years’ relevant experience Experience in C#, Delphi desired Knowledge of the...Work at office
$232.9k - $335.81k
...Generative AI, Knowledge AI, Emotion AI, workflow automation and co-pilot guidance. About the Role: We're looking for a Principal Site Reliability Engineer to join our Platform Engineering team — someone equally at home writing production Go as designing and operating...PrincipalPermanent employmentFull time- ...of Huobi globe spanning infrastructure. • Work with engineering teams to make sure new features and changes are deployed quickly... .... • Constantly improve our system performance and reliability through better tools, process and monitoring system. •...Worldwide
$200k - $260k
...About the job Senior Principal Engineer, Software/Firmware Job Title: Senior Principal Engineer, Software/Firmware Location: Onsite, Santa Clara, CA, US Industry: Engineering / Architecture Salary: USD $200,000 - $260,000 / year Sponsorship: None...PrincipalVisa sponsorship- ...that keep the world running. Location: 5 on-site days a week in Sunnyvale, CA Headquarters. Our Team's Vision: Our Engineering team is shaping the future of cybersecurity... ...are looking for an experienced Senior Site Reliability Engineer (SRE) with a strong background in...Work experience placement
- ...building an AI Data Center AIOps platform that turns raw, high‑volume telemetry into reliable, job‑centric insights and automation for GPU fleets. Join our team of innovative engineers who are building this platform and operating it (not the compute cluster): uptime, performance...
$174k - $252k
Senior Software Engineer, Site Reliability Engineering X Applicants in San Francisco: Qualified applications with arrest or conviction records will be considered for employment in accordance with the San Francisco Fair Chance Ordinance for Employers and the California...Full time$152k - $241.5k
...infrastructure platforms for automated host lifecycle management, fleet reliability/auto‑healing, E2E observability or data‑driven operations (... ...languages such as Python, Go, Perl, or Ruby. Mentored other engineers and influenced technical direction through design reviews,...- ...Palo Alto Networks is at the forefront of cloud‑native infrastructure, where reliability, scale, and intelligent automation define the future of operations. As a Senior Site Reliability Engineer, you will design and operate the platforms that power our applications...Full timeWork at officeVisa sponsorshipWork visa
$177.82k - $266.4k
...Senior Principal Firmware Engineer Marvell's semiconductor solutions are the essential building blocks of the data infrastructure that connects our world. Marvell's Optics firmware team develops the software that powers the next generation of optical interconnects....PrincipalInternship- System / Clojure Principal Software Engineer Integrated Resources, Inc is a premier staffing firm recognized as one of the tri-state's most well-respected professional specialty firms. IRI has built its reputation on excellent service and integrity since its inception in...Principal
$205.5k - $310.2k
Senior Principal Security Software Engineer - C and Cryptographic Systems Join us to do the best work of your career and make a profound social impact... ...Engineer on our Software Engineering Team. This is an on‑site position in Austin, Texas (relocation available) or Hopkinton...PrincipalRelocation$200k - $322k
Senior Manager, Site Reliability Engineering page is loaded## Senior Manager, Site Reliability Engineeringlocations: US, CA, Santa Claratime type: Full timeposted on: Posted Yesterdayjob requisition id: JR2016119For over 25 years, NVIDIA has been at the forefront of transforming...- NVIDIA Gruppe is looking for a Principal or Distinguished Engineer to join our Enterprise AI & Automation team. The role involves developing enterprise-grade AI systems using Python and Go, with a focus on building and implementing multi-agent orchestration patterns. The...Principal
$170k - $277k
...lead the development of our Chromium-based enterprise browser in Santa Clara, California. This role involves mentoring a team of engineers and tackling complex challenges in cybersecurity. The ideal candidate has over 8 years of C++ experience and a track record of optimizing...Principal$126k - $204.5k
...As part of this role, you will collaborate closely with our engineering teams to develop innovative solutions that provide clear and... ...team to influence the operability of the product and ensure the reliability and availability of our services. Qualifications Required...$96.8k - $251.6k
...mission-critical cloud services and strong skills in C/C++. Responsibilities include providing technical leadership, mentoring senior engineers, and defining scalable system architectures. Oracle offers a competitive salary range of $96,800 - $251,600 annually, alongside a...Principal$150k - $195k
...milestones so that scale and resiliency are a part of every conversation. Develop best practices alongside engineering/operations teams to improve the scalability and reliability of internal processes. Participate in an on-call rotation. Minimum Qualifications 3 years of...Full timeWorldwide- Education Requirements, Ideal Experience: Associate’s degree in Industrial Engineering or IT related field Minimum of 0-3 years’ relevant experience Knowledge of the application of tools/techniques Experience in one coding language (Preferred) Experience in Database (Preferred...
$147.4k - $220.9k
Site Reliability Engineer, Customer Systems Sunnyvale, California, United States Software and Services Imagine what you could do here. Apple is a place where extraordinary people gather to do their best work. Together we craft products and experiences people once couldn...Relocation$147.4k - $272.1k
Site Reliability Engineer, Enterprise Technology Services Sunnyvale, California, United States Software and Services Imagine what we could do together. At Apple, new ideas have a way of becoming excellent products, services, and customer experiences very quickly. Bring...Relocation$170k - $277k
Palo Alto Networks, Inc. is looking for innovative engineers to design and develop security features for next-generation firewalls. You will work closely with cross-functional teams, applying your programming skills to tackle real-world security problems. A successful candidate...Principal$145k - $165k
...Your Ego : Selflessly collaborate towards our shared purpose. About the role Bolt Graphics is seeking a highly experienced Site Reliability Engineer (SRE) to design, build, and operate highly reliable developer and production systems. This role is mission-critical to...Work at officeImmediate start
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Principal Site Reliability Engineer. Be the first to apply!
- senior civil engineer project manager Santa Clara, CA
- senior chief engineer Santa Clara, CA
- director of product engineering Santa Clara, CA
- engineering director Santa Clara, CA
- chief engineer Santa Clara, CA
- chief design engineer Santa Clara, CA
- principal network engineer Santa Clara, CA
- data center chief engineer Santa Clara, CA
- principal infrastructure engineer Santa Clara, CA
- hotel chief engineer Santa Clara, CA


