Principal Site Reliability Engineer
$99.6k - $234.6kOracle
Job Description
As a Principal Site Reliability Engineer, you will play a pivotal role in building and operating the Oracle HealthPatient Portal. In this role, you will design, build, and operate highly reliable, scalable infrastructure that supports Commercial and Federal customers.
You will also contribute to the next evolution of cloud operations by advancing automation, observability, and AI-assisted reliability practices.
You will work within a globally distributed team to deliver robust solutions that handle massive load by the end users with precision and performance, while continuously improving system reliability and operational excellence.
U.S. citizenship is required for this position, as the successful candidate will be required to obtain (and maintain) a U.S. government security clearance after hire.
Required Skills
Infrastructure & Reliability
Experience building and operating high-availability, fault-tolerant systems
Strong understanding of distributed systems, performance monitoring, and resiliency patterns
Experience with incident response, root-cause analysis, and production troubleshooting
Cloud Ecosystems
Experience with one or more cloud environments OCI, AWS/Azure
DevOps/SRE Practices
Advanced competency in CI/CD pipelines (Jenkins, Kubernetes)
Infrastructure as Code (Terraform)
Observability tools (Prometheus, Grafana)
Strong focus on automation-first operations
Data Technologies
• Proficiency in Data Warehousing platforms (e.g., Vertica, Snowflake)
• Experience with ETL frameworks and large-scale data processing
• Understanding of columnar storage systems
Programming & Tools
Proficiency in Python, Java, or Go
Experience with Docker, Kubernetes, and shell scripting
Problem-Solving
Strong troubleshooting skills with ability to perform root-cause analysis
Experience resolving complex production issues in distributed systems
Operational Excellence
Apply DevOps/SRE practices to automate deployments and operations
Enhance observability using Prometheus/Grafana and AI-driven insights
Incident Response
Participate in on-call rotations
Implement preventative and automated remediation solutions
Collaboration
Work closely with engineers to execute technical roadmaps
Contribute to code reviews and infrastructure improvements
What You Bring
7+ years of software engineering, cloud infrastructure, SRE, or DevOps experience
Proven ownership of production system reliability in cloud environments
Core Expertise
Cloud infrastructure design and automation
Distributed systems and performance optimization
Data warehousing and ETL frameworks
Technical Skills
Terraform, Docker, Kubernetes
Observability stacks (Prometheus, Grafana)
Python, Java, or Go
Additional Strengths
Strong problem-solving mindset with a focus on automation and scalability
Experience improving system reliability through intelligent automation
Preferred Qualifications
Experience in healthcare or regulated environments (HIPAA, compliance frameworks)
Experience working in environments requiring security clearance
Experience building self-healing or autonomous infrastructure systems
Responsibilities
• Work with the Site Reliability Engineering (SRE) team to take shared ownership of services and platform components. Develop a strong understanding of end-to-end system architecture, dependencies, and production behavior.
• Design, build, and operate reliable, scalable, and secure infrastructure supporting large-scale distributed systems
• Improve system reliability through automation, monitoring, and performance optimization
• Contribute to the adoption of AI-assisted approaches for operations, including:
Enhancing observability and alerting
Supporting automated incident detection and remediation
Exploring intelligent automation for infrastructure lifecycle management
• Partner with development teams to enhance service architecture, scalability, and operability
• Participate in on-call rotations and act as an escalation point for complex production issues
• Perform root cause analysis and implement long-term fixes to prevent recurrence
• Apply knowledge of distributed systems to troubleshoot issues and optimize system performance
• Drive continuous improvement in DevOps/SRE practices, including CI/CD, Infrastructure as Code, and automation at scale
Disclaimer:
Certain U.S. based or U.S. customer or client-facing roles may be required to comply with applicable requirements, such as immunization/occupational health mandates, and/or drug testing requirements.
Range and benefit information provided in this posting are specific to the stated locations only
US: Hiring Range in USD from: $99,600 to $234,600 per annum. May be eligible for bonus, equity, and compensation deferral.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
Medical, dental, and vision insurance, including expert medical opinion
Short term disability and long term disability
Life insurance and AD&D
Supplemental life insurance (Employee/Spouse/Child)
Health care and dependent care Flexible Spending Accounts
Pre-tax commuter and parking benefits
401(k) Savings and Investment Plan with company match
Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
11 paid holidays
Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
Paid parental leave
Adoption assistance
Employee Stock Purchase Plan
Financial planning and group legal
Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC4
About Us
Only Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers turn that promise into a better future for all. Discover your potential at a company leading the way in AI and cloud solutions that impact billions of lives.
True innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing a workforce that promotes opportunities for all with competitive benefits that support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing View email address on click.appcast.io or by calling View phone number on click.appcast.io in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
$194k - $237k
...at the date of hire. This position is ineligible for employment Visa sponsorship. Overall Purpose The Principal Site Reliability Engineer partners with development teams by designing availability and resiliency patterns in applications and infrastructure....PrincipalHourly payWork at officeImmediate startVisa sponsorshipWork visaFlexible hours$115.28k - $196.13k
...Sr. Site Reliability Engineer- Hybrid We are Farmers – where ambition meets opportunity. At Farmers, we're not just known for unforgettable jingle – we're a team with a passion for purpose and making a real difference in people's lives. We deliver peace of mind when...SuggestedWork at officeFlexible hoursShift work$106k - $130k
...sponsorship. Overall Purpose To create and maintain the next generation of application infrastructure and to be responsible for reliability, automation and scalability using and the latest best practices. Essential Functions Implement software and tools to...SuggestedHourly payWork experience placementWork at officeImmediate startVisa sponsorshipWork visaFlexible hours$142.7k - $158.3k
...Basic Qualifications Bachelor's degree in Software Engineering, or related Science, Technology, Engineering or Mathematics field... ...Responsibilities for this Position What You'll Own SLOs and reliability metrics. Define service level objectives for every AI service...SuggestedRemote workFlexible hours- ...we take care of ourselves, each other, and our communities. Job Summary: Job Description: PayPal, Inc. seeks Site Reliability Engineer in Scottsdale, AZ Job Duties: Monitor and analyze system metrics to ensure optimal availability, performance, and...SuggestedWork at officeLocal areaImmediate startRemote workFlexible hours
$58.8k - $156.7k
...Site Reliability Engineer - Local to Phoenix, AZ Category: Software Development/ Engineering Main location: United States, Arizona, Phoenix Position ID: J0526-0838 Employment Type: Full Time Position Description: CGI is looking to hire a Site Reliability...Permanent employmentFull timeLocal area- ...Title: Site Reliability Engineer Location: Phoenix, AZ Job Type: Full Time Minimum Qualifications •BS or MS degree in computer science, computer engineering, or other technical discipline, or equivalent 3-6 years of work experience in DevOps...Full timeWork experience placement
$104.9k - $174.7k
...Customer Data Management. You can learn more about LexisNexis Risk at the link below, the Role:We are hiring a hands-on Senior Site Reliability Engineer (SRE) to actively build, operate, and improve the reliability of our production systems. This is not a purely advisory...Work at officeLocal areaRemote work$186.07k - $218.9k
...*AI-Driven Innovation: *Join a high-performing team of skilled engineers driving AI transformation at Coinbase. This role involves leading... ...quick access to screen reading technology compatible with this site click here to download a free compatible screen reader (free...Local area- ...The Reinalt-Thomas Corp dba Discount Tire/Americas Tirehas an opening for a Software Engineer Principal - Hybris at our Phoenix, AZ office. Lead & support software application design, creation & documentation. Mail resumes to: Discount Tire, ATTN: HR/Jen Terhark, 2022...PrincipalWork at office
- ...Director, Site Reliability Engineering Phoenix, Arizona SmartRent (NYSE: SMRT) is revolutionizing how people live and work with the industry's only end-to-end platform designed for the rental housing industry. By uniting purpose-built software, integrated hardware...Flexible hours
- Required Skills Service reliability/operation experience running large-scale, high-performance applications in a hybrid environment (on-prem and cloud). Experience in writing automation scripts and building dashboards for Application Performance management to manage Transaction...
- ...improve software solutions to ensure system reliability and availability, mitigate operational... ...issues. # You will help lead chaos engineering efforts in a production-alike environment... ...professionals, with engineers focused on site reliability engineering and...Permanent employmentFlexible hours
- Job Title- Site-Reliability Engineer with GCP Location: Scottsdale, AZ (Onsite) Type: : Long Term Contract Interview process: - 1 level of internal evaluation with Implementation partner - 3 Levels of Client Interviews (2 Telephonic and 1 In person). Last round in person...Long term contractContract work
- A leading analytics company is seeking a Principal Engineer to innovate in Agentic AI. You will architect frameworks for autonomous AI agents, ensuring security and scalability. This role requires strong collaboration with diverse teams, bringing over 8 years of relevant...PrincipalFlexible hours
- Prattwhitney is looking for a Senior Platform Engineer for a remote role within the Collins Aerospace’s Connected Aviation Engineering team. This position will involve ownership of AWS infrastructure, design of CI/CD pipelines, and collaboration with data teams. The ideal...PrincipalRemote job
- ...As a Principal Software Engineer – PLM & Digital Engineering you will be responsible for understanding requirements/problem statements from Product engineering, Systems Engineering and extended functional groups to architect and deploy PLM extended solutions. Responsibility...Principal
- ...Prattwhitney in Scottsdale, AZ, is seeking a Senior Principal Software Engineer to design and maintain software solutions for weapon systems factory support. The role requires guiding teams in test equipment development, managing software engineering tasks, and ensuring...Principal
- ...A leading aerospace company is seeking a Principal Software Engineer to drive innovation and technical excellence in flight management systems. In this role, you'll lead software projects, mentor a team, and ensure high-quality outcomes. The ideal candidate has over 7...Principal
$167.64k - $251.46k
...AD&D Systems Engineer - Advanced Flight Controls Sr Principal Job Category: Engineering Requisition Number: ADDSY013315 Apply now Posting Details ~ Posted : April 24, 2026 Full-Time On-site Salary Range : $167,639.95 USD to $251,459.93 USD Locations...PrincipalFull timeTemporary workWork at officeWeekend work- ...Principal Software Engineer As a Principal Software Engineer here at Honeywell Aerospace, you will be responsible for acting as the subject‑matter expert by providing technical guidance, vision, and leadership on critical software development projects related to flight...PrincipalPermanent employmentTemporary workFlexible hours
- ...Software Engineer Principal At PNC, our people are our greatest differentiator and competitive advantage in the markets we serve. We are all united in delivering the best experience for our customers. We work together each day to foster an inclusive workplace culture...PrincipalWork at office
- PNC in Phoenix, AZ, is seeking a Principal Software Engineer to lead design and development for scalable applications. The role includes mentoring engineers, championing Agile methodologies, and collaborating with stakeholders to define technical requirements. The ideal...Principal
- ...Job Description Forhyre is looking for engineers who can bring unique perspectives and... ...practices while building a culture of reliability and observability Engage in and improve... ...& Skills We are looking for Principal SRE with proven experience in running distributed...
- ...A leading technology firm is seeking a Principal Engineer - DevOps for their Phoenix location. This role involves leading infrastructure design, implementing Cloud solutions, and ensuring system stability and performance. Candidates must have extensive experience in DevOps...Principal
$96.8k - $251.6k
...media, creative, AI, and high-performance workloads where scale, reliability, cost, and customer trust all matter. This role offers the... ...-grade creative workflows in the cloud while improving the engineering systems, operational practices, and AI-enabled delivery patterns...PrincipalTemporary workFlexible hours- ...Solutions Engineers are part problem solvers and part architect. They operate in close partnership with our sales, solutions strategy,... ...featuring the Acxiom name or its variations, other than those listed here: and are fraudulent. Please do not engage with these sites....PrincipalInternshipLocal areaRemote work
$150.05k - $225.07k
...AD&D Engineer Principal - A&E Systems Job Category: Engineering Requisition Number: ADDEN012583 Full-Time On-site Salary Range: $150,047.77 USD to $225,071.65 USD Duluth MN 4514 Taylor Cir Duluth, MN 55811, USA +1 more locations Description Job Summary...PrincipalFull timeTemporary work- ...As a Principal Systems Engineer here at Honeywell, you will design, develop and integrate highly complex systems. You will be integral in creating system solutions that meet the evolving needs of our customers. You will have ownership for systems at every aspect of...PrincipalPermanent employmentTemporary workFlexible hours
$96.8k - $306.4k
...Job Description This Sr Principal Software Engineer role is a senior technical leadership position focused on designing and building secure, scalable cross-domain solutions for mission-critical systems. The role requires deep expertise in security-critical software,...PrincipalTemporary workFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Principal Site Reliability Engineer. Be the first to apply!
- principal battery engineer Phoenix, AZ
- senior civil engineer project manager Phoenix, AZ
- senior chief engineer Phoenix, AZ
- engineering director Phoenix, AZ
- chief engineer Phoenix, AZ
- principal network engineer Phoenix, AZ
- data center chief engineer Phoenix, AZ
- principal infrastructure engineer Phoenix, AZ
- director of electrical engineering Phoenix, AZ
- project engineer assistant project manager Phoenix, AZ


