Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Principal Site Reliability Engineer

$84.9k - $209.5k

Oracle

Job Description

Designs and architects infrastructure and service to ensure reliability and functionality. Forecasts demands and responds to capacity needs. Collaborates with software development teams to develop reliable and scalable infrastructures. Exercises judgment when performing data collection to maintain and optimize operations and reliability. Leverages advanced knowledge to perform incident response and/or maintenance tasks. Provides comprehensive health and performance reporting. Identifies and recommends opportunities for automation. Communicates comprehensive information about services and proactively anticipates and articulates the potential impact of changes. Provides comprehensive support for technology and documents incidents. Conducts advanced experiments with new tools and develops and maintains advanced knowledge of site reliability trends.

Responsibilities

Key Responsibilities

Capacity Ingestion and Management:

- Designs and architects infrastructure and/or service according to terms for reliability and functionality.

- Forecasts demands for infrastructure and responds to capacity needs, ensuring systems have sufficient resources to handle current and future workloads and identifying resource gaps.

- Collaborates with the software development team to develop infrastructures, ensuring features are reliable and scalable according to deployment requirements.

- Proactively identifies opportunities for prototyping and drives prototyping initiatives (e.g., testing new applications or infrastructures, assisting in onboarding) to explore novel approaches.

Incident and Service Lifecycle Management:

- Exercises judgment when performing data collection, triage, technical analysis, and redirection to maintain and optimize operations and infrastructure reliability.

- Takes proactive steps to monitor services, maintain up-to-date knowledge of their performance, and document their condition.

- Leverages advanced knowledge to perform incident response, root cause analyses, and/or maintenance on assigned services (e.g., software installs, version upgrades, security updates, backup and recovery).

- Provides comprehensive health and performance reporting and takes appropriate actions based on trends in data.

- May perform provisioning to support infrastructure, applications, and services.

- May experiment with new approaches for and performs decommissioning (e.g., shutting down servers, removing data from databases) to remove objects that are no longer needed.

Automation:

- Identifies and recommends opportunities for automation and assesses potential benefits to enhance operational efficiency.

- Develops and implements design, automation tools, or scripts to provide solutions, gather metrics, monitor, analyze, mitigate, or remediate issues/defects within infrastructures.

- Conducts testing on moderately complex automations to ensure they perform tasks correctly and produce expected results.

Technical Communication and Guidance:

- Writes release notes and/or communicates comprehensive information about the scale, capacity, security, performance attributes, and requirements of services and technology with customers and immediate and related teams.

- Proactively anticipates and articulates the potential impact of infrastructure, feature, and tool changes, considering their impact across team operations.

- Serves as a resource to team members on what information to communicate and how to communicate.

Troubleshooting and Resolution:

- Provides comprehensive operational support for technology, serving as a key escalation point for incidents and moderately complex issues arising within Oracle services.

- Drives and actively participates in on-call shifts to address issues.

- Executes the resolution of technical issues spanning multiple services, applying advanced investigation and debugging techniques to achieve SLOs (service level objectives).

- Documents incidents according to reporting methods and performs root cause analyses, capturing essential information for analysis and future reference.

- Performs post-mortem procedures to prevent incident reoccurrence.

Innovation and Improvement:

- Conducts advanced experiments and evaluations of cutting-edge tools and technologies to optimize infrastructure performance and reliability, taking proactive steps to adhere to security standards.

- Identifies and seeks opportunities to execute improvements for performance bottlenecks and deployments, ensuring efficient resource usage, speed, and scalability.

- Develops and maintains advanced knowledge of site reliability trends, sharing valuable insights and information with senior team members, management, and beyond to promote innovative building, testing, deploying, and running services.

- Performs moderately complex analyses and provides clear data on production to drive business development decisions (e.g., design changes).

Core Responsibilities

Planning & Execution:

- Manages and coordinates moderately complex tasks, monitoring timelines and deliverables to ensure timely completion and adherence to requirements for a moderately sized project or initiative. Efficiently delegates, monitors, and prioritizes work across multiple projects, providing technical oversight and adjusting plans to address shifts in resources or timelines.

Collaboration & Partnership:

- Collaborates across the organization to align on expectations and achieve shared objectives. Leverages understanding of business leaders, stakeholders, and/or customers to ensure proposed solutions meet their needs. Supports inclusivity by actively seeking and listening to diverse perspectives, ensuring others feel heard and respected.

Problem Solving:

- Identifies and addresses moderately complex issues by analyzing a wide range of data and/or information to identify solutions in accordance with standard practices. Proactively escalates unresolved or critical issues with a thorough assessment and suggests potential solutions. Reviews, contributes to, and documents problem solving strategies.

Continuous Learning:

- Pursues learning opportunities to expand knowledge and skills and/or tools in new areas and stays abreast of the latest industry trends and best practices. Proactively seeks and leverages ongoing feedback and training to improve skills. Coaches and mentors junior team members, fostering continuous learning and knowledge sharing within and across teams.

Continuous Improvement:

- Develops ideas, recommends updates, and/or collaborates on the implementation of process improvements to increase the efficiency and effectiveness of processes, protocols, and workflows across teams, and evaluates the impact on key stakeholders. Solicits feedback from others on ideas for alternative approaches and methods for continued improvement.

Performance and Development:

- Contributes to the talent development pipeline by participating in candidate interviews, assessing candidates, and providing hiring recommendations.

Qualifications

Disclaimer:

Certain U.S. based or U.S. customer or client-facing roles may be required to comply with applicable requirements, such as immunization/occupational health mandates, and/or drug testing requirements.

Range and benefit information provided in this posting are specific to the stated locations only

US: Hiring Range in USD from: $84,900 to $209,500 per annum. May be eligible for bonus and equity.

Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.

Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short term disability and long term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner and pet insurance

The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC4

About Us

Only Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers turn that promise into a better future for all. Discover your potential at a company leading the way in AI and cloud solutions that impact billions of lives.

True innovation starts when everyone is empowered to contribute. That's why we're committed to growing a workforce that promotes opportunities for all with competitive benefits that support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.

We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing View email address on click.appcast.io or by calling View phone number on click.appcast.io in the United States.

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Principal Site Reliability Engineer in Nashville, TN vacancy
  • Designs and architects infrastructure and service to ensure reliability and functionality. Forecasts demands and responds to capacity needs...  ...new tools and develops and maintains advanced knowledge of site reliability trends. Only Oracle brings together the data, infrastructure... 
    Principal
    Full time
    Flexible hours

    Oracle

    Nashville, TN
    3 days ago
  •  ...Role: Site Reliability Engineer (SRE) Location: Brentwood, TN (Onsite) Contract Experience: 6-8+ years Role Description: Combines software engineering and IT operations to ensure the reliability, scalability, and performance of systems, with... 
    Suggested
    Contract work

    AceStack LLC

    Brentwood, TN
    5 days ago
  •  ...Site Reliability Engineer Visa: USC,GC only Rate: DOE Position is remote to start, then after conversion to W2, moves into one of three offices: Nashville, Los Angeles or New York Job Description: Strong problem solving/triage skills Strong cloud/infrastructure... 
    Suggested
    Remote work

    ShiftCode Analytics

    Nashville, TN
    2 days ago
  •  ...Job Title: Site Reliability Engineer - Compute Focus Duration: 6 Months to Hire Location: On-Site 2-3 days/week in Nashville, TN Job Description: We are seeking a skilled Site Reliability Engineer with a focus on compute infrastructure to join our dynamic... 
    Suggested
    2 days per week
    3 days per week

    United IT Solutions

    Nashville, TN
    4 days ago
  • $51.9 per hour

     ...job is responsible for the reliability, availability, and performance...  ...healthcare IT systems, principally in the Environment of Care (...  .... This role blends software engineering, clinical engineering, and security...  ...cross-functionally with AHN site leaders and teams to... 
    Suggested
    For contractors
    Local area

    Highmark Health

    Nashville, TN
    4 days ago
  •  ...delivery systems that demand ultra‑low latency, exceptional reliability, and global performance. This is a senior individual contributor...  ...complex distributed systems problems, and raises the bar for engineering excellence across teams. Why OCI At OCI, you will work on... 
    Principal

    Ll Oefentherapie

    Nashville, TN
    6 days ago
  • Site Reliability Engineer II About the Role This role focuses on enhancing system reliability and scalability for PROS’s platform, contributing to automation and self‑service tool development. The engineer will optimize performance, monitor service reliability, implement... 

    PROS Holdings, Inc.

    Nashville, TN
    2 days ago
  • A leading analytics and data platform company is seeking a Principal Engineer to drive innovation in Agentic AI. This role involves architecting secure frameworks for autonomous AI agents and collaborating with cross-functional teams to implement cutting-edge capabilities... 
    Principal
    Flexible hours

    Teradata Corporation (SE)

    Nashville, TN
    6 days ago
  •  ...Principal Software Engineer Department: Transportation Reports To: Chief Technology Officer Location: Remote (U.S. based) Travel: No Summary of Position The Software Engineer works as part of the Software Engineering team to understand, design, and implement features.... 
    Principal
    Remote work

    i3-Milestone

    Nashville, TN
    3 days ago
  • $143k - $243k

    A leading healthcare company is seeking a Senior Principal Actuary to provide actuarial direction and thought leadership. This remote position involves creating actuarial modeling concepts and strategic consulting. Candidates should have 10 years of actuarial experience... 
    Principal
    Remote work

    Prime Therapeutics

    Nashville, TN
    5 days ago
  •  ...improve software solutions to ensure system reliability and availability, mitigate operational...  ...issues. You will help lead chaos engineering efforts in a production‑alike environment...  ...professionals, with engineers focused on site reliability engineering and observability... 
    Permanent employment
    Flexible hours

    Teradata

    Nashville, TN
    2 days ago
  •  ...seasonal items at everyday low prices in convenient neighborhood locations. Learn more about Dollar General at Job Details A Principal Software Engineer (PSE) is recognized as a master software engineer able to solve the most complex technical problems. They lead and manage... 
    Principal
    Work experience placement
    Seasonal work

    ∙ Elijah House Foundation

    Goodlettsville, TN
    2 days ago
  •  ...experiences. You will have a direct impact on a greenfield platform backed by significant investment. Collaborate with a highly technical, distributed engineering team to define the monetization architecture for next-generation video delivery. #J-18808-Ljbffr Ll Oefentherapie
    Principal

    Ll Oefentherapie

    Nashville, TN
    6 days ago
  • Position: Linux Site Reliability Engineer Location: Nashville, TN Job Id: 1133 # of Openings: 1 Linux Site Reliability Engineer will be a...  ...hosted/managed technologies. Additionally, Linux SRE will be principally involved in the engineering of infrastructural solutions,... 

    GoTo Meeting

    Nashville, TN
    4 days ago
  • Work Locations Nashville, TN Austin, TX Broomfield, CO As a Principal Engineer inside the Oracle Cloud Infrastructure Interactive Media team, you will have the opportunity to solve challenging technical problems and function as a lead developer in the development,... 
    Principal

    Ll Oefentherapie

    Nashville, TN
    6 days ago
  • $96.8k - $306.4k

     ...embarking on ambitious new initiatives such as canonical implementation of core components for data planes. We are hoping to enhance engineering efficiency by concentrating our expertise on building low level systems with high performance that can be adopted by our core... 
    Principal
    Temporary work
    Work experience placement
    Worldwide
    Flexible hours

    Oracle

    Nashville, TN
    2 days ago
  • Ll Oefentherapie in Nashville, TN is seeking a Principal Engineer to lead the development of innovative cloud services within the Oracle Cloud Infrastructure Interactive Media team. The successful candidate will solve challenging technical problems and mentor a talented... 
    Principal

    Ll Oefentherapie

    Nashville, TN
    3 days ago
  • Ll Oefentherapie is looking for a Principal Product Manager to drive product strategy and execution for OCI Developer Tool Products. This role emphasizes improving developer experience across various aspects of software development. Your responsibilities will include leading... 
    Principal

    Ll Oefentherapie

    Nashville, TN
    2 days ago
  • $104.5k - $234.6k

     ...software development lifecycle; provides guidance and coaching to engineers to drive improvements. Utilizes advanced knowledge to...  ...to ensure service/product availability, health, support, and reliability. Core Responsibilities Planning & Execution: ~ Manages... 
    Principal
    Temporary work
    Flexible hours
    Shift work

    Oracle

    Nashville, TN
    2 days ago
  •  ...cultures and affords personal and professional growth opportunities. Learn more at . Overview of Job Function Verint’s Principal Software Engineer designs and develops key cloud-first, full-stack software products. This role works with Product Management, Development,... 
    Principal
    Local area
    Shift work

    Verint Systems

    Nashville, TN
    5 days ago
  • $144.5k - $195.5k

     ...Principal Software Engineer We are looking for a software engineering leader who is passionate about creating next-generation healthcare software that will dramatically improve the lives of patients, clinicians, and caregivers. This person will have the opportunity... 
    Principal
    Full time
    Temporary work
    Local area
    Flexible hours

    TENDO

    Nashville, TN
    2 days ago
  • $156.6k - $215.4k

    Humana Inc in Nashville, TN is seeking an engineer to transform IT Service Management and Technology Lifecycle Management. This role will leverage automation and AI to improve data quality and decision-making. Key responsibilities include automating workflows, integrating... 
    Principal

    Humana Inc

    Nashville, TN
    2 days ago
  • PeopleFind is seeking a Reliability Engineer in Knoxville, TN, to lead and develop the reliability program within a growing manufacturing operation...  ...root cause analysis, and supporting maintenance teams on-site. Candidates should possess a bachelor's degree in engineering... 
    Relocation package

    PeopleFind

    Nashville, TN
    3 days ago
  • Ll Oefentherapie in Nashville, TN, is seeking a Lead Principal Security Researcher to ensure a large-scale ad system is compliant with policies and regulations. The role involves validating system behavior and producing executive-ready reports on assurance and remediation... 
    Principal

    Ll Oefentherapie

    Nashville, TN
    5 days ago
  • $139.4k - $306.4k

    ORACLE Deutschland B.V. & Co. KG in Nashville, Tennessee, is looking for a Lead Principal Security Researcher to ensure compliance of a large-scale advertising system. The role involves validating platform behavior and designing processes for empirical validation of controls... 
    Principal

    ORACLE Deutschland B.V. & Co. KG

    Nashville, TN
    5 days ago
  •  ...s fun to work in a company where people truly BELIEVE in what they're doing! Job Description Summary: The Customer Solutions Engineer a highly skilled Mainframe Modernization Senior Consultant to provide technical support and/or leadership in the creation and delivery... 
    Principal
    Local area
    Remote work
    Worldwide

    Rocket Software

    Nashville, TN
    1 day ago
  • Dollar General is looking for a Principal Software Engineer to lead software development efforts and manage on-shore and off-shore teams. This role requires an expert understanding of IT tools, ability to mentor team members, and a strong focus on code quality and standards... 
    Principal

    ∙ Elijah House Foundation

    Goodlettsville, TN
    4 days ago
  • Overview OCI is hiring a Sr Principal Software Engineer for the Crypto organization's KMS team. This senior IC will guide the architecture and operational maturity of a Tier 0 key management service that protects customer cryptographic keys and internal OCI service... 
    Principal
    Full time
    Flexible hours

    Oracle

    Nashville, TN
    4 days ago
  • Dormont Manufacturing Co is seeking a Structural Integrity Engineer for their Tennessee location. The role involves performing analysis on mechanical parts associated with aircraft equipment, focusing on static stress, fatigue, vibration, and thermal analysis. Candidates... 
    Principal

    Dormont Manufacturing Co

    Nashville, TN
    2 days ago
  • $144.2k - $288.4k

    Position Summary As a Principal Software Engineer, you will define and drive the technical direction for modern, cloud‑native applications built...  ...platforms. You will establish clear expectations for reliability, security, performance, and operability, ensuring strong... 
    Principal
    Hourly pay
    Full time
    Temporary work
    Local area

    Hispanic Alliance for Career Enhancement

    Nashville, TN
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Principal Site Reliability Engineer. Be the first to apply!