Director, Site Reliability Engineering
$121.5k - $306.4kOracle
Job Description
Provides leadership to one or more teams designing and architecting infrastructure and service and provides input on best practices for reliability and functionality. Establishes direction to ensure accurate forecasting and ensure systems have adequate resources. Builds collaborative relationships with the software development team to create reliable, scalable infrastructures. Ensures alignment regarding data collection and contributes to standards for optimizing operations and infrastructure reliability. Defines approaches for incident response activities to ensure service reliability. Ensures in-depth reports. Plays a key role in developing standards for identifying and recommending automation. Anticipates and explains the impact of changes, mentoring other managers on what to communicate. Defines approaches for escalating incidents and refines methods for documentation. Encourages experimenting with new technology, executing improvements, building site reliability knowledge, and providing clear data.
#LI-ES2
Responsibilities
Key Responsibilities
Capacity Ingestion and Management:
-Provides leadership for one or more teams designing and architecting infrastructure and/or service, providing input on the development of best practices for adhering to terms for reliability and functionality.
-Establishes direction for other managers and senior-level individuals to drive the forecasting of demands for infrastructure and respond to capacity needs, ensuring that systems have sufficient resources to meet current and future workloads and identifying and addressing resource gaps.
-Builds collaborative relationships with senior software development team members to design and develop infrastructures that are highly reliable and scalable, meeting stringent deployment requirements.
-Ensures teams align on expectations for identifying opportunities for prototyping and oversees prototyping initiatives (e.g., testing new applications or infrastructures, assisting in onboarding), experimenting with cutting-edge approaches.
Incident and Service Lifecycle Management:
-Ensures alignment across teams regarding performing data collection, triage, technical analysis, and redirection, contributing to the development of standards to maintain and optimize operations and infrastructure reliability.
-Shares techniques across teams for monitoring of services, maintaining up-to-date knowledge of their performance, and thoroughly documenting their condition.
-Defines approaches for performing incident response, root cause analysis, and/or maintenance on assigned services (e.g., software installs, version upgrades, security updates, backup and recovery) and drives execution.
-Ensures teams provide in-depth health and performance reporting and coordinates managerial actions based on trends in data.
-Refines procedures for performing provisioning to support infrastructure, applications, and services, mentoring team members.
-Provides input on standards for decommissioning (e.g., shutting down servers, removing data from databases) to remove objects that are no longer needed.
Automation:
-Plays a key role in developing standards for identifying and recommending opportunities for automation and reviewing potential benefits in terms of metrics across teams to ensure expectations are met.
-Ensures alignment on expectations for developing and drives the implementation of design, automation tools, or scripts.
-Refines strategies for conducting testing on highly complex automations to ensure they perform tasks correctly and produce expected results.
-Provides guidance and expertise to others testing automations.
Technical Communication and Guidance:
-Shares expectations for release notes and communication of in-depth information about the scale, capacity, security, performance attributes, and requirements of services and technology with customers, cross-functional teams and leadership.
-Anticipates and explains the potential impact of infrastructure, feature, and tool changes, considering the strategic implications and goals.
-Takes a leadership role in mentoring other managers on what information to communicate and how to communicate.
Troubleshooting and Resolution:
-Defines approaches for escalating incidents and other highly complex issues arising within Oracle services within and across teams.
-Coordinates with other team leaders to review service performance and ensure the resolution of technical issues spanning multiple services and customers, encouraging collaboration across teams and leveraging advanced investigation and debugging techniques to ensure the achievement of SLOs (service level objectives).
-Refines standard reporting methods for incident documentation and performing root cause analyses, aiming to capture insights and lessons learned for continuous improvement and knowledge sharing.
-Plays a key role in creating guidelines for post-mortem procedures to prevent incident reoccurrence.
-Communicates with other team leaders to ensure adherence to service level agreements (SLAs) made with customers.
Innovation and Improvement:
-Encourages creativity and innovation and coordinates with other leaders to drive the exploration and adoption of innovative tools and technologies to transform infrastructure performance and reliability, investigating implications of adherence to security standards on other integrations.
-Provides input on initiatives to improve performance bottlenecks and optimize deployments, aligning other leaders on expectations for efficient resource usage, speed, and scalability and driving roadmap development.
-Refines standards for developing and maintaining knowledge of site reliability trends and sharing valuable insights and information cross-functionally to drive innovation in building, testing, deploying, and running services.
-Plays a key role in the review of analyses and data, driving and influencing business development decisions (e.g., design changes).
Core Responsibilities
Planning & Execution:
-Oversees and guides multiple teams on managing complex projects or initiatives, monitoring timelines, deliverables, and budgets when applicable to ensure strategic objectives are met. Serves as a role model for appropriately delegating work, setting priorities, and ensuring alignment with business needs. Coaches others on adjusting resources or project timelines in anticipation of business changes.
Collaboration & Partnership:
-Role models leading cross-functional collaborative efforts to ensure alignment of expectations and strategic objectives. Empowers team to build and maintain partnerships with business leaders, stakeholders, and/or customers to address barriers and contribute to organizational success. Drives transparency and inclusivity by modeling actively seeking, listening to, and leveraging diverse perspectives.
Problem Solving:
-Shares problem-solving strategies across teams, providing oversight on complex operational and/or technical issues, as needed. Coaches teams on analyzing highly complex data and/or information to identify solutions to ambiguous issues and provides direction on identifying root causes to prevent recurrence of issues.
Continuous Learning:
-Pursues strategic learning opportunities to maintain expertise and apply best practices at the organizational level. Creates opportunities for team members and leaders to build their expertise in new areas, coaching them to build innovative skills. Identifies skill gap trends across the organization, and upholds a culture that places significant emphasis on sharing knowledge and pursuing learning opportunities that advance the organization. Evaluates efficiency of learning strategies and recommends adjustments as needed.
Continuous Improvement:
-Empowers team to own the development and implementation of ideas that increase the efficiency and effectiveness of processes, protocols, and workflows across the department. Coaches teams to gain buy-in for ideas and to seek feedback on approaches and methods for continued improvement. Prioritizes and reviews the roadmap of improvement initiatives to ensure alignment with strategic direction and maximize return on investments.
Performance and Development:
-Serves as a role model for driving performance across teams through tailored feedback and coaching in alignment with performance management processes, guidelines, and expectations. Drives consistency in the application of talent development procedures and socializes performance expectations across the organization. Ensures that individual development goals are aligned with organizational strategic initiatives. Collaborates with HR to implement talent strategy through hiring and promotion processes.
Disclaimer:
Certain U.S. based or U.S. customer or client-facing roles may be required to comply with applicable requirements, such as immunization/occupational health mandates, and/or drug testing requirements.
Range and benefit information provided in this posting are specific to the stated locations only
US: Hiring Range in USD from: $121,500 to $306,400 per annum. May be eligible for bonus, equity, and compensation deferral.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
Medical, dental, and vision insurance, including expert medical opinion
Short term disability and long term disability
Life insurance and AD&D
Supplemental life insurance (Employee/Spouse/Child)
Health care and dependent care Flexible Spending Accounts
Pre-tax commuter and parking benefits
401(k) Savings and Investment Plan with company match
Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
11 paid holidays
Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
Paid parental leave
Adoption assistance
Employee Stock Purchase Plan
Financial planning and group legal
Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - M4
About Us
Only Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers turn that promise into a better future for all. Discover your potential at a company leading the way in AI and cloud solutions that impact billions of lives.
True innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing a workforce that promotes opportunities for all with competitive benefits that support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing View email address on click.appcast.io or by calling View phone number on click.appcast.io in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
$95k - $171k
.... Opportunities exist to focus on GPU infrastructure, Kubernetes, and ensuring reliability for AI workloads within Akamai's serverless inference platform. As an Site Reliability Engineer II, you will be responsible for: Building and maintaining dashboards, alerts...SuggestedPermanent employmentWork experience placementWork at officeRemote workWork from homeWorldwideFlexible hours$75.7k - $136.3k
...solve complex challenges? Do you have a passion for automation and building systems that scale? Join our highly skilled Site Reliability Engineering team! Our team designs, develops, and manages applications and infrastructure that support Akamai Cloud's products and...SuggestedWork experience placementWork at office$124k - $186k
...Site Reliability Engineer II Relentless protection. Resilient world. Mimecast was born in 2003 with a focus on delivering relentless protection. Each day, we take on cyber disruption for our tens of thousands of customers around the globe; always putting them first...SuggestedWork at officeLocal areaImmediate start2 days per week- Job Title Primary Skill PCF (Pivotal Cloud Foundry) and Mongo DB Exposure to at least 1 Observability Tool such as AppDynamics, Splunk, Grafana Change Mgmt using CI/CD pipeline. Harness or equivalent tools Secondary Skill SSL Certificate management...Suggested
- ...Site Reliability Engineer (SRE) Location: Columbus, OH | Iselin, NJ (Onsite) Job Type: Long Term Contract Key Responsibilities Enhance platform reliability, performance, and observability Build dashboards and alerts using APM tools (Splunk, ELK, Grafana...SuggestedLong term contract
$121.4k - $218.6k
...will be responsible for ensuring best-in-class uptime and reliability of our AI hardware infrastructure offerings. **Partner with... ...and defend them when they are breached. As a Senior Site Reliability Engineer, you will be responsible for: + Developing and scaling robust...Work experience placementWork at office$109.5k - $150.55k
...strive for the best, own our actions, and grow and evolve. Job Description Renaissance is looking for an experienced Sr Site Reliability Engineer to be part of the Engineering Enablement group's Site Reliability Team with a focus on Application and Infrastructure...For contractorsLocal areaRemote workWorldwideWork visaFlexible hoursWeekend work$84.9k - $209.5k
...Designs and architects infrastructure and service to ensure reliability and functionality. Forecasts demands and responds to capacity needs... ...new tools and develops and maintains advanced knowledge of site reliability trends. #LI-E2 Responsibilities Key Responsibilities...Temporary workImmediate startFlexible hoursShift work- ...Director of Site Reliability Engineering You have discovered the perfect setting to expand your skills and make a meaningful impact. Partner with an organization committed to defining the future of site reliability in the financial sector. As a Director of Site...
- Fairygodboss is looking for a Technology Engineer to join their Site Reliability Center. This role involves leading incident responses, conducting root cause analysis, and ensuring system availability and performance. You will also develop monitoring dashboards and automate...
- DevOps Site Reliability Engineer page is loaded## DevOps Site Reliability Engineerremote type: On-Sitelocations: Columbus, Ohiotime type: Full timeposted on: Posted 10 Days Agojob requisition id: R-102302**Job Description:******Overview****As a DevOps Site Reliability Engineer...Remote work
- ...Job Description The Ohio State University College of Medicine and the Wexner Medical Center seek a Director of IBD to join the Division of Gastroenterology, Hepatology, and Nutrition. Academic rank track commensurate with academic record and experience. Position...TraineeshipWork at officeRelocation
- ...improve software solutions to ensure system reliability and availability, mitigate operational... ...issues. You will help lead chaos engineering efforts in a production‑alike environment... ...professionals, with engineers focused on site reliability engineering and observability...Permanent employmentFlexible hours
- Job Summary Vertiv is seeking a skilled Platform Operations Engineer (Site Reliability Engineer) to serve as the owner of cross‑platform observability, incident management, and operational reliability within Vertiv’s Digital organization. This individual contributor role...Temporary work
- ...our Dialysis Services team. Position Overview As the Executive Director of Dialysis Services, you will oversee the administration and clinical... ...CareOversee the clinical services across multiple dialysis sites to ensure high-quality care in compliance with regulatory...RelocationRelocation package
$300k
...Job Description Healthcare United is seeking a Board Certified Physician for a Director / Lead Physician role within a progressive, value-based care model serving medically complex patient populations in the Columbus area. This position offers a strong blend of clinical...Relocation package- ...a full-time MD or DO to join our 7-7-7 ACGME-accredited Family Medicine Residency Program as an Assistant or Associate Program Director. Our unopposed, community-based program is supported by 7 core faculty and a team of adjunct faculty, providing full-spectrum primary...Full timeH1b
$238.83k - $341.19k
...different than most primary care providers. We’re rapidly expanding and we need great people to join our team. The Clinical Director will directly supervise and train primary care providers (PCPs) in his/her assigned center. The incumbent in this role is accountable...Work at office$238.83k - $341.19k
...different than most primary care providers. We’re rapidly expanding and we need great people to join our team. The Clinical Director will directly supervise and train primary care providers (PCPs) in his/her assigned center. The incumbent in this role is accountable...Work at office$180k - $303.6k
...About the Role PagerDuty is seeking a Director of Pricing & Monetization to own the... ...frameworks - in partnership with Product and Engineering Build and maintain a monetization... ...-specific offerings, on our benefits site ( . Your package may include:...Local areaFlexible hours$135.4k - $208.1k
...of those use cases as a player coach, shaping architecture, building or co-building proofs of concept, and reviewing the team's engineering work from idea through production. Move the strongest proofs of concept into scalable, supportable, production ready solutions...Full timeTemporary workFor contractorsLocal areaFlexible hours- ...Director, Inventory Control 1-800-Flowers.com is seeking a strategic and results-oriented... ...Bachelor's degree in Supply Chain, Business, Engineering, or a related field. 10+ years of... ...accuracy initiatives within complex, multi-site operations. Experience with Warehouse...Seasonal work
$157.98k - $252.77k
...Job Description The Director, Demand Generation is a strategic and execution-focused marketing leader responsible for driving growth... ...aware of scams from individuals, organizations, and internet sites claiming to represent Blue Cross and Blue Shield of North Carolina...Work at officeLocal areaRemote workFlexible hours2 days per week$122k - $150k
...to making a positive impact on people's lives. Position Summary Reporting to the Vice President of Customer Success, the Director of Patient Experience will be responsible for the overall performance of a large call center assisting patients in multiple channels...Full timeRemote workMonday to FridayShift workNight shiftWeekend work$148.84k - $198.45k
...the challenge. Join us in building the future. The Role Director II, SLED Capture & Proposal Management - Public Sector Location... ..., and the ability to influence across sales, solutions engineering, product, and operational teams. The director will oversee SLED...Full timeContract workTemporary workLocal areaRemote work$185k - $220k
...Job Summary The Director, Medical Affairs (Nutrition) is responsible for leading and managing medical affairs for approved nutrition products... ...area who are able to regularly work at our Lake Zurich, IL site. *This position does not offer visa sponsorship either now or in...Permanent employmentWork at officeNight shift- ...innovation, growth and success. We strive to provide them with opportunities to grow in a motivating environment. Job Description The Director, Talent Management contributes to human resources strategy and executes employment programs related to the growth and measurement...Temporary work
- ...maintains corporate offices in London, New York, Dallas, and Seattle. Location: Le Meridien Columbus, The Joseph Overview: The Director of Banquets is responsible for coordinating, supervising and directing all aspects of the hotel’s banquet operations, while...Hourly payLocal areaImmediate start
$55k - $62k
...Director of Academics Eastland Preparatory Academy About the Opportunity: Assume ownership and responsibility for developing and supporting the school's instructional staff by: Guiding teachers in the effective use of instructional and support materials...Temporary workLocal area$139.4k - $291.8k
...Cloud Infrastructure (OCI) is seeking a Director, Commissioning QA/QC to lead startup, commissioning... ...-focused leadership role with expected site engagement of 60-80% during turnover,... ..., including commissioning managers, engineers, vendors, and contractors; ability to set...Temporary workFor contractorsRemote workRelocation packageFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Director, Site Reliability Engineering. Be the first to apply!
- principal developer Columbus, OH
- engineering director Columbus, OH
- senior chief engineer Columbus, OH
- chief engineer Columbus, OH
- data center chief engineer Columbus, OH
- civil engineer project manager Columbus, OH
- senior civil engineer project manager Columbus, OH
- director data engineering Columbus, OH
- hotel chief engineer Columbus, OH
- project engineer assistant project manager Columbus, OH



