Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Site Reliability Engineer

Oracle

Job Description

We are looking for a Site Reliability Engineer 3 to support mission-critical cloud services and production operations. The role focuses on improving service reliability, reducing operational risk, automating repetitive tasks, and driving faster detection and resolution of issues.

The engineer will work closely with development, infrastructure, security, and operations teams to monitor service health, troubleshoot production issues, participate in incident response, improve observability, and implement reliability best practices. This role also includes analyzing recurring failures, building automation, supporting deployments, and contributing to capacity planning, disaster recovery, and operational readiness.

Also works on number of different region/realm rollouts, deployments. Forecasts demands and responds to capacity needs. Collaborates with software development teams to develop reliable and scalable infrastructures. Performs data collection to maintain and optimize operations and reliability. Leverages knowledge to perform incident response and/or maintenance tasks. Provides health and performance reporting. Identifies opportunities for automation. Communicates about services and identifies and explains the potential impact of changes. Provides support for technology and document incidents. Experiments with new tools and assesses potential impact and develops knowledge of site reliability trends.

Responsibilities

Key Responsibilities
Capacity Ingestion and Management:
-Takes proactive steps to design and architect infrastructure and/or service according to terms for reliability and functionality.
-Forecasts demands for infrastructure and responds to capacity needs, ensuring systems have sufficient resources to handle current and future workloads.
-Collaborates with the software development team to develop infrastructures and features that are reliable and scalable according to deployment requirements.
-Independently identifies opportunities for and drives prototyping (e.g., testing new applications or infrastructures, assisting in onboarding).
Incident and Service Lifecycle Management:
-Performs data collection, triage, technical analysis, and redirection to maintain and optimize operations and infrastructure reliability.
-Independently monitors services, maintains up-to-date knowledge of their performance, and documents their condition.
-Leverages comprehensive knowledge to perform incident response, root cause analyses, and/or maintenance on assigned services (e.g., software installs, version upgrades, security updates, backup and recovery).
-Provides health and performance reporting and takes appropriate actions based on trends in data.
-May independently perform provisioning to support infrastructure, applications, and services.
-May perform standard and non-standard decommissioning (e.g., shutting down servers, removing data from databases) to remove objects that are no longer needed.
Automation:
-Identifies opportunities for automation and assesses potential benefits.
-Develops automation tools or scripts to provide solutions, gather metrics, monitor, analyze, mitigate, or remediate issues/defects within infrastructures.
-Independently conducts testing to ensure automation performs the task correctly and produces expected results.
Technical Communication and Guidance:
-Communicates the scale, capacity, security, performance attributes, and requirements of services and technology within and sometimes beyond immediate team.
-Identifies and explains the potential impact of infrastructure, feature, and tool changes, considering their impact on team operations.
Troubleshooting and Resolution:
-Provides operational support for technology, escalating incidents and other standard and non-standard issues arising within Oracle services.
-Participates in on-call shifts to address issues.
-Resolves technical issues spanning various services, investigating and debugging products in order to reach SLOs (service level objectives).
-Documents incidents and performs root cause analyses according to standard reporting methods.
-Independently performs post-mortem procedures to prevent incident reoccurrence.
Innovation and Improvement:
-Experiments with new tools and technologies to assess their potential impact on and improve infrastructure performance and reliability, ensuring adherence to security standards.
-Independently identifies and executes improvements for performance bottlenecks and deployments to ensure efficient resource usage, speed, and scalability.
-Develops knowledge of site reliability trends and shares new information with team members, management, and beyond to help others build, test, deploy and run services.
-Performs standard and non-standard analyses and provides clear data on production to contribute to business development decisions (e.g., design changes).

Core Responsibilities
Planning & Execution:
Independently manages work, monitoring timelines and deliverables to ensure projects or initiatives stay on track and meet requirements. Proactively prioritizes work and adapts to resource or timeline shifts, suggesting adjustments to maintain project efficiency.
Collaboration & Partnership:
Collaborates across teams to align on expectations and achieve shared objectives. Builds and maintains a comprehensive understanding of business, stakeholder, and/or customer needs to build and support effective partnerships. Actively listens to diverse perspectives and asks questions to ensure understanding of others.
Problem Solving:
Independently identifies and addresses standard and non-standard issues in accordance with standard practices, escalating more complex issues as appropriate. Analyzes data and/or information from multiple sources to troubleshoot standard and non-standard errors. Contributes to knowledge sharing and best practices.
Continuous Learning:
Embraces continuous learning by actively seeking to build knowledge and new skills and/or tools and staying current with industry trends and best practices. Seeks out and leverages feedback and training to improve skills. Contributes to a culture of continuous learning and knowledge sharing with team members.
Continuous Improvement:
Develops ideas and recommends updates to increase the efficiency and effectiveness of processes, protocols, and workflows within a team. Seeks input from team members on alternative approaches and methods for improving work.

IAC: Terraform, Chef, Ansible

Languages: Python, Java, Bash


Orchestration: Kubernetes, Helm

CI/CD: Jenkins

Observability: Grafana, Prometheus

Qualifications

Minimum Job Qualifications
Education and/or Experience:
8 years of experience in software engineering, infrastructure management, or related field

OR

Bachelor's Degree in Computer Science, Engineering, or related field AND 4 years of experience in software engineering, infrastructure management, or related field

OR

Master's Degree in Computer Science, Engineering, or related field AND 2 year of experience in software engineering, infrastructure management, or related field.

OR

Doctorate in Computer Science, Engineering, or related field

Job Skills:
Same skills as prior level plus;
Operating Systems Demonstrated ability in or knowledge of operating systems, including installing, upgrading, and troubleshooting various operating environments.

Automation Experience:
3 years of experience in automation.

Programming Experience:
3 years of experience in programming and/or scripting.

Preferred Job Qualifications
Education and/or Experience:
9 years of experience in software engineering, infrastructure management, or related field

OR

Bachelor's Degree in Computer Science, Engineering, or related field AND 5 years of experience in software engineering, infrastructure management, or related field

OR

Master's Degree in Computer Science, Engineering, or related field AND 3 years of experience in software engineering, infrastructure management, or related field

OR

Doctorate in Computer Science, Engineering, or related field AND 1 year of experience in software engineering, infrastructure management, or related field.
Automation Experience:
5 years of experience in automation.
Programming Experience:
5 years of experience in programming and/or scripting.

About Us

Only Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers turn that promise into a better future for all. Discover your potential at a company leading the way in AI and cloud solutions that impact billions of lives.

True innovation starts when everyone is empowered to contribute. That's why we're committed to growing a workforce that promotes opportunities for all with competitive benefits that support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.

We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing View email address on click.appcast.io or by calling View phone number on click.appcast.io in the United States.

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Senior Site Reliability Engineer in Pleasanton, CA vacancy
  •  ...products and services that help people, businesses and governments realize their greatest potential. Title and Summary Senior Site Reliability Engineer Who is Mastercard? At Mastercard technology, we work to connect and power an inclusive, digital economy that... 
    Senior
    Full time
    Worldwide

    Mastercard

    Dublin, CA
    4 days ago
  •  ...Workday, Inc. is looking for a Sr Software Development Engineer in Pleasanton, California. The role involves designing and leading implementations of core features for the Agent System of Record. We're seeking candidates with expertise in various programming languages... 
    Senior
    Remote work
    Flexible hours

    Workday

    Pleasanton, CA
    3 days ago
  • Kappaalphapsi1911 is seeking a Senior Software Engineer based in Pleasanton, CA. The ideal candidate will design and optimize core software platforms while enhancing the performance of existing systems. With 8+ years in software development, you will lead design efforts... 
    Senior

    Kappaalphapsi1911

    Pleasanton, CA
    5 days ago
  • $108.5k - $135.6k

    Seeds Renewables is seeking a Senior Reliability Engineer for an onsite position in El Segundo, CA. The role involves evaluating and enhancing product reliability through statistical analysis, reliability modeling, and testing. Responsibilities include performing life data... 
    Senior

    Seeds Renewables

    Livermore, CA
    2 days ago
  • Cts Technology Solutions, Inc. is seeking a Senior Software Engineer in Pleasanton, California, to develop hardware and software security solutions for commercial and aerospace applications. In this role, you will lead activities for software development, integration,... 
    Senior
    Full time
    Contract work

    Cts Technology Solutions, Inc.

    Pleasanton, CA
    5 days ago
  •  ...Sr. Software Engineer Perfict Global is a leading IT consulting services provider focused on providing innovative and successful business workforce solutions to Fortune 500 companies. Our trained and experienced professionals constantly strive to bring together the... 
    Senior
    Contract work
    Work experience placement
    Immediate start

    Perfict Global, Inc.

    Dublin, CA
    2 days ago
  • Workday, Inc. is seeking a Principal Product Manager for the Workday Build developer platform in Pleasanton, California. In this role, you will own the transformation of our developer platform into a frictionless space for enterprise AI building, focusing on enhancing the...
    Senior

    Workday

    Pleasanton, CA
    2 days ago
  •  ....g. OAuth2, OIDC. Experience with Apigee or another API Gateway is a plus - Preferred experience using Docker or another container engine, OpenShift - Hands on experience in designing systems and developing applications - Knowledge of developing software deployed on cloud... 
    Senior

    Omni Inclusive

    Pleasanton, CA
    4 days ago
  • $114.6k - $183.4k

     ...We're looking for a Senior Software Engineer This role is Office Based, Dublin Office We are seeking a Senior Software Engineer...  ...senior engineers, architects, and product teams to deliver reliable, scalable solutions that drive customer value. In this... 
    Senior
    Full time
    Work at office
    Local area

    Cornerstone OnDemand

    Dublin, CA
    2 days ago
  • Workday, Inc. is seeking a Senior Product Manager for the Developer Platform in Pleasanton, CA. In this role, you will enhance the developer community's efficiency by managing a platform that leverages ML/AI for enterprise application development. The ideal candidate has... 
    Senior
    Remote job

    Workday, Inc.

    Pleasanton, CA
    2 days ago
  • $60 - $67 per hour

    The Fountain Group is seeking a Senior Software Systems Engineer in Pleasanton, CA. In this role, you will lead software requirements engineering for medical device solutions. Your expertise will guide cross-functional teams to ensure compliance with industry standards... 
    Senior

    The Fountain Group

    Pleasanton, CA
    3 days ago
  • $139.9k - $280.6k

     ...seeks mission-focused candidates for critical roles in the Livermore, CA facility. Successful applicants will develop tactics, drive engineering projects, and collaborate across disciplines to ensure mission success. Ideal candidates have a Bachelor's degree, strong... 
    Senior
    Flexible hours

    Accreditation Council for Graduate Medical Education

    Livermore, CA
    3 days ago
  • $156k - $196k

     ...Point of Sale systems and other applications. As a Sr. Software Engineer in Test, you will play a crucial role in the success of our...  ...scale in the areas of system performance, scalability, latency, reliability and security. Strong testing experience with cloud native/... 
    Senior
    Temporary work
    Work experience placement
    Work at office
    Shift work
    3 days per week

    BlackLine

    Pleasanton, CA
    2 days ago
  • $139.9k - $280.6k

    Job Overview The Telemetry (TM) team at Sandia National Laboratories seeks an experienced R&D Mechanical Engineer to serve as a Mechanical Lead. The role involves design, development, testing, and qualification of telemetry products across the ND portfolio, providing technical... 
    Senior

    Accreditation Council for Graduate Medical Education

    Livermore, CA
    2 days ago
  • $156k - $196k

     ...skillset that will accelerate their careers. Work, Play and Grow at BlackLine! Make Your Mark: As a Sr. Software Engineer, you will play a crucial role in delivering high quality releases to our customers by designing, developing, troubleshooting, maintaining... 
    Senior
    Temporary work
    Work experience placement
    Work at office
    Shift work
    3 days per week

    BlackLine

    Pleasanton, CA
    4 days ago
  •  ...deployment, proficiency working on CI/CD pipelines, and with well-developed organizational, analytical and problem-solving skills. The Senior Developer must have solid back-end Java, Docker, Kubernetes, as well as Angular experience with version 9 and newer, (the team is... 
    Senior
    Work experience placement

    Perfict Global, Inc.

    Pleasanton, CA
    3 days ago
  • A global technology consulting firm is seeking a Senior Product Manager to lead internal platforms for software development. You will...  ...role is crucial for optimizing development processes and impacting organizational engineering excellence. #J-18808-Ljbffr Ampcus, Inc
    Senior

    Ampcus, Inc

    Pleasanton, CA
    4 days ago
  • $108.5k - $135.6k

    Position Summary We are seeking a Senior Reliability Engineer for an onsite position in El Segundo, CA, to evaluate, predict, and enhance product reliability through statistical analysis, reliability modeling, and testing. The role involves life data analysis, reliability... 
    Senior
    Work experience placement

    Seeds Renewables

    Livermore, CA
    2 days ago
  • $190.1k - $285.1k

     ...layer, delivering the scalability, reliability, and efficiency needed to support Workday...  ...the Role We are seeking a Senior Software Development Engineer with a passion for architectural...  ...Careers. Please be aware of sites that may ask for you to input your... 
    Senior
    Work experience placement
    Work at office
    Remote work
    Worldwide
    Home office
    Flexible hours

    Workday

    Pleasanton, CA
    2 days ago
  •  ...Job Title : Senior Agentic AI Developer Location: This is a remote role in the US...  ...looking for a highly experienced Agentic AI Engineer with 7+ years of experience to design,...  ...impact through automation and AI • Improve system reliability, reasoning, and performance
    Senior
    Remote work

    BayOne Solutions

    Pleasanton, CA
    1 day ago
  • $142.11k - $186.06k

     ...exploration, and telecommunications. Our team of engineers, scientists, software developers, and...  ..., and telecommunications. As a Senior Rust Software Engineer, you will serve as...  ...Architecture: Lead the design and development of reliable, high-performance, production-quality... 
    Senior
    Permanent employment
    Immediate start
    Flexible hours

    Vector Atomic

    Pleasanton, CA
    4 days ago
  • Workday, Inc. is seeking a Senior Software Development Engineer for their Pleasanton, CA location. In this role, you'll lead the design and implementation of robust APIs, focusing on performance and observability. With a minimum of 8 years of software engineering experience... 
    Senior
    Remote job

    Workday, Inc.

    Pleasanton, CA
    5 days ago
  • HR Tech Job is seeking a Senior Product Manager to join their Developer Experience team in Pleasanton, California. In this role, you...  ...tools and platforms. You'll collaborate closely with engineering teams and customers to refine product features and ensure high... 
    Senior
    Remote work

    HR Tech Job

    Pleasanton, CA
    2 days ago
  • $105.3k - $136.92k

    Blackhawk Network seeks a Software Engineer II in Pleasanton, CA, responsible for building high-scaled payment network components. Candidates should have over 4 years of experience in Java and a degree in Computer Science. The role includes mentoring responsibilities and... 
    Senior

    Fyrfly

    Pleasanton, CA
    2 days ago
  • $176k - $264k

    Workday, Inc. is seeking a Senior Software Development Engineer to lead the design and implementation of high-performance, low-latency APIs. You will play a crucial role in shaping API architecture and standards, collaborating with cross-functional teams to drive product... 
    Senior

    Workday

    Pleasanton, CA
    5 days ago
  • Blackhawk Network is seeking a Senior Software Engineer to build world-class payment applications. This hybrid role offers flexibility with in-office collaboration on Tuesdays and Wednesdays at our Pleasanton headquarters. The ideal candidate has strong Java experience... 
    Senior
    Work at office
    Worldwide

    Fyrfly

    Pleasanton, CA
    3 days ago
  • Workday in Pleasanton is seeking a Senior/Principal Machine Learning Engineer to design and build core ML systems for AI agents. This role involves close collaboration with teams handling product management, data science, and software engineering. The ideal candidate boasts... 
    Senior
    Flexible hours

    HR Tech Job

    Pleasanton, CA
    1 day ago
  • $130k - $163k

     ...Grow at BlackLine! Make Your Mark As a Senior AI Developer on BlackLine’s Corporate AI...  ..., not just prototypes—covering reliability, maintainability, and scalability. Experience...  ...Pinecone, or Weaviate. Advanced prompt engineering skills and a deep understanding of how... 
    Senior
    Temporary work
    For contractors
    Work at office
    Shift work
    3 days per week

    BlackLine

    Pleasanton, CA
    2 days ago
  • A leading technology provider in Livermore, California is seeking a Senior Principal Software Development Engineer responsible for designing and testing complex software systems. The ideal candidate has over 7 years of experience in software development, with expertise... 
    Senior

    FormFactor

    Livermore, CA
    4 days ago
  • $86.8k - $198k

     ...Job Number: R0240024 Application Developer, Senior The Opportunity: At a certain point, experience-based system design can start...  ...our total benefits by visiting the Resource page on our Careers site and reviewing Our Employee Benefits page. Salary at Booz... 
    Senior
    Full time
    Contract work
    Part time
    Work at office
    Local area
    Remote work

    Booz Allen Hamilton

    Dublin, CA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Site Reliability Engineer. Be the first to apply!