Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Site Reliability Engineer

National Oilwell Varco

As a Site Reliability Engineer, you will be responsible for: Operational Excellence & Incident Management Maintain and monitor production systems for availability, latency, and performance. Lead incident response efforts, including communication, resolution, and postmortem documentation. Design and implement health checks, alerting systems, and automated remediation workflows. Drive root cause analysis and implement permanent resolutions for recurring issues. Observability & Insights Set up and maintain full observability stacks (logging, metrics, tracing) using tools like Prometheus, Grafana, Datadog, OpenTelemetry, or ELK. Analyze telemetry and logs to identify trends, anomalies, and opportunities for improvement. Conduct post-incident reviews and use insights to inform future engineering investments. Performance & Systems Optimization Tune and optimize distributed systems, including AKKA.NET actors, for performance and resource efficiency. Work with developers to evolve architecture and improve system throughput, latency, and stability. Optimize PostgreSQL performance, queries, and maintenance strategies. CI/CD & Automation Design and maintain modern CI/CD pipelines using GitHub Actions, Azure Pipelines, or GitLab CI. Automate deployment, testing, and rollback processes to reduce friction and increase deployment frequency. Standardize infrastructure as code practices across environments. We’d love to talk to you if you have: 5+ years of experience in SRE, DevOps, or Infrastructure Engineering roles. Expertise in Kubernetes and container orchestration at scale. Strong experience with AKKA.NET or similar actor-based frameworks. Proficiency with scripting and automation (Bash, PowerShell, Python). Experience with observability tools (Phobos,Datadog, Prometheus, Grafana, OpenTelemetry, ELK). Hands-on experience with cloud platforms (AWS, Azure, or GCP). Strong PostgreSQL knowledge—performance tuning, query optimization, maintenance. Proven ability to lead incident management and drive postmortem processes. A builder’s mindset with high standards for operational excellence and technical ownership. Preferred Tools & Ecosystem Experience CI/CD: GitHub Actions, Azure Pipelines, GitLab CI Infrastructure: Kubernetes, Docker, Terraform Monitoring: Phobos (AKKA.NET), Datadog, Prometheus Source Control: GitHub, GitLab, Azure DevOps Programming: C#, Python, Bash, PowerShell #J-18808-Ljbffr National Oilwell Varco

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Site Reliability Engineer in Houston, TX vacancy
  •  ...Site Reliability Engineer As a Site Reliability Engineer, you will be responsible for: Operational Excellence & Incident Management - Maintain and monitor production systems for availability, latency, and performance. - Lead incident response efforts, including... 
    Suggested
    Permanent employment

    NOV

    Houston, TX
    4 days ago
  •  ...Senior Site Reliability Engineer The Senior Site Reliability Engineer is responsible for improving the reliability, availability, scalability, and operational excellence of our critical infrastructure platforms and services. This role partners closely with Engineering... 
    Suggested
    Work at office
    Local area

    Castleton Commodities International

    Houston, TX
    3 days ago
  •  ...strategic and analytical operator to shape how product initiatives are brought to life and measured across Tekmetric. Senior Software Engineer Tekmetric is looking for a Senior Software Engineer to take part in the full development lifecycle, from ideation and... 
    Suggested
    Remote work
    Work from home

    Tekmetric

    Houston, TX
    1 day ago
  • **Site Reliability Engineer II****About** **PROS:**PROS, Inc. is the leading offer management provider to the airline industry, helping airlines deliver seamless retail experiences designed to maximize revenue and margin growth. Powered by AI, the PROS Platform enables... 
    Suggested
    Flexible hours

    PROS Holdings, Inc.

    Houston, TX
    1 day ago
  • National Oilwell Varco in Houston is seeking a Site Reliability Engineer to maintain and optimize production systems while leading incident response efforts. The successful candidate will have over 5 years of experience, preferably in SRE, DevOps, or Infrastructure Engineering... 
    Suggested

    National Oilwell Varco

    Houston, TX
    2 days ago
  •  ...ownership. Job Responsibilities Demonstrates expertise in site reliability principles and demonstrates an understanding of the fine balance...  ..., and Skills Formal training or certification on software engineering concepts and 5+ years applied experience. Advanced... 

    Fairygodboss

    Houston, TX
    2 days ago
  • Position Title: DEVOPS & SRE ENGINEER Location: HOUSTON, TX FLSA Class: EXEMPT Responsible to: Director of Software Engineering Position Summary: DevOps / Site Reliability Engineer to implement and evolve the infrastructure, deployment pipelines, and reliability posture... 
    Local area

    VoltaGrid LLC.

    Houston, TX
    5 days ago
  •  ...and shape the future of technology at a globally recognized firm, driven by pride in ownership. As a Senior Manager of Site Reliability Engineering at JPMorgan Chase within the Infrastructure Platforms-Data Protection and Recovery organization, you are the non-functional... 

    Koitecc Solutions

    Houston, TX
    4 days ago
  • A leading technology solutions provider in Houston is seeking professionals for various roles, including Account Executives and Customer Success Managers. This company fosters a collaborative and innovative environment, encouraging employees to make an impact while providing...

    Tekmetric

    Houston, TX
    2 days ago
  •  ...Please extend your support for this role. Local candidate will get 1st preference. Job Title: SRE Engineer Location: Houston, TX and Jersey City, NJ - 3 Days Onsite Role FTE role with Mphasis Client: Mphasis H1B transfer will work... 
    Work experience placement
    H1b
    Local area

    Texas State Library and Archives Commision

    Houston, TX
    23 hours ago
  • $140.88k - $153.75k

     ...data, and software businesses. WHERE YOU’LL FIT WITHIN THE TEAM Senior DevOps / SRE Engineers own the CI/CD pipelines, GitOps infrastructure, Kubernetes operations, and reliability engineering practices that keep the PE platform running at production quality on Microsoft... 
    Full time
    Work at office
    Local area
    1 day per week

    Bain & Company

    Houston, TX
    1 day ago
  •  ...Description Salary: JOB SUMMARY The Lead/Principal Mechanical Engineer will play a key role in planning, coordinating and managing...  ...provide direction to drafting and engineering to develop PFDs, P&IDs, site plans, and piping drawings in order to create a construction set... 
    For contractors

    EnSiteUSA

    Houston, TX
    7 days ago
  •  ...Technical Release Train Engineer (RTE) The driving force behind our success has always been our people. We foster a culture of engineering excellence, continuous improvement, and collaboration across product management, architecture, quality, and delivery teams. Our... 

    Aspen Technology

    Houston, TX
    2 days ago
  • $101.2k - $161.6k

     ...heart of our success. We've earned national recognition, including being named a Great Place to Work and consistently ranking on Engineering News Record's Top 500 Design Firms in the United States and we're still growing! Summary We are seeking a Lead Mechanical Engineer... 
    Work experience placement
    Work at office
    Local area

    HR Green

    Houston, TX
    16 days ago
  • $110k - $115k

     ...Reliability Engineer A Reliability Engineer opportunity is available in Houston, Texas, with a confidential employer in the chemical industry. This full-time, direct-hire role offers a competitive salary range of $110,000 to $115,000 annually. You will drive... 
    Hourly pay
    Full time
    Work experience placement
    Remote work

    Top Engineer

    Houston, TX
    4 days ago
  •  ...R10094732 Reliability Engineer (Open) Location Houston, TX (HO) - NAM Corporate Air Liquide Large Industries provides our customers with...  ...benchmarks, equipment, materials and man-hours) Provide Site assistance to reply to Site queries ; can be required to go on... 

    Air Liquide

    Houston, TX
    11 hours ago
  •  ...Company is actively recruiting for a Reliability Engineer who directly reports to Reliability Engineering and Project Manager and will interface...  ...appropriate company and contractor resources at the Houston site, support the development of new and improve existing plant reliability... 
    For contractors

    NPAworldwide

    Houston, TX
    4 days ago
  •  ...position you will be recognized as the reliability subject matter expert and be responsible...  ...projects with RAM analysis, and partners with site teams to develop and execute reliability...  ...model. * Act as a liaison with engineers, operations, specialists, and other cross... 
    Work experience placement
    Work at office
    Local area
    Remote work
    Home office
    2 days per week

    LyondellBasell Industries

    Houston, TX
    7 days ago
  •  ...the transformation of low-Earth orbit into a global space marketplace. Our mission-driven team is seeking a bold and dynamic Reliability Engineer who is fueled by high ownership, execution horsepower, growth mindset, and driven to understand our world, science/... 
    Permanent employment
    Work at office
    Weekend work
    Afternoon shift

    Axiom Space

    Houston, TX
    3 days ago
  •  ...Associate or equivalent degree (minimum two-year) in Electronics or related field required. Bachelor's degree in Electronics Engineering or related field preferred. Certificate in Root Cause Analysis preferred. Experience Minimum of 3 years of experience... 
    Casual work
    Work at office
    Local area
    Flexible hours

    Grandir UK

    Houston, TX
    1 day ago
  • VoltaGrid LLC. in Houston, TX is seeking a DevOps & SRE Engineer to implement and evolve infrastructure and deployment pipelines. Candidates should have strong experience with AWS, Kubernetes, Docker, and Terraform, alongside scripting skills in Bash, Python, or Go. The... 

    VoltaGrid LLC.

    Houston, TX
    5 days ago
  • Why a Great Opportunity Be part of a  newly created and highly visible position  as our client considers expanding its listing in the United States and the requirements for compliance with the Sarbanes Oxley Act. Hybrid role - in the office 3 days a week in Richmond...
    Work at office
    3 days per week

    NPAworldwide

    Houston, TX
    3 days ago
  •  ...diverse market connectivity to producers and end-users who need reliable sources of natural gas for power generation, home heating or...  ...found online at We are currently looking for a Lead/Sr. Engineer System Planning for our Houston, TX office. POSITION... 
    Work at office

    Boardwalk Pipelines

    Houston, TX
    2 days ago
  •  ...Job Description Job Description Structural Integrity Associates, Inc. (SIA) is currently looking for a Mechanical Engineering Consultant to join our Process Pressure Vessels team. This role would participate as part of a dynamic team in the engineering analysis, condition... 
    Temporary work
    Work at office
    Remote work
    Flexible hours

    SI Solutions, LLC

    Houston, TX
    13 days ago
  •  ...production teams and seamless operations on the factory floor. The ideal candidate will work directly with manufacturing, IT, and engineering teams to maintain system performance and provide end-user support to ensure operational continuity. Key Responsibilities: 1.... 

    Foxconn Technology Group

    Houston, TX
    3 days ago
  • Scan Ninja Inc. is seeking a skilled infrastructure engineer based in Houston, Texas, to enhance their platform's reliability and security. In this role, you will design and maintain cloud infrastructure, automate deployment workflows, and implement crucial monitoring tools... 

    Scan Ninja Inc.

    Houston, TX
    1 day ago
  •  ...Job Title: Maintenance & Reliability Engineer (Rotating Equipment Focus) Location: Houston, TX; Gonzales, LA; Delaware City, DE; Cincinnati...  ...expert for rotating machinery across multiple manufacturing sites, driving reliability improvements, asset effectiveness... 
    Permanent employment
    Full time
    Contract work
    For contractors
    Worldwide

    NES Fircroft

    Houston, TX
    2 days ago
  • $105k - $160k

     ...Your Job Flint Hills Resources is seeking a self-motivated Electrical Reliability Engineer to join our Pipelines and Terminals ICE (Instrumentation, Control, Electrical) Engineering team. In this role, you will help advance electrical and instrumentation reliability... 
    For contractors
    Work experience placement
    Flexible hours
    Shift work

    Flint Hills Inc

    Houston, TX
    1 day ago
  •  ...embedding AI into the core of consulting delivery , at scale, across industries and geographies. We're hiring AI Solutions Engineers to join our global AI Accelerator team - a small, high-impact group building practical, deployable solutions that consulting teams... 

    The ERM International Group Limited

    Houston, TX
    23 hours ago
  •  ...Senior Systems Software Engineer This role has been designed as 'Hybrid' with an expectation that you will work on average 2 days...  ...realistic scenarios to uncover edge cases, performance limits, and reliability opportunities Debug and resolve challenging issues that... 
    Work experience placement
    Work at office
    Local area
    Immediate start
    2 days per week

    Hewlett Packard Enterprise

    Houston, TX
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Site Reliability Engineer. Be the first to apply!