Site Reliability Engineer
National Oilwell Varco
As a Site Reliability Engineer, you will be responsible for: Operational Excellence & Incident Management Maintain and monitor production systems for availability, latency, and performance. Lead incident response efforts, including communication, resolution, and postmortem documentation. Design and implement health checks, alerting systems, and automated remediation workflows. Drive root cause analysis and implement permanent resolutions for recurring issues. Observability & Insights Set up and maintain full observability stacks (logging, metrics, tracing) using tools like Prometheus, Grafana, Datadog, OpenTelemetry, or ELK. Analyze telemetry and logs to identify trends, anomalies, and opportunities for improvement. Conduct post-incident reviews and use insights to inform future engineering investments. Performance & Systems Optimization Tune and optimize distributed systems, including AKKA.NET actors, for performance and resource efficiency. Work with developers to evolve architecture and improve system throughput, latency, and stability. Optimize PostgreSQL performance, queries, and maintenance strategies. CI/CD & Automation Design and maintain modern CI/CD pipelines using GitHub Actions, Azure Pipelines, or GitLab CI. Automate deployment, testing, and rollback processes to reduce friction and increase deployment frequency. Standardize infrastructure as code practices across environments. We’d love to talk to you if you have: 5+ years of experience in SRE, DevOps, or Infrastructure Engineering roles. Expertise in Kubernetes and container orchestration at scale. Strong experience with AKKA.NET or similar actor-based frameworks. Proficiency with scripting and automation (Bash, PowerShell, Python). Experience with observability tools (Phobos,Datadog, Prometheus, Grafana, OpenTelemetry, ELK). Hands-on experience with cloud platforms (AWS, Azure, or GCP). Strong PostgreSQL knowledge—performance tuning, query optimization, maintenance. Proven ability to lead incident management and drive postmortem processes. A builder’s mindset with high standards for operational excellence and technical ownership. Preferred Tools & Ecosystem Experience CI/CD: GitHub Actions, Azure Pipelines, GitLab CI Infrastructure: Kubernetes, Docker, Terraform Monitoring: Phobos (AKKA.NET), Datadog, Prometheus Source Control: GitHub, GitLab, Azure DevOps Programming: C#, Python, Bash, PowerShell #J-18808-Ljbffr National Oilwell Varco
- ...Site Reliability Engineer As a Site Reliability Engineer, you will be responsible for: Operational Excellence & Incident Management - Maintain and monitor production systems for availability, latency, and performance. - Lead incident response efforts, including...SuggestedPermanent employment
- ...Senior Site Reliability Engineer The Senior Site Reliability Engineer is responsible for improving the reliability, availability, scalability, and operational excellence of our critical infrastructure platforms and services. This role partners closely with Engineering...SuggestedWork at officeLocal area
- ...strategic and analytical operator to shape how product initiatives are brought to life and measured across Tekmetric. Senior Software Engineer Tekmetric is looking for a Senior Software Engineer to take part in the full development lifecycle, from ideation and...SuggestedRemote workWork from home
- **Site Reliability Engineer II****About** **PROS:**PROS, Inc. is the leading offer management provider to the airline industry, helping airlines deliver seamless retail experiences designed to maximize revenue and margin growth. Powered by AI, the PROS Platform enables...SuggestedFlexible hours
- National Oilwell Varco in Houston is seeking a Site Reliability Engineer to maintain and optimize production systems while leading incident response efforts. The successful candidate will have over 5 years of experience, preferably in SRE, DevOps, or Infrastructure Engineering...Suggested
- ...ownership. Job Responsibilities Demonstrates expertise in site reliability principles and demonstrates an understanding of the fine balance... ..., and Skills Formal training or certification on software engineering concepts and 5+ years applied experience. Advanced...
- Position Title: DEVOPS & SRE ENGINEER Location: HOUSTON, TX FLSA Class: EXEMPT Responsible to: Director of Software Engineering Position Summary: DevOps / Site Reliability Engineer to implement and evolve the infrastructure, deployment pipelines, and reliability posture...Local area
- ...and shape the future of technology at a globally recognized firm, driven by pride in ownership. As a Senior Manager of Site Reliability Engineering at JPMorgan Chase within the Infrastructure Platforms-Data Protection and Recovery organization, you are the non-functional...
- A leading technology solutions provider in Houston is seeking professionals for various roles, including Account Executives and Customer Success Managers. This company fosters a collaborative and innovative environment, encouraging employees to make an impact while providing...
- ...Please extend your support for this role. Local candidate will get 1st preference. Job Title: SRE Engineer Location: Houston, TX and Jersey City, NJ - 3 Days Onsite Role FTE role with Mphasis Client: Mphasis H1B transfer will work...Work experience placementH1bLocal area
$140.88k - $153.75k
...data, and software businesses. WHERE YOU’LL FIT WITHIN THE TEAM Senior DevOps / SRE Engineers own the CI/CD pipelines, GitOps infrastructure, Kubernetes operations, and reliability engineering practices that keep the PE platform running at production quality on Microsoft...Full timeWork at officeLocal area1 day per week- ...Description Salary: JOB SUMMARY The Lead/Principal Mechanical Engineer will play a key role in planning, coordinating and managing... ...provide direction to drafting and engineering to develop PFDs, P&IDs, site plans, and piping drawings in order to create a construction set...For contractors
- ...Technical Release Train Engineer (RTE) The driving force behind our success has always been our people. We foster a culture of engineering excellence, continuous improvement, and collaboration across product management, architecture, quality, and delivery teams. Our...
$101.2k - $161.6k
...heart of our success. We've earned national recognition, including being named a Great Place to Work and consistently ranking on Engineering News Record's Top 500 Design Firms in the United States and we're still growing! Summary We are seeking a Lead Mechanical Engineer...Work experience placementWork at officeLocal area$110k - $115k
...Reliability Engineer A Reliability Engineer opportunity is available in Houston, Texas, with a confidential employer in the chemical industry. This full-time, direct-hire role offers a competitive salary range of $110,000 to $115,000 annually. You will drive...Hourly payFull timeWork experience placementRemote work- ...R10094732 Reliability Engineer (Open) Location Houston, TX (HO) - NAM Corporate Air Liquide Large Industries provides our customers with... ...benchmarks, equipment, materials and man-hours) Provide Site assistance to reply to Site queries ; can be required to go on...
- ...Company is actively recruiting for a Reliability Engineer who directly reports to Reliability Engineering and Project Manager and will interface... ...appropriate company and contractor resources at the Houston site, support the development of new and improve existing plant reliability...For contractors
- ...position you will be recognized as the reliability subject matter expert and be responsible... ...projects with RAM analysis, and partners with site teams to develop and execute reliability... ...model. * Act as a liaison with engineers, operations, specialists, and other cross...Work experience placementWork at officeLocal areaRemote workHome office2 days per week
- ...the transformation of low-Earth orbit into a global space marketplace. Our mission-driven team is seeking a bold and dynamic Reliability Engineer who is fueled by high ownership, execution horsepower, growth mindset, and driven to understand our world, science/...Permanent employmentWork at officeWeekend workAfternoon shift
- ...Associate or equivalent degree (minimum two-year) in Electronics or related field required. Bachelor's degree in Electronics Engineering or related field preferred. Certificate in Root Cause Analysis preferred. Experience Minimum of 3 years of experience...Casual workWork at officeLocal areaFlexible hours
- VoltaGrid LLC. in Houston, TX is seeking a DevOps & SRE Engineer to implement and evolve infrastructure and deployment pipelines. Candidates should have strong experience with AWS, Kubernetes, Docker, and Terraform, alongside scripting skills in Bash, Python, or Go. The...
- Why a Great Opportunity Be part of a newly created and highly visible position as our client considers expanding its listing in the United States and the requirements for compliance with the Sarbanes Oxley Act. Hybrid role - in the office 3 days a week in Richmond...Work at office3 days per week
- ...diverse market connectivity to producers and end-users who need reliable sources of natural gas for power generation, home heating or... ...found online at We are currently looking for a Lead/Sr. Engineer System Planning for our Houston, TX office. POSITION...Work at office
- ...Job Description Job Description Structural Integrity Associates, Inc. (SIA) is currently looking for a Mechanical Engineering Consultant to join our Process Pressure Vessels team. This role would participate as part of a dynamic team in the engineering analysis, condition...Temporary workWork at officeRemote workFlexible hours
- ...production teams and seamless operations on the factory floor. The ideal candidate will work directly with manufacturing, IT, and engineering teams to maintain system performance and provide end-user support to ensure operational continuity. Key Responsibilities: 1....
- Scan Ninja Inc. is seeking a skilled infrastructure engineer based in Houston, Texas, to enhance their platform's reliability and security. In this role, you will design and maintain cloud infrastructure, automate deployment workflows, and implement crucial monitoring tools...
- ...Job Title: Maintenance & Reliability Engineer (Rotating Equipment Focus) Location: Houston, TX; Gonzales, LA; Delaware City, DE; Cincinnati... ...expert for rotating machinery across multiple manufacturing sites, driving reliability improvements, asset effectiveness...Permanent employmentFull timeContract workFor contractorsWorldwide
$105k - $160k
...Your Job Flint Hills Resources is seeking a self-motivated Electrical Reliability Engineer to join our Pipelines and Terminals ICE (Instrumentation, Control, Electrical) Engineering team. In this role, you will help advance electrical and instrumentation reliability...For contractorsWork experience placementFlexible hoursShift work- ...embedding AI into the core of consulting delivery , at scale, across industries and geographies. We're hiring AI Solutions Engineers to join our global AI Accelerator team - a small, high-impact group building practical, deployable solutions that consulting teams...
- ...Senior Systems Software Engineer This role has been designed as 'Hybrid' with an expectation that you will work on average 2 days... ...realistic scenarios to uncover edge cases, performance limits, and reliability opportunities Debug and resolve challenging issues that...Work experience placementWork at officeLocal areaImmediate start2 days per week
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Site Reliability Engineer. Be the first to apply!
- site reliability engineer Houston, TX
- on-site clinical research associate (traveling/remote) Houston, TX
- junior website developer Houston, TX
- IT site lead Houston, TX
- site leader Houston, TX
- site safety Houston, TX
- site recruiter Houston, TX
- on site coordinator Houston, TX
- site services specialist Houston, TX
- website coordinator Houston, TX


