Product Reliability Engineer [Remote]

Full-time

Jobgether

United States
  • Remote job

This position is listed on behalf of a partner company, which manages all applications and next steps. Our partner is looking for a Product Reliability Engineer based in the United States.

This role sits at the critical intersection of software engineering, customer reliability, and production operations for infrastructure software deployed in complex, real-world environments. You will ensure that production systems running in customer-owned Kubernetes environments remain stable, observable, and continuously improvable. The work goes beyond incident response, focusing on eliminating entire categories of failures through better tooling, automation, and product design. You will partner closely with customers, engineers, and solution teams to investigate complex issues, drive root-cause analysis, and translate findings into long-term system improvements. This is a highly hands-on role where debugging, automation, and product thinking come together to define reliability as a core product capability. Your work will directly shape how enterprise customers experience stability, performance, and trust in the platform.

Accountabilities:

  • Partner with customers and internal teams to investigate and resolve complex production issues across Kubernetes-based on-prem and hybrid deployments.
  • Lead deep root-cause analysis for escalations, reproduce issues, and collaborate with engineering teams to implement durable fixes.
  • Build and maintain reliability tooling such as diagnostics systems, health checks, support bundles, and environment validation utilities.
  • Own and improve test automation frameworks, focusing on CI stability, reducing flaky tests, and strengthening integration and end-to-end coverage.
  • Define and maintain performance baselines, regression testing frameworks, and reliability gates to prevent production regressions.
  • Improve installation, upgrade, and deployment reliability by identifying recurring failure patterns and building preventive solutions.
  • Develop production-grade internal tools and product enhancements using Python, Go, or Rust to strengthen observability and system resilience.
  • Establish a closed feedback loop from customer issues to engineering improvements in testing, observability, documentation, and defaults.

Requirements:

  • 4–7 years of experience in production engineering, SRE, platform engineering, or similar roles focused on reliability and distributed systems.
  • Strong software engineering fundamentals, including debugging, testing, system design, and production-grade coding practices.
  • Hands-on Kubernetes expertise, including troubleshooting workloads, networking, storage, RBAC, and multi-environment deployments.
  • Strong experience with observability tools and techniques, including logs, metrics, and tracing for distributed system debugging.
  • Proficiency in at least one programming language such as Python, Go, or Rust, with experience building internal tools or production systems.
  • Strong analytical and communication skills, with the ability to break down complex incidents into clear root causes and actionable recommendations.
  • Experience working in cross-functional environments with engineering, product, and customer-facing teams in fast-moving contexts.
  • Self-directed and comfortable working in remote-first environments with shifting priorities driven by customer needs and escalations.

Benefits:

  • Competitive compensation package aligned with experience and seniority
  • Fully remote work environment across Canada and the United States
  • Opportunity to work on real-world production infrastructure used in complex enterprise environments
  • Strong technical ownership with high impact on product reliability and customer experience
  • Collaboration with experienced engineers in infrastructure, automation, and platform engineering
  • Learning and growth opportunities in Kubernetes, observability, and large-scale distributed systems
  • Inclusive and diverse team culture focused on collaboration and continuous improvement
  • Exposure to open-source-driven infrastructure innovation

How Jobgether works:

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.

We appreciate your interest and wish you the best!

Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Vacancy posted 5 days ago