Product Reliability Engineer [Remote]
jobgether
- Remote job
This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for a Product Reliability Engineer based in United States.
This role sits at the critical intersection of software engineering, customer reliability, and production operations for infrastructure software deployed in complex, real-world environments. You will ensure that production systems running in customer-owned Kubernetes environments remain stable, observable, and continuously improvable. The work goes beyond incident response, focusing on eliminating entire categories of failures through better tooling, automation, and product design. You will partner closely with customers, engineers, and solution teams to investigate complex issues, drive root-cause analysis, and translate findings into long-term system improvements. This is a highly hands-on role where debugging, automation, and product thinking come together to define reliability as a core product capability. Your work will directly shape how enterprise customers experience stability, performance, and trust in the platform.
Accountabilities:
- Partner with customers and internal teams to investigate and resolve complex production issues across Kubernetes-based on-prem and hybrid deployments.
- Lead deep root-cause analysis for escalations, reproduce issues, and collaborate with engineering teams to implement durable fixes.
- Build and maintain reliability tooling such as diagnostics systems, health checks, support bundles, and environment validation utilities.
- Own and improve test automation frameworks, focusing on CI stability, reducing flaky tests, and strengthening integration and end-to-end coverage.
- Define and maintain performance baselines, regression testing frameworks, and reliability gates to prevent production regressions.
- Improve installation, upgrade, and deployment reliability by identifying recurring failure patterns and building preventive solutions.
- Develop production-grade internal tools and product enhancements using Python, Go, or Rust to strengthen observability and system resilience.
- Establish a closed feedback loop from customer issues to engineering improvements in testing, observability, documentation, and defaults.
Requirements:
- 4–7 years of experience in production engineering, SRE, platform engineering, or similar roles focused on reliability and distributed systems.
- Strong software engineering fundamentals, including debugging, testing, system design, and production-grade coding practices.
- Hands-on Kubernetes expertise, including troubleshooting workloads, networking, storage, RBAC, and multi-environment deployments.
- Strong experience with observability tools and techniques, including logs, metrics, and tracing for distributed system debugging.
- Proficiency in at least one programming language such as Python, Go, or Rust, with experience building internal tools or production systems.
- Strong analytical and communication skills, with the ability to break down complex incidents into clear root causes and actionable recommendations.
- Experience working in cross-functional environments with engineering, product, and customer-facing teams in fast-moving contexts.
- Self-directed and comfortable working in remote-first environments with shifting priorities driven by customer needs and escalations.
Benefits:
- Competitive compensation package aligned with experience and seniority
- Fully remote work environment across Canada and the United States
- Opportunity to work on real-world production infrastructure used in complex enterprise environments
- Strong technical ownership with high impact on product reliability and customer experience
- Collaboration with experienced engineers in infrastructure, automation, and platform engineering
- Learning and growth opportunities in Kubernetes, observability, and large-scale distributed systems
- Inclusive and diverse team culture focused on collaboration and continuous improvement
- Exposure to open-source-driven infrastructure innovation
How Jobgether works:
We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.
We appreciate your interest and wish you the best!
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.
#LI-CL1
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
- ...Reliability Product Approval Engineer As part of the Thermo Fisher Scientific team, you'll discover meaningful work that makes a positive impact on a global scale. Join our colleagues in bringing our Mission to life every single day to enable our customers to make the...Suggested
- ...At AMD, our mission is to build great products that accelerate next-generation computing... .... THE ROLE: Join a global product reliability team that drives silicon and package qualifications... ...a highly visible role within the AMD engineering team and is responsible for defining...Suggested
- Sms Infocomm Corporation is seeking an Engineer to provide essential technical support in production and to conduct failure analysis on customer products. The role requires proficiency in multiple operating systems and analytical tools, fostering a positive work environment...Suggested
$110.5k - $152k
A leading materials engineering company is hiring a Product Quality & Reliability Engineer III in Santa Clara, CA. This full-time role focuses on developing and testing quality standards to meet customer expectations and conducting evaluations of product reliability. Candidates...SuggestedFull time$133.5k - $183.5k
Applied Materials, Inc. is seeking a Product Quality and Reliability Engineer IV in Santa Clara, CA. The role involves leading quality engineering projects and conducting reliability analyses in a cross-functional environment. The ideal candidate has a Bachelor's degree...SuggestedFull time- Applied Materials, Inc. is seeking a Product Quality and Reliability Engineer in Santa Clara, California. This highly technical role involves leading quality engineering projects and ensuring Design For Reliability (DfR) methods are integrated into new product developments...
- ...company in Richardson, Texas, is seeking a High Performance Analog Product Quality Engineer. The role involves supporting product development, enhancing product quality, and ensuring compliance with reliability standards. Successful candidates will have a BS in Engineering...
$125.8k - $170.2k
Senior Product Quality Engineer Mountain View, CA Apply for this job About us: Aeva is building the next generation of sensing and perception for autonomous vehicles and beyond. With its unique ability to measure instantaneous velocity for each pixel, long-range performance...Contract workWork experience placementFlexible hours$133.5k - $183.5k
## Product Quality and Reliability Engineer IVApplylocations: Santa Clara,CAtime type: Full timeposted on: Posted Todayjob requisition id: R2618641**Who We Are**Applied Materials is a global leader in materials engineering solutions used to produce virtually every new...Full timeWork experience placementWorldwideRelocation- ...At AMD, our mission is to build great products that accelerate next‑generation computing... ...visible role drives product Quality & Reliability related data analysis presenting to higher... ...quality, reliability, or product engineering, preferably in semiconductor industry....
$110.5k - $152k
Product Quality & Reliability Engineer III - (E3) page is loaded## Product Quality & Reliability Engineer III - (E3)locations: Santa Clara,CAtime type: Full timeposted on: Posted Yesterdayjob requisition id: R2616799**Who We Are**Applied Materials is a global leader in...Full timeRelocation- Applied Materials, Inc. is looking for a Product Quality & Reliability Engineer III located in Santa Clara, CA. In this role, you will lead quality and reliability activities throughout the product lifecycle, ensuring products meet quality and reliability standards. You...
$110.5k - $152k
Product Quality & Reliability Engineer III page is loaded## Product Quality & Reliability Engineer IIIlocations: Santa Clara,CAtime type: Full timeposted on: Posted Todayjob requisition id: R2620374**Who We Are**Applied Materials is a global leader in materials engineering...Full timeRelocation- Hyster-Yale Materials Handling, Inc. in Greenville, NC is hiring a Product Support Solutions Engineer I-II. This role involves analyzing product performance, resolving field issues, and contributing to product development. Candidates should have a Bachelor's in Engineering...
- Alumni Ventures is seeking a Product Reliability Engineer to enhance the quality and reliability of products and processes. As part of our dynamic team, you will optimize production methods, lead quality control initiatives, and engage with cross-functional teams to align...
$124.4k - $138k
...Basic Qualifications Bachelor's degree in Systems Engineering, or a related Science, Engineering or Mathematics field, plus a minimum... ...Bachelor's degree plus 8 years of experience in product ownership, product management, or business analysis in a technology...Remote workFlexible hours- Autodesk, Inc. is looking for a leader to transform customer experience into systemic product and service improvements. This role involves advocating for AECO customers, influencing product prioritization, and establishing feedback loops to drive measurable outcomes. Candidates...
$136k - $218.5k
...of design, architecture, marketing, and productization—owning the journey from the... ...Embedded markets. As a Silicon Speed Features Engineer, you will co‑design system‑level speed... ..., hardware, firmware/software, process/reliability, and operations teams to co‑design system...$106k - $170.2k
...health for humanity. Learn more at jnj.com. Position Overview Design for Reliability (DfR) Engineer - Robotics & Digital R&D Team, Santa Clara, CA. Join our team to embed reliability thinking into product design, develop specifications for surgical robotic systems, and...Temporary workLocal area$140k - $180k
Senior Design Reliability Engineer Department: Propulsion Employment Type: Full Time Location: Redondo Beach Reporting To: Kevin Miller, SVP... ..., margining, configuration, and release processe Ensure product integrity by holding the line on quality standards, extreme-...Permanent employmentFull timeFlexible hours$146.9k - $183.6k
...outdoors and a desire to protect it for future generations. Role Summary Reliability is at the core of Electric Adventure Vehicles. We are looking for engineers who have experience in bringing products from the early concept stages to production by working closely with...Full timeContract workTemporary workPart timeLocal areaShift work$140k - $180k
Impulsespace is looking for a Senior Design Reliability Engineer in Redondo Beach. You will standardize and implement design processes, lead cross-functional teams, and ensure product integrity through rigorous quality standards. The ideal candidate holds a Bachelor’s degree...$192k - $264k
Who We Are Applied Materials is a global leader in materials engineering solutions used to produce virtually every new chip and advanced display in the world. We design, build and service cutting-edge equipment that helps our customers manufacture display and semiconductor...Full timeRelocation$175k - $200k
...fundamentally different class of spacecraft. Engineered to survive the harshest radiation... ...requirements and deliver exceptionally reliable flight systems. You will own and continuously... ...gatekeeper during development. As production begins, you will guide a disciplined...Permanent employmentShift work- ...credit funds and capital markets counterparties. We’re building trust in this system of credit. We are looking for a Product Reliability Engineer to join our team! This is an opportunity to have a big impact by building out Setpoint’s reliability function from...Remote jobCurrently hiringLocal areaFlexible hours
- ...solutions for an evolving grid. RSMC is a leader in the engineering, design, and manufacturing of critical componentry for... ...long-standing partnership with utilities companies, the reliability of our products, and the technical and hands-on capabilities of our team....Local area
$45 - $55 per hour
Selectek is looking for a Fiber Network Engineer/Coordinator based in Tucker, GA. This in-office role involves key responsibilities in the design, planning, and coordination of a middle-mile fiber network that supports critical utility operations. The ideal candidate will...Hourly payWork at officeMonday to Friday- ...Job Description Job Description The Staff Design Engineer creates, analyzes and documents new product designs from the requirements provided. In addition, the Staff Design Engineer supports the development and sustainment of bill of materials, drawings, specifications...Work experience placementWork at office
- ...Job Description Job Description The Design Engineer designs, analyzes, and documents new or existing products from the requirements provided. Creates and sustains specifications using engineering analysis and judgment. Oversees the development and testing of engineering...Work at office
$88.93k - $113.73k
...compliance with regulatory requirements and ATS policies and procedures. Partners with internal/external customer for engineered solutions to improve reliability and throughput. Identifies opportunities for Capital Expenditures for equipment replacement with supervision (...Full timeWork at office
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Product Reliability Engineer [Remote]. Be the first to apply!
- data center design engineer United States
- composite design engineer United States
- senior design verification engineer United States
- embedded design engineer United States
- product engineering technician United States
- product compliance engineer United States
- director of product engineering United States
- ic design engineer United States
- rf microwave design engineer United States
- design quality engineer United States



