Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Staff Reliability Engineer - AI & Hyperscale Server NPI and Mfg.

$105k - $140k

ZT Group Intl, Inc. dba ZT Systems

About the role Our reliability team evaluates, develops, designs and implements software and product reliability test regimens to ensure ZT products of the highest quality are delivered to our customers. We are looking for a passionate Reliability Engineer with exceptional knowledge and experience in developing and manufacturing scalable infrastructures, working with technologies used for building hyperscale cloud services. What you’ll do Apply Design for Reliability principles to ensure cloud hardware developed and delivered to data centers meets specified use‑conditions and stresses, and meets its design intent. Act as an internal consultant on all reliability matters and interface with program management, vendors, and design engineering on key reliability programs/issues; support software/script development needs of the reliability team. Create or revise reliability engineering guidelines to improve product field performance through design enhancements that meet reliability goals. Use performance evaluation and prediction principles to improve reliability and maintainability of cloud infrastructure servers. Identify, collect, analyze, and manage various types of data to minimize failures and improve product performance. Develop scripts that represent expected environment and operational conditions. Collaborate with other development functional teams and internal stakeholders regarding the application of Design for Reliability principles to ensure products meet customer expectations. What you’ll bring Minimum B.S. in Electrical Engineering, Computer Science/Engineering, or Software Development and 5+ years of relevant work experience (or MS degree and 3+ years). Knowledge of computer systems/hardware structure, switch/network interfaces. Knowledge and/or experience with programming languages such as Python or Unix (Bash and/or PowerShell). Knowledge of statistical and probability techniques and reliability modeling. Ability to communicate, collaborate and lead cross‑functionally to resolve issues, including those with customers. Preferred qualifications Fundamental knowledge of computer architecture, server architecture at the block level, and hardware‑firmware‑OS interactions. Working knowledge of PCBA (printed circuit board assembly) design, fabrication, and validation testing. Experience using tools such as ReliaSoft and JMP statistical software packages. Working knowledge of electronic components/devices and their failure modes and mechanisms. Knowledge of industry standards, IPC, JEDEC, Telcordia, and MIL‑STD. Compensation and benefits The typical base salary for this position is expected to be between $105,000 and $140,000 per year. Final base salary will be determined on an individual basis taking into consideration experience, skills, knowledge, education and/or certifications. Base salary is just one component of ZT Systems total rewards philosophy. Other rewards may include bonus, paid time off, generous 401(k) match, tuition reimbursement, wellbeing resources, and more. Equal Opportunity Statement ZT Group Int’l. is an Equal Opportunity Employer and prohibits discrimination and harassment of any kind. ZT Systems provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws. Certain positions may require U.S. citizenship or permanent residency status, as applicable. #J-18808-Ljbffr ZT Group Intl, Inc. dba ZT Systems

Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Staff Reliability Engineer - AI & Hyperscale Server NPI and Mfg. in Secaucus, NJ vacancy
  • $124.5k - $166k

    About the Role Our reliability team is responsible to evaluate,...  ...looking for a passionate Sr. Staff Reliability Engineer with exceptional knowledge...  ...that go into building a hyperscale cloud services. What You'...  ...maintainability of Cloud Infrastructure servers. Identifies, collects,... 
    Suggested
    Permanent employment
    Work experience placement
    Work at office
    Local area

    Sanmina-SCI Systems de México

    Secaucus, NJ
    1 day ago
  • $105k - $140k

    A technology company in Secaucus, NJ is seeking a Reliability Engineer to evaluate and implement reliability test regimens for cloud services. The candidate should have a minimum of a B.S. in a relevant field and at least 5 years of experience in reliability engineering... 
    Suggested

    Sanmina-SCI Systems de México

    Secaucus, NJ
    4 days ago
  • $105k - $140k

    ZT Group Intl, Inc. dba ZT Systems is looking for a Reliability Engineer based in Secaucus, New Jersey. The role involves applying Design for Reliability principles to ensure that cloud hardware meets quality standards before delivery. The ideal candidate should possess... 
    Suggested
    Work experience placement

    ZT Group Intl, Inc. dba ZT Systems

    Secaucus, NJ
    5 days ago
  • $124.5k - $166k

    ZT Group Intl, Inc. dba ZT Systems is looking for a passionate Sr. Staff Reliability Engineer to evaluate and implement reliability testing for cloud services. The role requires a collaborative individual with strong experience in hardware design, data analysis, and programming... 
    Suggested

    ZT Group Intl, Inc. dba ZT Systems

    Secaucus, NJ
    5 days ago
  • $124.5k - $166k

    A leading technology company in Secaucus, NJ, is seeking a passionate Sr. Staff Reliability Engineer. The role involves ensuring cloud hardware meets reliability standards, collaborating with teams, and developing reliability engineering guidelines. With a Bachelor's degree... 
    Suggested

    Sanmina-SCI Systems de México

    Secaucus, NJ
    1 day ago
  • $116.25k - $193.75k

    About the Role The Senior Staff Electrical Engineer will provide technical leadership across cloud compute server systems and PCB boards, including add‑on cards, front panels, and IO backplane products. The engineer will make decisions on specifications, design, BOM, PCB... 
    Permanent employment
    For contractors
    Local area

    ZT Group Intl, Inc. dba ZT Systems

    Secaucus, NJ
    5 days ago
  • $200k - $250k

     ...Job Description Job Description Tabs is the leading AI-native revenue platform for modern finance and accounting...  ...finance and AI. About the Role We’re looking for a Staff Site Reliability Engineer to lead the evolution of Tabs’ platform as we scale. In this... 
    Full time
    Contract work
    Work at office

    TABS inc.

    New York, NY
    3 days ago
  •  ...Shape the future of trust in the age of AI At Oscilar, we're building the most advanced...  ...a experienced SRE to take ownership of reliability across our multi-region, cloud-native...  ...simulations to harden the platform. Mentor engineers and set best practices for SRE across... 
    Remote work

    Oscilar

    New York, NY
    5 days ago
  •  ...provides the infrastructure foundation for AI teams. With instant GPU access, sub-...  ...international Olympiad medalists, and experienced engineering and product leaders with decades of...  ...company, we seek to improve our reliability dramatically while scaling the size of our... 

    Modal

    New York, NY
    3 days ago
  • $240k - $300k

     ...time Location Type On-site Department Engineering & Product Engineering Compensation $2...  ...-by-side every step of the way. Our AI-native workspace empowers legal...  ...intelligent future of law? The role As a Staff Site Reliability Engineer you'll play a lead role on the... 
    Full time
    Work at office

    Menlo Ventures

    New York, NY
    3 days ago
  • $124.5k - $182.6k

     ...The Product Operations Engineering team serves as the technical...  .... As a Senior Staff Product Operations Engineer...  ...manufacturing readiness for hyperscale compute, storage, and high‑density AI GPU systems. Lead the technical...  ...technical inputs for NPI, manufacturing... 
    Permanent employment
    Work at office
    Local area

    ZT Group Intl, Inc. dba ZT Systems

    Secaucus, NJ
    5 days ago
  • $211.7k - $292k

     ...first - and that mission depends on reliable, secure, and scalable systems. As a Staff SRE on the infrastructure team,...  ...and building tools that empower our engineers to ship safely and confidently....  ...including artificial intelligence (AI), to assist with parts of our recruiting... 
    Local area
    Flexible hours

    Ro

    New York, NY
    22 days ago
  • $116.25k - $193.75k

    ZT Group Intl, Inc. dba ZT Systems is looking for a Senior Staff Electrical Engineer to provide technical leadership in the design of PCBs for cloud compute server systems. The successful candidate will lead cross-functional teams and be responsible for supervising junior... 

    ZT Group Intl, Inc. dba ZT Systems

    Secaucus, NJ
    5 days ago
  • $116.25k - $155k

    About The Role The NPI-focused Manufacturing Quality Engineer provides oversight and guidance in manufacturing operations. They coach manufacturing personnel on proper compliance and empower them to identify and upscale quality issues, to ensure quality products. Our Manufacturing... 
    Work at office
    Local area

    ZT Group Intl, Inc. dba ZT Systems

    Secaucus, NJ
    5 days ago
  • $116.25k - $155k

    About The RoleThe NPI-focused Manufacturing Quality Engineer provides oversight and guidance in manufacturing operations. They coach manufacturing personnel on proper compliance and empower them to identify and escalate quality issues, to ensure quality products. Our Manufacturing... 
    Permanent employment
    Work at office
    Local area

    ZT Systems

    Secaucus, NJ
    4 days ago
  •  ...computing experiences—from AI and data centers, to...  ...Data Center Platform Engineering Group (DPEG) designs,...  ...cloud-enabling server solutions that help the...  ...and execution of AI and hyperscale server platform programs...  ...platform validation, reliability, and firmware domains.... 

    Advanced Micro Devices , Inc.

    Secaucus, NJ
    3 days ago
  • $93k - $136.4k

    About the Role The Manufacturing Process Engineer is responsible for leading the measurement and analysis of manufacturing processes, developing processes and related tooling, implementing standard operating procedures, driving continuous improvements in the factory, implementing... 
    Permanent employment
    Work at office
    Local area
    Immediate start

    ZT Systems

    Secaucus, NJ
    3 days ago
  • $93k - $136.4k

    About the Role The Manufacturing Process Engineer leads the measurement and analysis of manufacturing processes, develops processes and related tooling, implements standard operating procedures, drives continuous improvements in the factory, introduces new technologies... 
    Work at office
    Local area

    ZT Group Intl, Inc. dba ZT Systems

    Secaucus, NJ
    5 days ago
  • $200k - $350k

     ...they had to do. Powerful AI will be the biggest lever...  ...looking for a seasoned Senior / Staff Network Security Engineer to spearhead our security...  ...secure, resilient, and reliable at massive scale. Operating...  ...large-scale cloud or hyperscale environments and complex distributed... 
    Local area

    Fluidstack

    New York, NY
    3 days ago
  • $320k - $405k

     ...Staff Infrastructure Engineer, Cluster Infrastructure San Francisco, CA | New York City,...  ...Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe...  ...scale compute infrastructure at hyperscale (100+ clusters, 10K+ nodes) ~... 
    Work at office
    Visa sponsorship
    Flexible hours

    anthropic

    New York, NY
    4 days ago
  •  ...top remote mechanical engineer jobs and find flexible...  ...Associates focusing on hyperscale data centers and...  ...and mentoring junior staff while interfacing with...  ...facing experiences for an AI-native B2B SaaS platform...  ...meeting operational and reliability standards. Electrical... 
    Remote work
    Flexible hours

    Kickstart Remote

    New York, NY
    5 days ago
  • $194k - $267k

     ...Secure Every Identity, from AI to Human Identity is the key to unlocking the potential of AI. Okta secures AI by building...  ...on new concepts and tools. Position Overview: The Site Reliability Engineer (SRE) will play a key role in building and managing Kubernetes... 
    Permanent employment
    Work at office
    Local area
    Worldwide
    Flexible hours

    Okta

    New York, NY
    more than 2 months ago
  • $147k - $202k

     ...Secure Every Identity, from AI to Human Identity is the key to unlocking the potential of AI. Okta secures...  ...Position Overview: We are seeking a highly technical Staff Observability Site Reliability Engineer with a specialty in Splunk to own and evolve our... 
    Permanent employment
    Work at office
    Local area
    Worldwide
    Flexible hours

    Okta

    New York, NY
    a month ago
  •  ...is looking for a Principal Electrical Architect to architect server and rack level solutions for customers in the cloud computing...  ...ideal candidate will possess extensive experience in system-level engineering, server architecture, and will be responsible for developing... 

    ZT Group Intl, Inc. dba ZT Systems

    Secaucus, NJ
    5 days ago
  • A leading technology company in Secaucus, NJ is seeking a Reliability Technician to support lab operations for advanced GPU technology....  ...teamwork and communication skills. Responsibilities include managing server upgrades, performing diagnostics, and executing reliability... 
    Monday to Friday
    Weekend work

    Advanced Micro Devices

    Secaucus, NJ
    3 days ago
  • $139k - $242k

     ...Senior Software Engineer, Server Fleet Infrastructure Livingston, NJ...  ...is The Essential Cloud for AI™. Built for pioneers by pioneers...  ...the company's delivery of reliable and efficient infrastructure...  ...solving complex problems at hyperscale, deploying purpose built AI... 
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    New York, NY
    3 days ago
  •  ...location ) 8-10+ years in DBA / Platform Engineering Strong multi-cloud experience (Azure /...  ...SaaS/DBaaS environments preferred Site Reliability Engineer (SRE) in DBaaS Support Role...  ...Secondary Database: MySQL, Oracle, MS SQL Server Database Backup & Recovery: Tools and... 
    Work at office

    Rackspace Technology

    New York, NY
    5 days ago
  • Ehvert Inc is looking for a skilled Electrical Engineer to join their team in New York. This remote role focuses on designing and coordinating complex electrical systems for hyperscale and AI data centers. The successful candidate will lead design activities, model systems... 
    Remote job

    Ehvert Inc

    New York, NY
    5 days ago
  • A technology infrastructure company in Secaucus, NJ, is seeking a Staff Thermal Design Engineer responsible for developing thermal components and optimizing hardware for data centers. The ideal candidate has a Bachelor's in Mechanical or Electrical Engineering, 8+ years... 

    Sanmina-SCI Systems de México

    Secaucus, NJ
    2 days ago
  • $208k - $282k

     ...Staff Data Engineer At Komodo Health, our mission is to reduce the global...  ...measurable through improved reliability, observability, and cost-efficiency...  ..., Rust, C++, and emerging AI-enabled engineering...  ...as ICD-10, CPT, NDC, RxNorm, NPI, or taxonomy data. Data Product... 
    Work experience placement
    Local area
    Flexible hours

    Komodo Health

    New York, NY
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Staff Reliability Engineer - AI & Hyperscale Server NPI and Mfg.. Be the first to apply!