Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Lead Test Engineer, Server Compute Firmware - AI Data Center 1

Celestica International LP

Req ID: 137753 
Region: Americas 
Country: USA 
State/Province: Texas 
City:  Austin 

General Overview

Functional Area: Engineering        
Career Stream: Design - Software Engineering
SAP Short Name: LEN-ENG-DSE
Job Level: Level 08
IC/MGR: Individual Contributor        
Direct/Indirect Indicator: Indirect

Summary

The Senior Lead Server Compute CPU & GPU Firmware Test Engineer will play a pivotal role in the design, development, and execution of comprehensive test strategies for our AI data center's server infrastructure. This leadership position requires deep expertise in server architectures, enterprise storage systems, networking, and a strong understanding of the unique performance and reliability demands of AI/ML workloads. The ideal candidate will be a hands-on technical leader, capable of mentoring junior engineers, driving test automation, and collaborating across engineering teams to deliver robust and high-performing solutions

Knowledge / Skills / Competencies

  • Define, develop, and implement comprehensive test plans and strategies for all storage and server hardware, firmware, and software components within the AI data center environment.

  • Lead the test team in designing, executing, and analyzing complex test cases, including functional, performance, reliability, stress, and endurance testing.

  • Mentor and provide technical guidance to junior test engineers, fostering a culture of technical excellence and continuous improvement.

  • Design and implement automated test frameworks and scripts using languages like Python, Go, or similar, to improve efficiency and coverage of testing.

  • Conduct in-depth performance analysis and bottleneck identification for server platforms (e.g., CPU, GPU, memory, PCIe, networking), OpenBMC interfaces/features and storage systems (e.g., NVMe, SSD, HDD arrays, distributed storage, SAN/NAS)

  • This includes debugging issues related to BMC functionality and its interaction with server hardware.

  • Develop and maintain robust testbeds and infrastructure for continuous integration and validation.

  • Utilize open-source and commercial test tools relevant to server, OpenBMC and storage validation.

  • Collaborate closely with hardware design, software development, infrastructure, and AI/ML engineering teams to understand requirements and integrate testing throughout the product lifecycle.

  • Communicate test progress, results, and critical issues effectively to stakeholders, including executive leadership.

  • Develop specialized test methodologies to validate performance and reliability under heavy AI/ML workloads (e.g., large model training, inference at scale, data ingestion).

  • Understand and test the interactions between GPU-accelerated computing, high-speed networking, and storage systems.

Qualifications

  • Bachelor's or Master's degree in Computer Science, Electrical Engineering, or a related technical field.

  • 5+ years of experience in hardware and/or software testing, with at least 5 years focused on enterprise-level storage and server systems.

  • 1+ years of experience in a lead or senior technical role, mentoring junior engineers or leading test initiatives.

  • Proven experience in a lead or senior technical role, mentoring and guiding other engineers.

  • Deep expertise in server architectures (x86, ARM, GPU servers), CPU/memory subsystems, PCIe, and power management.

  • Strong understanding of server architectures (x86, ARM, GPU servers), CPU/memory subsystems, PCIe, power management, and Baseband Management Controllers (BMC) functionality.

  • Strong understanding of storage technologies such as NVMe, SAS/SATA SSDs/HDDs, RAID, distributed file systems (e.g., Ceph, Lustre, GPFS), SAN, and NAS.

  • Proficiency in scripting languages (e.g., Python, Bash) for test automation and data analysis.

  • Experience with Linux operating systems (e.g., Ubuntu, CentOS, RHEL) and command-line tools.

  • Familiarity with networking concepts (Ethernet, TCP/IP, InfiniBand) and network testing methodologies.

  • Experience with test methodologies such as performance testing, reliability testing, stress testing, and fault injection.

  • Excellent problem-solving, analytical, and debugging skills.

  • Strong communication and interpersonal skills, with the ability to collaborate effectively across diverse teams.

Preferred Qualifications:

  • Familiarity with OCP (Open Compute Project)

  • Experience with cloud environments (AWS, Azure, GCP) and virtualization technologies.

  • Knowledge of containerization technologies (Docker, Kubernetes).

  • Familiarity with AI/ML frameworks (e.g., TensorFlow, PyTorch) and their infrastructure requirements.

  • Experience with performance profiling tools (e.g., fio, Iometer, Perf, VTune).

  • Contributions to open-source projects related to storage, servers, or testing.

  • Certifications in relevant technologies (e.g., NetApp, Dell EMC, HPE, NVIDIA).

Notes

This job description is not intended to be an exhaustive list of all duties and responsibilities of the position. Employees are held accountable for all duties of the job. Job duties and the % of time identified for any function are subject to change at any time.

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.
Celestica's policy on equal employment opportunity prohibits discrimination based on race, color, creed, religion, national origin, gender, sexual orientation, gender identity, age, marital status, veteran or disability status, or other characteristics protected by law. 
This policy applies to hiring, promotion, discharge, pay, fringe benefits, job training, classification, referral and other aspects of employment and also states that retaliation against a person who files a charge of discrimination, participates in a discrimination proceeding, or otherwise opposes an unlawful employment practice will not be tolerated. All information will be kept confidential according to EEO guidelines.

COMPANY OVERVIEW:
Celestica (NYSE, TSX: CLS) enables the world's best brands. Through our recognized customer-centric approach, we partner with leading companies in Aerospace and Defense, Communications, Enterprise, HealthTech, Industrial, Capital Equipment and Energy to deliver solutions for their most complex challenges. As a leader in design, manufacturing, hardware platform and supply chain solutions, Celestica brings global expertise and insight at every stage of product development – from drawing board to full-scale production and after-market services for products from advanced medical devices, to highly engineered aviation systems, to next-generation hardware platform solutions for the Cloud. Headquartered in Toronto, with talented teams spanning 40+ locations in 13 countries across the Americas, Europe and Asia, we imagine, develop and deliver a better future with our customers.

Celestica would like to thank all applicants, however, only qualified applicants will be contacted.
Celestica does not accept unsolicited resumes from recruitment agencies or fee based recruitment services.

This location is a US ITAR facility and these positions will involve the release of export controlled goods either directly to employees or through the employee's movement within the facility. As such, Celestica will require necessary information from all applicants upon an applicant's acceptance of employment to determine if any export control exemptions or licenses must be filed.

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Lead Test Engineer, Server Compute Firmware - AI Data Center 1 in Austin, TX vacancy
  • Celestica Inc. is seeking a Staff Engineer, specializing in server compute firmware testing within the AI data center. The role involves developing and implementing test strategies for hardware and software components, ensuring high performance and reliability. Candidates... 
    Suggested

    Celestica Inc.

    Austin, TX
    17 hours ago
  • A leading technology firm located in Austin, Texas, is seeking an experienced Test Engineer to define and implement comprehensive test plans for storage and server systems. The ideal candidate will have over...  ...challenges in a dynamic AI data center environment. Additional... 
    Suggested

    Celestica Inc.

    Austin, TX
    3 days ago
  • $220.92k - $311.89k

    Intel’s AI SoC organization develops cutting...  ...edge devices to data center accelerators. If you are an engineer with strong technical...  ...Role Overview As a Lead Senior Design Engineer...  ...electrical engineering, computer engineering,...  ...Experienced Hire Shift Shift 1 (United States of... 
    Suggested
    Local area
    Shift work

    Intel Corporation

    Austin, TX
    2 days ago
  •  ...of the world’s leading innovators in Artificial...  ...Intelligence compute. It is...  ...next generation of AI breakthroughs and...  ...System Validation Engineer to design and implement...  ...validation tests for Arm-based data center SoCs using a...  ...across hardware, firmware, and software domains... 
    Suggested

    Cerebras

    Austin, TX
    2 days ago
  • A leading semiconductor company is seeking an Engineering Technician for their AI Data Center Systems in Austin, Texas. This on-site role involves...  ...of AI data center servers and GPU platforms. Candidates...  ...safety, and high-performance computing. #J-18808-Ljbffr Advanced... 
    Suggested

    Advanced Micro Devices , Inc.

    Austin, TX
    1 day ago
  •  ...that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded...  ...career. THE ROLE: We are seeking Lead Gen AI and ML Engineer to join our team that will...  ...CPU/GPU validation, BIOSQA, or firmware development Experience in system... 

    Advanced Micro Devices , Inc.

    Austin, TX
    17 hours ago
  • Graphcore is seeking a Senior Firmware Validation Engineer in Austin, Texas to ensure the quality of ARM-based server firmware. The role involves defining validation strategies...  ...should hold a degree in Electrical or Computer Engineering and have over 6 years of relevant... 

    jobr.pro

    Austin, TX
    4 days ago
  •  ...accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and...  ...Silicon debug of AMD EPYC Server & AMD Instinct products....  ...level failures working with engineering teams across AMD....  ...quality. The system debug lead will also help in driving... 

    Advanced Micro Devices

    Austin, TX
    2 days ago
  •  ...next‑generation computing experiences—from AI and data centers, to PCs, gaming...  ...the AMD EPYC Server team, we are committed...  ...for system engineering leaders who are...  ..., validation, firmware and software...  ...an EPYC Server Lead System Engineer...  ...resources, and oversee test plan execution... 

    Advanced Micro Devices

    Austin, TX
    4 days ago
  • A leading technology company in Austin, Texas, is seeking an Engineering Technician for AI Data Center Systems. This role supports the bring-up and sustainment of servers and GPU platforms, requiring a strong background in hardware, troubleshooting skills, and excellent... 

    Advanced Micro Devices, Inc.

    Austin, TX
    2 days ago
  •  ...accelerate next‑generation computing experiences—from AI and data centers, to PCs, gaming and...  ...THE ROLE Cluster Thermal Engineer We are seeking a Cluster...  ...Help define and execute test plans to validate thermal...  ...power, networking, platform, firmware, and controls teams. Participate... 
    Internship

    AMD

    Austin, TX
    2 days ago
  •  ...accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and...  ...the design and validation testing on AMD ICs (GPU, CPU, DPU...  ...and testing per IEC 62368-1. Knowledge of Telcordia...  ...CREDENTIALS BS in Electrical Engineering or equivalent MS or a... 
    Worldwide

    Advanced Micro Devices

    Austin, TX
    2 days ago
  •  ...Devices seeks a skilled Platform SoC Debug Lead Engineer in Austin, TX. This role involves...  ...validation and debug efforts for server EPYC and AI platforms, requiring strong collaboration...  ...AMD and contribute to innovative computing solutions. #J-18808-Ljbffr Advanced... 

    Advanced Micro Devices , Inc.

    Austin, TX
    2 days ago
  •  ...next-generation computing experiences—from AI and data centers, to PCs, gaming...  ...Analysis (FA) Engineer, you will play...  ...GPU-accelerated server platforms deployed...  ...electrical, firmware, and platform‑level...  ...manufacturing test failures (POST,...  ...Lead issue triage and... 

    Advanced Micro Devices

    Austin, TX
    2 days ago
  •  ...the world’s leading innovators...  ...Intelligence compute. It is...  ...generation of AI breakthroughs...  ..., software engineers and systems...  ...a Senior Firmware Validation...  ...ARM-based server platforms....  ...semiconductors and data centre...  ...automated test frameworks...  ...server or data center systems.... 
    Flexible hours

    Jobr

    Austin, TX
    4 days ago
  • Senior Lead Engineer, Software 2 (Austin) Location...  ...across all levels (firmware to application),...  ...spanning Storage, Compute and Networking appliance...  ...projects for data center and enterprise applications...  ..., and analyze tests and test-...  ...utilizing SDK for AI Accelerators, Network... 
    Work at office

    Celestica Inc.

    Austin, TX
    4 days ago
  •  ...accelerate next‑generation computing experiences—from AI and data centers, to PCs, gaming and...  ...seeking a visionary Lead/Principal Diagnostics Engineer to drive our shift...  ...frameworks and System‑Level Tests (SLT) directly into...  ...interfaces, firmware, and power management... 
    Shift work
    Early shift

    Advanced Micro Devices , Inc.

    Austin, TX
    2 days ago
  •  ...accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and...  ...your career. Server Platform SoC Validation lead and Debug Engineer Roles and...  ...design, validation, firmware and software engineering...  ...features to drive test execution and resolve... 
    Remote work

    Advanced Micro Devices

    Austin, TX
    3 days ago
  • Celestica Inc. is seeking a Staff Test Engineer to define and implement test strategies for storage and server systems within the AI data center environment. This role requires 10+ years...  ...workloads. The ideal candidate will take a lead role in developing automated testing... 

    Celestica

    Austin, TX
    2 days ago
  • $144.23 - $168.27 per hour

    Lead Mechanical Engineer, Data Center Design This is a full-time, on-site role located in Austin...  ...portfolio of hyperscale AI data center developments....  ...infrastructure, or high‑performance computing facilities Bachelor's...  ...(opt‑out available), 1% annual increases up to 10... 
    Full time
    Local area

    Flux Resources, LLC

    Austin, TX
    4 days ago
  • Plasticos Castella SA in Austin, TX, seeks a Lead Software Development Test Engineer to enhance our Intelligent Infrastructure division. You will design test systems, develop software, and ensure high-quality standards in manufacturing processes. The ideal candidate has... 
    Remote job

    Plasticos Castella SA

    Austin, TX
    17 hours ago
  •  ...Micro Devices in Austin, Texas is seeking a Server Systems Performance Architect responsible...  ...verification and analysis, leveraging AI/ML techniques to enhance efficiencies. The...  ...ideas contribute to shaping the future of computing. #J-18808-Ljbffr Advanced Micro Devices

    Advanced Micro Devices

    Austin, TX
    3 days ago
  •  ...Summary The Vertiv Server Liquid Systems team is seeking a Test Engineer to perform...  ...validation of our data center liquid cooling infrastructure...  ...and product leads to ensure hardware...  ...Bachelor's Degree in Computer Science, Computer...  ...such as E, F-1, H-1, H-2, L, B, J, or... 
    Temporary work

    Vertiv Co

    Austin, TX
    4 days ago
  • $125k - $185k

     ...landscape for the AI era with our groundbreaking...  ...023 and backed by leading investors...  ...for a Controls & Firmware Engineer to help design and...  ...contribute to building test benches, CI/CD...  ...application to map controls data into historians,...  ..., Controls, Computer Engineering, or... 
    Relocation
    Relocation package

    Exowatt

    Austin, TX
    3 days ago
  • $198.7k - $298.1k

     ...Technologies, Inc.Job Area:Engineering Group, Engineering...  ...next-generation computing platforms. By joining...  ...The work performed will lead to lower time to resolve issues and improve data center uptime by providing secure...  ...attestation, hardware/firmware provenance, isolation... 
    Work experience placement
    Work from home

    Nutanix

    Austin, TX
    17 hours ago
  • Advanced Micro Devices is seeking a Lead System Debug Engineer to drive system and silicon debug of AMD EPYC Server and AMD Instinct products. The role involves leading debug initiatives, managing technical communications, and ensuring high-quality resolutions. Candidates... 

    Advanced Micro Devices

    Austin, TX
    2 days ago
  •  ...on the planet—designed for data center, AI, and edge workloads, with real...  ...to work alongside engineers who built iconic processors...  ...K6 and the first 64‑bit ARM server processor (X‑Gene at AppliedMicro...  ...Strong domain knowledge of computer architecture. Skills and Qualifications... 

    Ventana Micro Systems

    Austin, TX
    1 day ago
  • Who We Are Inorsa is an AI‑first automation...  ...serve telecom, fiber, data centers, and more, automating everything...  ...AI platform in the engineering vertical. This role is...  ...prompts, and run regression tests against gold‑standard...  ...the Technical Standard Lead by example on AI‑native... 
    Flexible hours

    Inorsa

    Austin, TX
    3 days ago
  •  ...that power generative AI and ML workloads, focusing...  ...Develop and maintain server‑side software for AI/ML...  ...profiling, and automated testing. Participate in the...  ...work on high‑performance computing (HPC) workloads and...  ...teams. #J-18808-Ljbffr Gravity Engineering Services Pvt Ltd.

    Gravity Engineering Services Pvt Ltd.

    Austin, TX
    2 days ago
  •  ...Controller (SMC) software and firmware for Machine Learning Accelerator (MLA) servers. Continuously...  ...and laboratory‑based testing. A day in the life...  ...learning algorithms. Data paths, I2C, SPI, PCIe...  ...Bachelor's degree in Computer Science, Engineering, Mathematics, or a related... 
    Internship

    Annapurna Labs

    Austin, TX
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Lead Test Engineer, Server Compute Firmware - AI Data Center 1. Be the first to apply!