Sr. Lead Test Engineer, Compute Server & Storage - AI Data Center

Celestica International LP

Req ID: 127776
Region: Americas
Country: USA
State/Province: Texas
City: Richardson

General Overview

Functional Area: Engineering        
Career Stream: Design - Software Engineering
SAP Short Name: SLE-ENG-DSE
Job Level: Level 09
IC/MGR: Individual Contributor        
Direct/Indirect Indicator: Indirect

Summary

The Senior Lead Server and Storage Test Engineer will play a pivotal role in the design, development, and execution of comprehensive test strategies for our AI data center's server infrastructure. This position requires deep expertise in enterprise storage systems, server architectures, and networking, and a strong understanding of the unique performance and reliability demands of AI/ML workloads. The ideal candidate will be a hands-on technical Individual Contributor capable of driving test automation and collaborating across engineering teams to deliver robust and high-performing solutions.

Knowledge/Skills/Competencies

    • Define, develop, and implement comprehensive test plans and strategies for all server and storage hardware, firmware, and software components within the AI data center environment.

    • Mentor and provide technical guidance to junior test engineers, fostering a culture of technical excellence and continuous improvement.

    • Design and implement automated test frameworks and scripts using languages like Python, Go, or similar, to improve efficiency and coverage of testing.

    • Conduct in-depth performance analysis and bottleneck identification for storage systems (e.g., NVMe, SSD, HDD arrays, JBODs, distributed storage), server platforms (e.g., CPU, GPU, BMC, BIOS, DIMM memory, PCIe, networking), and OpenBMC interfaces and features. This includes debugging issues related to BMC functionality and its interaction with server hardware.

    • Develop and maintain robust testbeds and infrastructure for continuous integration and validation.

    • Utilize open-source and commercial test tools relevant to storage, server, and OpenBMC validation.

    • Collaborate closely with hardware design, software development, infrastructure, and AI/ML engineering teams to understand requirements and integrate testing throughout the product lifecycle.

    • Communicate test progress, results, and critical issues effectively to stakeholders, including executive leadership.

    • Develop specialized test methodologies to validate performance and reliability under heavy AI/ML workloads (e.g., large model training, inference at scale, data ingestion).

    • Understand and test the interactions between GPU-accelerated computing, high-speed networking, PCIe switches, and storage systems.

Required Qualifications

  • Bachelor's or Master's degree in Computer Science, Electrical Engineering, or a related technical field.

  • 7+ years of experience in hardware and/or software testing, with at least 5 years focused on enterprise-level server and storage systems.

  • 3+ years of proven experience in a lead or senior technical role, mentoring and guiding junior engineers or leading test initiatives.

  • Deep expertise in various storage technologies including NVMe, SAS/SATA SSDs/HDDs, RAID, distributed file systems (e.g., Ceph, Lustre, GPFS), SAN, and NAS.

  • Strong understanding of server architectures (x86, ARM, AMD/NVIDIA GPU-based servers), CPU/memory subsystems, PCIe, power management, and Baseboard Management Controller (BMC) functionality.

  • Proficiency in scripting languages (e.g., Python, Bash) for test automation and data analysis.

  • Experience with Linux operating systems (e.g., Ubuntu, CentOS, RHEL) and command-line tools.

  • Familiarity with networking concepts (Ethernet, TCP/IP, InfiniBand) and network testing methodologies.

  • Experience with test methodologies such as performance testing, reliability testing, stress testing, and fault injection.

  • Excellent problem-solving, analytical, and debugging skills.

  • Strong communication and interpersonal skills, with the ability to collaborate effectively across diverse teams.

Preferred Qualifications

  • Familiarity with OCP (Open Compute Project).

  • Experience with cloud environments (AWS, Azure, GCP) and virtualization technologies.

  • Knowledge of containerization technologies (Docker, Kubernetes).

  • Familiarity with AI/ML frameworks (e.g., TensorFlow, PyTorch) and their infrastructure requirements.

  • Experience with performance profiling tools (e.g., fio, Iometer, Perf, MLTT, VTune).

  • Contributions to open-source projects related to storage, servers, or testing.

  • Certifications in relevant technologies (e.g., NetApp, Dell EMC, HPE, NVIDIA).

Physical Demands

  • Duties of this position are performed in a normal office environment.
  • Duties may require extended periods of sitting and sustained visual concentration on a computer monitor or on numbers and other detailed data. 
  • Repetitive manual movements (e.g., data entry, using a computer mouse, using a calculator, etc.) are frequently required.
  • Occasional travel may be required.

Typical Experience

  • 6 to 12 years

Typical Education

Bachelor's degree or consideration of an equivalent combination of education and experience.

Educational Requirements may vary by Geography

Notes

This job description is not intended to be an exhaustive list of all duties and responsibilities of the position. Employees are held accountable for all duties of the job. Job duties and the % of time identified for any function are subject to change at any time.

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.
Celestica's policy on equal employment opportunity prohibits discrimination based on race, color, creed, religion, national origin, gender, sexual orientation, gender identity, age, marital status, veteran or disability status, or other characteristics protected by law. 
This policy applies to hiring, promotion, discharge, pay, fringe benefits, job training, classification, referral and other aspects of employment and also states that retaliation against a person who files a charge of discrimination, participates in a discrimination proceeding, or otherwise opposes an unlawful employment practice will not be tolerated. All information will be kept confidential according to EEO guidelines.

COMPANY OVERVIEW:
Celestica (NYSE, TSX: CLS) enables the world's best brands. Through our recognized customer-centric approach, we partner with leading companies in Aerospace and Defense, Communications, Enterprise, HealthTech, Industrial, Capital Equipment and Energy to deliver solutions for their most complex challenges. As a leader in design, manufacturing, hardware platform and supply chain solutions, Celestica brings global expertise and insight at every stage of product development – from drawing board to full-scale production and after-market services for products from advanced medical devices, to highly engineered aviation systems, to next-generation hardware platform solutions for the Cloud. Headquartered in Toronto, with talented teams spanning 40+ locations in 13 countries across the Americas, Europe and Asia, we imagine, develop and deliver a better future with our customers.

Celestica would like to thank all applicants; however, only qualified applicants will be contacted.
Celestica does not accept unsolicited resumes from recruitment agencies or fee-based recruitment services.

This location is a US ITAR facility and these positions will involve the release of export-controlled goods either directly to employees or through the employee's movement within the facility. As such, Celestica will require necessary information from all applicants upon an applicant's acceptance of employment to determine if any export control exemptions or licenses must be filed.

Vacancy posted 19 days ago