Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

HPC Performance and Validation Engineer

NorthMark Strategies LLC

The Company NorthMark Compute & Cloud (NMC²) operates at the bleeding edge of technology, aiming to scale and enhance high-performance computing (HPC) and cloud infrastructure that supports clients’ research, production, and delivery. Engineers build critical infrastructure to eliminate friction in scientific research, simulations, analysis, and decision-making, accelerating discovery and driving faster innovation. Responsibilities Architect and implement a validation framework to certify the readiness and utilization of GPU nodes across a large, distributed HPC environment. Define methodologies to continually assess performance and optimize infrastructure across AI/ML workloads. Develop and execute comprehensive performance testing using industry and customer‑specific benchmarks, ensuring optimal performance across HPC compute, storage, and networking. Contribute to research reports describing benchmarking discoveries, evaluating complete hardware performance and efficiency. Lead debugging, identify, and resolve bottlenecks in system performance. Build robust, scalable tools for automated validation and testing, utilizing Python, Go, Kubernetes, and CI/CD pipelines to streamline continuous validation and benchmarking processes. Implement monitoring solutions using Prometheus, Grafana, and other modern monitoring technologies to track performance metrics and real‑time health of the cluster. Define and implement best practices for continuous performance validation, ensuring that the infrastructure remains reliable and efficient as new technologies emerge. Stay informed on industry trends and advancements to ensure long‑term strategic alignment. Work cross‑functionally with engineering, infrastructure, and research teams to align validation efforts with broader business objectives, ensuring the platform meets evolving research demands. Requirements Accelerator performance experience, including profiling and tuning with large‑scale GPU clusters. In‑depth understanding of NVIDIA ClusterKit, Nsight, and Validation Suite, MLPerf and DCGM tools for GPU and DPUs. Networking & storage performance experience, including profiling and optimisation with NVIDIA ClusterKit, iPerf or equivalent across InfiniBand/RoCe network implementations. System benchmarking experience across Linux and familiarity with the Phronix suite or equivalent. Experience with HPC workloads across distributed global locations, providing data‑driven performance data to complement key architectural decisions. Strong proficiency in developing automation tools and micro‑benchmarking frameworks for validation using Python, Go, and Kubernetes in an Ubuntu Linux environment. Expertise with key monitoring platforms including OTEL, Prometheus, ELK, and Grafana, and in defining and implementing the overall observability strategy for HPC validation and performance monitoring. Deep understanding of emerging technologies, architectures, and strategies, with the ability to assess their potential impact on infrastructure and adopt them as part of a long‑term plan. Proven ability to lead complex technical projects, influence decisions, and engage with stakeholders across technical and research teams. NorthMark Compute & Cloud (NMC²) is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability, or veteran status. #J-18808-Ljbffr NorthMark Strategies LLC

Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the HPC Performance and Validation Engineer in Dallas, TX vacancy
  • Texas Instruments in Dallas, Texas, is seeking a Validation Engineer to assure the quality and performance of innovative IC products. This role involves supporting strategies that achieve profitability targets through new product development and resolving customer quality... 
    Performance

    Texas Instruments

    Dallas, TX
    5 days ago
  • Texas Instruments in Dallas, Texas is seeking a motivated Validation Engineer to work on state-of-the-art technology characterization in the...  ...automation software, and ensuring that devices meet strict performance specifications. Candidates should have a degree in... 
    Performance

    Texas Instruments

    Dallas, TX
    3 days ago
  • Change the world. Love your job. Validation engineers are said to have a ';break-the-part'; mentality. This means they spend some of their...  ...reasons. Plus, we want to ensure that our parts are consistently performing to the specifications that we set in our data sheet, and... 
    Performance

    Texas Instruments

    Dallas, TX
    3 days ago
  •  ...innovative technologies are developed, built and optimized as a validation engineer. Work with TI teams, customers and external partners around...  ..., you play a vital role in assuring the quality and performance of TI's innovative IC products. In this role, you'll support... 
    Performance

    Texas Instruments

    Dallas, TX
    4 days ago
  •  ...the world. Love your job. The Product/Test Engineers at Texas Instruments are powered by a passion...  ...for‑test features and Applications team on validation, while also tackling challenging problems in achieving analog performance testing on ATE multi‑site hardware. This... 
    Performance

    Texas Instruments

    Dallas, TX
    3 days ago
  • $110k - $170k

     ...mission test scenarios that mimic real-world conditions to validate the performance of autonomy and flight control software. Participate in...  ...fast‑paced environment. B.S., M.S., PhD degree in Systems Engineering, Software Engineering, Computer Science or a related field... 
    Performance
    Full time
    Temporary work
    Part time
    Work experience placement

    Shield AI

    Dallas, TX
    3 days ago
  • Radiant, a leader in GPU-as-a-Service, is seeking a Datacentre/Hardware Engineer to manage their HPC infrastructure in Dallas Fort Worth. The successful candidate will ensure optimal performance and reliability of complex compute systems. This role includes troubleshooting... 
    Performance

    Radiant

    Dallas, TX
    2 days ago
  •  ...At Capgemini Engineering, the world leader in engineering services, we bring together a global...  ...We are hiring Verification and Validation Engineer (Electro- Mechanical production...  ...software teams to validate integrated system performance, identify defects, and contribute to... 
    Performance
    Full time
    Local area

    Capgemini

    Dallas, TX
    10 hours ago
  •  ...seeking a highly motivated Mixed-Signal Verification Engineer III to support verification of analog and mixed-...  ...SoC simulation while maintaining key analog performance characteristics. Define modeling assumptions, validation approaches, and abstraction strategies in... 
    Performance

    Chelsea Search Group

    Dallas, TX
    2 days ago
  •  ...A high-performance computing firm based in Texas is seeking an HPC Storage Solutions Architect to design and optimize storage architectures for HPC and AI workloads...  ...workloads and involves collaboration with engineering and product teams. Strong expertise in parallel/distributed... 
    Performance

    NMC2

    Dallas, TX
    1 day ago
  • As a Computer System Validation (CSV) Specialist based in Dallas, TX, you will be responsible...  ...), Operational Qualification (OQ), and Performance Qualification (PQ) protocols. Lead...  ...Bachelor’s degree in a scientific or engineering discipline (or equivalent experience).... 
    Performance

    Ushitecsolutions

    Dallas, TX
    5 days ago
  •  ...operational readiness and unparalleled performance at the highest standard through our rigorous...  ...Commissioning, Qualification, and Validation (CQV) activities with a strong focus on...  ...Experience - Bachelor’s degree (BS/BA) in Engineering, Chemistry, or Life Sciences (relevant... 
    Performance
    For contractors
    Worldwide

    CAI

    Dallas, TX
    8 days ago
  • NMC2 is seeking a Manager of HPC Solutions Architecture in Dallas, Texas. The role requires...  ...a team of architects and deliver high-performance computing solutions tailored to client...  ...also involves collaboration with engineering teams to create innovative architectures... 
    Performance

    NMC2

    Dallas, TX
    2 days ago
  • A leading technology firm is seeking a Manager of HPC Solutions Architecture in Dallas, Texas. In this role, you will lead a team of Solutions Architects to design high-performance computing solutions. Key responsibilities include managing customer relationships, guiding... 
    Performance

    NMC2

    Dallas, TX
    2 days ago
  • Career Techniques is seeking an HPC Network Engineer in Dallas, Texas, to lead the design and deployment of high-performance compute networks. The ideal candidate will have demonstrable experience in designing scalable Ethernet and InfiniBand networks, and a passion for... 
    Performance

    Career Techniques Inc

    Dallas, TX
    1 day ago
  • A leading technology firm in Dallas is seeking an HPC Kubernetes Solutions Architect to guide customers in adopting GPU-accelerated...  ...architectural blueprints and integration strategies for high-performance computing (HPC) workloads, engaging with clients on their performance... 
    Performance

    NorthMark Strategies LLC

    Dallas, TX
    4 days ago
  • NorthMark Strategies LLC in Dallas, Texas, is seeking an engineer to architect a validation framework for HPC environments. Responsibilities include performance testing, optimization of infrastructure across AI/ML workloads, and developing automation tools. The ideal candidate... 
    Performance

    NorthMark Strategies LLC

    Dallas, TX
    5 days ago
  • A cutting-edge technology firm in Dallas seeks an HPC Network Solutions Architect to design high-performance networking architectures. The role involves acting as a trusted advisor, collaborating closely with clients, and working on scalable HPC environments. Ideal candidates... 
    Performance

    NMC2

    Dallas, TX
    2 days ago
  • NorthMark Compute and Cloud LLC, located in Dallas, Texas, is seeking a Manager of HPC Solutions Architecture to oversee a team that delivers high-performance computing solutions. This role involves engaging with customers, guiding the architectural design process, and... 
    Performance

    NorthMark Compute and Cloud LLC

    Dallas, TX
    5 days ago
  • A leading investment firm is seeking an HPC Storage Solutions Architect to design and optimize high-performance storage for HPC and AI/ML workloads. This customer-facing...  ...role involves collaborating with clients and engineering teams to implement scalable storage solutions,... 
    Performance

    NorthMark Strategies LLC

    Dallas, TX
    4 days ago
  •  ...leading technology firm in Dallas is seeking an experienced Engineering Manager, HPC Kubernetes Platform. This hands-on leadership role focuses...  ...the bare-metal Kubernetes environment, optimizing performance and reliability for GPU- and CPU-intensive workloads. The... 
    Performance

    NMC2

    Dallas, TX
    1 day ago
  •  ...this role, you will lead a team of architects in designing high-performance computing solutions tailored to customer needs. You will guide...  ...and platform scalability. Ideal candidates will have significant HPC experience, strong leadership skills, and the ability to... 
    Performance

    NorthMark Strategies LLC

    Dallas, TX
    5 days ago
  • A leading investment firm is seeking a Category Manager for HPC Infrastructure to oversee global sourcing and procurement for high-performance computing and AI data center infrastructure. The ideal candidate will manage an annual spend of $500M to $1B and lead strategic... 
    Performance

    NorthMark Strategies LLC

    Dallas, TX
    4 days ago
  •  ...leading investment firm is looking for an Emerging Network Architect to design and integrate networking technologies for high-performance computing (HPC). This role requires deep expertise in network architectures and hands-on experience with high-speed fabric solutions.... 
    Performance

    NorthMark Strategies LLC

    Dallas, TX
    4 days ago
  • A leading technology investment firm is seeking an HPC Network Solutions Architect in Dallas, TX. The role involves designing and optimizing high-performance networking architectures for HPC and AI/ML workloads. Responsibilities include collaborating with customers to... 
    Performance

    NorthMark Strategies LLC

    Dallas, TX
    5 days ago
  • NorthMark Compute and Cloud LLC in Texas is looking for an HPC Network Solutions Architect to design and optimize high-performance networking architectures. This role collaborates with customers, guiding them through the lifecycle of adopting cutting-edge network solutions... 
    Performance

    NorthMark Compute and Cloud LLC

    Dallas, TX
    5 days ago
  •  ...a highly motivated individual to join our HPC Scheduling team. The position focuses on developing and managing a high-performance compute (HPC) platform and emphasizes Kubernetes...  ...should have experience in software engineering, batch workloads, and high-performance computing... 
    Performance

    NorthMark Compute and Cloud LLC

    Dallas, TX
    5 days ago
  • A high-performance computing firm based in Texas is seeking an Emerging Network Architect to evaluate and integrate next-generation networking technologies for HPC platforms. The ideal candidate will focus on enhancing performance metrics and collaborating with vendors... 
    Performance

    NMC2

    Dallas, TX
    2 days ago
  •  ...technology firm in Dallas is seeking a motivated Software Engineer 1 to join its HPC Scheduling team. You will design and develop software...  ...experience in software engineering, and knowledge of high-performance computing. The ideal candidate will thrive in optimizing... 
    Performance

    NorthMark Strategies LLC

    Dallas, TX
    5 days ago
  •  ...seeking a motivated individual to join their HPC Scheduling team. This role involves designing...  ...ideal candidate should possess strong software engineering skills, experience with Kubernetes, and a passion for high-performance computing. Join an innovative team dedicated... 
    Performance

    NMC2

    Dallas, TX
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to HPC Performance and Validation Engineer. Be the first to apply!