Senior HPC Engineer
$175k - $250kMillennium Management Corp
Senior HPC Engineer
Millennium's Infrastructure organization designs, engineers, and operates a robust global computing platform supporting WorldQuant's quantitative research. We are seeking a Senior HPC Engineer to join our team in a senior, hands-on role building and evolving large-scale, high-throughput HPC and GPU platforms that underpin AI- and machine-learning-driven research. In this role, you will be part of a small, senior HPC team, taking end-to-end ownership of a significant area of the platform while collaborating closely with other subject-matter experts. You will be a systems-level engineer who is comfortable owning complex technical decisions and designing and building production infrastructure, rather than advising from the sidelines. We aim to build infrastructure that is reliable, understandable, and adaptable, and we value engineers who care about simplicity, clarity, and maintainability as much as raw performance. We recognize that strong candidates may bring different experiences, perspectives, and working styles. What you'll do- Design, build, and operate large-scale, high-throughput HPC and GPU clusters (for example, tens of thousands of CPU cores and hundreds of GPUs) supporting AI and machine-learning workloads.
- Collaborate with other HPC engineers and subject-matter experts to co-design system architectures, review designs, and share knowledge.
- Partner with storage specialists to architect and maintain high-performance, low-latency storage solutions, including parallel or scale-out file systems.
- Work closely with researchers, data scientists, and engineers to understand computational needs and translate them into effective, scalable system designs.
- Monitor, analyze, and optimize performance across compute, scheduling, networking, and storage layers.
- Build and maintain automation and infrastructure-as-code for provisioning, configuration, monitoring, and lifecycle management, with an emphasis on repeatability and simplicity.
- Participate in design reviews, operational discussions, and post-incident reviews with a focus on learning, collaboration, and system improvement rather than blame.
- Explore alternative approaches to scheduling, data layout, cluster architectures, and GPU utilization through small experiments or prototypes, using data to guide decisions.
- Produce clear documentation, diagrams, and reusable tooling that enable others to operate, debug, and extend the platform.
- Stay current with advancements in HPC, GPU computing, networking, and storage, and help assess where new technologies can add real value.
- Bachelor's degree in Computer Science, Engineering, or a related technical field; a Master's or PhD is a plus.
- Typically 7+ years of hands-on experience designing, building, and operating HPC or large-scale compute environments.
- Deep, practical experience with at least one major HPC scheduler (such as Slurm), including using it to operate large-scale or high-throughput clusters in production.
- Hands-on experience with GPU-accelerated computing, including NVIDIA GPUs and associated software ecosystems.
- Strong Linux systems engineering skills and comfort working close to the operating system, drivers, and hardware.
- Experience designing or operating high-performance storage systems, including parallel or scale-out file systems.
- Curious, evidence-driven problem solving, including experimenting with different approaches and using data to inform decisions.
- A collaborative working style that values listening, respectful discussion, and incorporating different perspectives - whether you are more quiet and reflective or more vocal in group settings.
- Clear written and verbal communication skills, and an ability to explain complex ideas in a way that works for different audiences.
- A strong sense of ownership for outcomes, paired with openness to feedback, learning, and evolving systems over time.
- Experience with Kubernetes, Run:ai, or other workload orchestration platforms alongside traditional HPC schedulers.
- Familiarity with Lustre, GPFS / Spectrum Scale, or similar high-performance storage technologies.
- Exposure to cloud-based HPC environments (e.g., GCP or other major cloud providers).
- Experience supporting quantitative research, finance, or other demanding compute-intensive workloads.
- Interest in applying AI or ML techniques to infrastructure (for example, optimization, anomaly detection, or predictive analysis).
Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Senior HPC Engineer in New York, NY vacancy
$230k - $300k
...Senior Detection Engineer For It Fluidstack operates the compute infrastructure powering frontier AI. The work running on it is among the most... ...Background in detection or security engineering at GPU compute, HPC, or other hyperscale infrastructure. Salary & benefits...SeniorLocal area- ...Principal Systems Engineer (HPC, Python/Go) New York, NY (Hybrid, 3 days in office) Highly competitive compensation package Join... ...at its source. Solve Deep Technical Challenges: Serve as a senior escalation point for complex Linux systems issues, diagnosing...SuggestedWork at office
- ...behaves predictably during failures, maintenance, and scaling events Improving how storage integrates with compute environments (GPU/HPC, Kubernetes, data pipelines) Driving faster and more reliable incident detection, resolution, and prevention Improving capacity planning...SeniorImmediate startRemote work
- ...message the job poster from EIT Professionals Corp Role: HPC Observability Engineer (Python, HPC) Location: Remote Contract Description: The client... ...Cloud Platform (good to have) Knowledge of Git (must) Seniority level: Mid-Senior level Employment type: Contract Job function...SuggestedContract workRemote work
- Framework Ventures is seeking a SOC Analyst III to enhance security posture by analyzing alerts and leading incident responses. Ideal candidates will have 4–6 years in security operations and strong skills in security monitoring, incident investigation, and threat hunting...Senior
- ...A leading AI marketing platform is seeking a Senior Software Engineer to shape their next-generation event streaming platform. This role will focus on developing high-throughput solutions that enhance messaging and personalization capabilities. Ideal candidates should...Senior
- ...A leading aerospace firm is seeking a Senior Principal Systems Engineer to design and analyze sophisticated satellite systems. This role offers fully remote opportunities across the United States and requires extensive experience in the aerospace sector, including systems...SeniorRemote work
- ...switching to Arista switch migration. The customer requires the deployment to be managed through Arista Cloud (so no API deployment). The engineer will work in a lab environment during equipment staging. During implementation, the engineer must be at the installation site to...Senior
- ...A micromobility leader is seeking a Principal Systems Engineer to innovate urban transport. This remote role requires 10+ years of development experience, mentorship capabilities, and strong technical decision-making. You will architect foundational platforms and lead...SeniorRemote workWorldwide
- ...Senior Hypervisor Engineer Jersey City, NJ Contract We are seeking a highly skilled Senior Hypervisor Engineer with extensive experience in open- source development and hypervisor technology. The Ideal candidate will be responsible for designing, implementing...SeniorContract workWork experience placement
- ...Job Description: We are looking for a Senior Low Latency Engineer to join our core technology team in New York. The ideal candidate will have hands-on experience building, optimizing, and maintaining ultra-low latency systems for real-time trading. You will work closely...Senior
- ...Overview Discover exciting DevOps job opportunities and connect with 28,396 DevOps professionals. Responsibilities As a Senior Infrastructure Engineer at Remotive, you will play a pivotal role in enhancing our cloud infrastructure, ensuring seamless deployment and...SeniorRemote work
- ...Cyberark Defender Certified - Senior Engineer Location: New York, NY (Onsite) Duration: W2 / C2C Contract Experience: 8+ Years Job Description Technical security implementation Minimum Cyberark Defender certified Preferred CCD Cyberark certified - Good...SeniorContract work
- ...related to macOS hardware, software, and integration with enterprise tools. Provide mentorship and technical guidance to junior engineers and IT support teams. Monitor system performance, create documentation, and generate reports on compliance and inventory....Senior
- ...EPAM Systems, Inc. is looking for a Lead HPC Network Engineer to pioneer architecture and engineering standards for cutting-edge AI and GPU infrastructure. The role entails overseeing the architectural vision for performant network fabrics, mentoring engineers, and defining...Senior
- Responsibilities The optimisation team is responsible for building out the automated decision systems that help scale DSP revenues without significant human intervention. As a core member, your code and systems will directly impact revenue daily. Qualifications 2-5 years...Senior
- ...Job description Our core mission at Railway is to make software engineers higher leverage. We believe that people should be given powerful tools so that they can spend less time setting up to do, and more time doing. Many infrastructure platforms simply focus on how you...SeniorMonday to Friday
$150k
...About the job Senior API Engineer Senior API Engineer needed for this well-funded start-up. The company are a next-generation market data provider. Your role will be to work with Engineers and Product teams to develop web backend and APIs for our core products...SeniorWork experience placementRemote work- ...Title: Senior DataStage ETL Engineer About the Role: We are seeking an experienced Senior DataStage ETL Engineer to join a dynamic team within the banking industry. As a senior member, you will be responsible for developing and maintaining ETL jobs, designing...SeniorLong term contract
- ...Position: Senior Data Engineer Location: Jersey City, NJ (2-3 times per week) Responsibilities Work with developers and architects to advise on standards and best practices. Good understanding of Database architectures and data models....Senior
$140k - $150k
...others along the way, come join the Broadridge team. Broadridge is growing! We are seeking an enthusiastic Sr. FIX Onboarding Engineer to support the onboarding and certification of external clients onto our electronic trading platforms. This role requires a strong...SeniorLocal area$160k - $220k
...QRT's culture of innovation continuously drives our ambition to deliver high quality returns for our investors. Senior Detection and Response Engineer at Qube Research & Technologies (QRT) will be tasked with improving and optimizing our capability to effectively monitor...Senior$61.29 - $94.49 per hour
...Senior Arista Switch Engineer Location: New York, NY Onsite Flexibility: Onsite Contract Details Position Type: Contract Contract Duration: 6 months Pay Rate: $61.29–$94.49 / Hour (USD) Shift / Schedule: 9 AM – 5 PM EST, Monday–Friday Work Authorization...SeniorContract workRemote workWork visaMonday to FridayShift work$150k - $240k
...architectures purpose-built for GPU-centric compute. We are looking for an Engineering Manager, Datacenter Storage Engineering to lead the team... ...experience with Lustre or similar parallel filesystems used in HPC and AI environments. End-to-End Performance Ownership: Drive...Remote workFlexible hours- ...Senior ICA/PKI Engineer Location: Charlotte or Denver Teammates in this role deliver moderately complex tools and systems that mitigate the risk of malicious cyber attacks. Individuals in this role contribute to the protection of system boundaries, keeping...Senior
$180k - $200k
...in a collaborative, fast-moving environment where trust and impact matter, you'll feel at home here. Aircall is hiring a Senior GRC Engineer to build and operate the engineering backbone of our Governance, Risk & Compliance program. You'll join the Security Engineering...SeniorWorldwide- ...generate insights that are reshaping how patients are diagnosed, monitored, and treated. About the Role We are seeking a Senior Information Security Engineer to maintain and evolve critical security monitoring, auditing, and automation processes. In this role, you’ll...SeniorRemote workWork from homeWorldwide
- ...Playlab is looking for a Senior Learning Engineer to lead the AI Lab Schools cohort. This role involves anchoring relationships with school teams, building bespoke AI tools, and mentoring regional Learning Engineers. You will have a significant impact on how 13 schools...SeniorRemote work
- ...about building creative, equitable futures for students and teachers, we hope you'll join us. The Role Playlab seeks a Senior Learning Engineer to anchor our work across NYC and the Northeast, home to some of the largest school systems, districts, and CMOs. This...SeniorWork at officeRemote workFlexible hours
- ...Overview Keeper Security is hiring an experienced Senior Vulnerability Engineer to design, build, and scale enterprise vulnerability management capabilities across our cloud, application, and corporate environments. This is a 100% remote position, with an opportunity to...SeniorRemote work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior HPC Engineer. Be the first to apply!
Related searches
- senior development executive New York, NY
- senior technical manager New York, NY
- senior medical writer New York, NY
- senior procurement specialist New York, NY
- senior software development engineer in test New York, NY
- senior communications specialist New York, NY
- senior manager data science New York, NY
- senior platform engineer New York, NY
- senior procurement New York, NY
- senior director product management New York, NY

