Senior GPU Supercomputer Scheduler Engineer
$152k - $241.5kNVIDIA Gruppe
NVIDIA is a pioneer in accelerated computing, known for inventing the GPU and driving breakthroughs in gaming, computer graphics, high-performance computing, and artificial intelligence. Our technology powers everything from generative AI to autonomous systems, and we continue to shape the future of computing through innovation and collaboration. Within this mission, our team, Managed AI Research Superclusters (MARS), builds and scales the infrastructure, platforms, and tools that enable researchers and engineers to develop the next generation of AI/ML systems. By joining us, you’ll help design solutions that power some of the world’s most advanced computing workloads. As a member of the Scheduling team, you will participate in the design and implementation of groundbreaking GPU compute clusters that run demanding deep learning, high performance computing, and computationally intensive workloads. We seek engineers with deep technical expertise to identify architectural directions and new approaches for AI workload scheduling to serve many simultaneous and large multi-node GPU workloads with complex requirements and dependencies. This role offers you an excellent opportunity to deliver production grade solutions, get hands on with ground-breaking technology, and work closely with technical leaders solving some of the biggest challenges in machine learning, cloud computing, and system co-design. What you'll be doing: Design and develop new scheduling features and add-on services to improve GPU compute clusters across many dimensions, such as resource usage fairness, GPU occupancy, GPU waste, application resilience, application performance and power usage. Design and develop batch workload management and orchestration services Provide support to staff and end users to resolve batch scheduler issues Build and improve our ecosystem around GPU-accelerated computing Performance analysis and optimizations of deep learning workflows Develop large scale automation solutions Root cause analysis and suggest corrective action for problems large and small scales Finding and fixing problems before they occur What we need to see: Bachelor's degree in Computer Science, Electrical Engineering or related field or equivalent experience 5+ years of work experience Strong understanding of batch scheduling, preferably with experience in schedulers such as SLURM or K8s batch schedulers (Kueue, Volcano, etc.) Significant experience in systems programming languages such as C/C++ & Go as well as scripting languages such as Python and bash Established experience in Linux operating system, environment and tools Experience analyzing and tuning performance for a variety of AI workloads In-depth understating of container technologies like Docker, Singularity, Podman Flexibility/adaptability for working in a dynamic environment with different frameworks and requirements Excellent communication, interpersonal and customer collaboration skills Ways to stand out from the crowd: Knowledge in High-performance computing Open Source Software Contribution Experience with deep learning frameworks like PyTorch and TensorFlow Passionate about SW development processes Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4. You will also be eligible for equity and benefits. Applications for this job will be accepted at least until February 24, 2026. This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. #J-18808-Ljbffr NVIDIA Gruppe
$152k - $241.5k
NVIDIA Gruppe is seeking an experienced engineer to join the Scheduling team to design and enhance GPU compute clusters for AI/ML workloads. Candidates should have a Bachelor's degree in Computer Science and 5+ years of relevant experience in system programming and batch...Senior$272k - $431.25k
...software professional to work on the CUDA Driver, an essential part of our platform for accelerating general purpose computation on the GPU. This role involves delivering features to enhance NVIDIA hardware for various computational workloads including AI, data science,...Senior- NVIDIA Gruppe is seeking an experienced Compiler Engineer in Santa Clara to design and optimize compiler passes and infrastructure for GPU kernels. You'll work with a dynamic team and be involved in architecture decisions while collaborating across various teams. The ideal...Senior
$184k - $287.5k
NVIDIA is seeking a Senior Systems Software Engineer focusing on GPU Performance at Scale. This role involves driving innovation in AI and GPU computing, collaborating with developers and researchers to enhance system workflows. Key duties include leading performance practices...Senior$152k - $287.5k
NVIDIA Corporation is seeking a motivated Performance Engineer to enhance the roadmap of communication libraries. In this role, you will conduct in-depth performance characterization on multi-GPU clusters and analyze the interaction of libraries with hardware and software...Senior- NVIDIA Gruppe is seeking a Senior Deep Learning Software Engineer focused on LLM performance in Santa Clara. You will optimize GPU-accelerated software for large language model deployment, working on performance tuning for various models. The ideal candidate has over 8...Senior
- Intel Corporation is seeking a Senior Compiler Engineer to develop and optimize compiler software for next-generation GPU architectures. The role involves collaborating on cutting-edge compiler technologies that enhance AI and high-performance computing performance. The...Senior
$207k - $300k
Google is seeking an experienced AI/ML Software Engineer to enhance GPU architectures and optimize performance benchmarks. The role involves collaborating with teams to solve ML model challenges and architect transformative AI solutions, contributing to Google's machine...Senior- NVIDIA Corporation, located in Santa Clara, CA, is seeking a Senior Systems Software Engineer focused on GPU Performance at Scale. This role entails leading performance practices in large-scale GPU infrastructure and aligning AI workloads with next-generation datacenter...Senior
$184k - $356.5k
NVIDIA Gruppe is seeking a Senior System Software Engineer in Santa Clara, California, focused on developing GPU system software components. The ideal candidate will have over 8 years of experience in software development, particularly in low-level firmware and operating...Senior$136k - $258.75k
NVIDIA is seeking a creative and self-driven Product Engineer in Santa Clara, California. This role focuses on transferring GPU products from design to mass production, requiring collaboration with multiple teams for product optimization. Ideal candidates have over 5 years...Senior- NVIDIA Gruppe seeks a Senior Developer Technology Engineer in Santa Clara to drive GPU-acceleration innovations for high-performance databases. This role involves researching memory management techniques and optimizing data-intensive applications for modern architectures...Senior
- NVIDIA Gruppe is seeking a Senior Formal Verification Engineer for GPU Kernels, focused on creating verification tools that ensure correct behavior in various environments. This role involves designing verification tools, integrating AI into workflows, and participating...Senior
- NVIDIA Corporation is seeking a Senior Memory Controller Verification Engineer to join their GPU Memory Subsystem IP verification team in Santa Clara, CA. The role involves developing verification infrastructure, ensuring code coverage, and collaborating with design and...Senior
- NVIDIA Gruppe in Santa Clara is seeking a Senior Validation Engineer for the DGX Server Product Engineering Team. In this role, you will work closely... .../SW engineers to develop automated test plans for leading GPU computing products. Responsibilities include system...Senior
- Crusoe is seeking a Virtualization Validation Engineer in Sunnyvale, California, responsible for the end-to-end validation of large-scale GPU clusters. The role involves executing multi-node scaling tests, validating high-speed interconnects, and benchmarking collective...Senior
$168k - $322k
NVIDIA Gruppe is seeking a System Design Engineer to join their System Product Team in Santa Clara, California. In this role, you will review... ...and collaborate on board designs for NVIDIA products including GPU cards and automotive systems. The ideal candidate will possess...Senior$184k - $287.5k
...to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving... ...and the world's best talent. We are now looking for a Senior System Software Engineer! NVIDIA is searching for a world-class system software engineer...SeniorWork experience placementLocal areaRemote work$184k - $356.5k
...developer in Santa Clara, California to design and implement innovations for GPU-based client products. This role requires 8+ years of experience, a Bachelor's degree in Electrical Engineering or Computer Science, and a solid understanding of UEFI architecture and firmware...Senior- NVIDIA Gruppe in Santa Clara is hiring for a role in their Hardware Infrastructure EDA Compute team to optimize workload scheduling systems and improve overall service reliability. The successful candidate will manage and scale job scheduling systems while driving measurable...Senior
$160k - $322k
NVIDIA Gruppe in Santa Clara is seeking a Senior Technical Marketing Engineer focused on GPUs and scale-up architecture. The role involves showcasing NVIDIA's GPU architecture and server-level platforms, aiming to maximize performance for AI applications. The ideal candidate...Senior$136k - $218.5k
...NVIDIA, we push the boundaries of computing innovation. Our ASIC Verification Engineers focus on developing the world’s top SoCs and GPUs. Joining us as a Senior ASIC Verification Engineer - GPU means working on modern technology powering consumer graphics and AI...Senior$184k - $356.5k
NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves... ...high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming experience...Senior$184k - $287.5k
Overview We are looking for a Senior Formal Verification Engineer for GPU Kernels. NVIDIA's Deep Learning Safety Team is hiring engineers to build verification tools that prove GPU kernels behave correctly, enabling their deployment in a wide range of environments, including...SeniorWork experience placement- NVIDIA Corporation is seeking a Senior Software Engineer to join its DGX Cloud Production Engineering team in Santa Clara, CA. This role focuses... ...building automation and operational systems for large-scale GPU clusters, ensuring reliability and scalability. The ideal candidate...Senior
$200k - $322k
We are seeking a self‑motivated senior engineer for the Aerial Omniverse Digital Twin team. This hire will own the design and implementation... ...member of NVIDIA's Aerial team, you will design and implement GPU kernels that apply time‑varying, multi‑antenna channels to OFDM...Senior$160k - $253k
..., and NVIDIA accelerated computing is the engine of artificial intelligence. Our data center... ...power AI at scale. We are looking for a Senior Technical Marketing Engineer focused on GPUs... ...role is pivotal in showcasing NVIDIA's GPU architecture, server-level platforms, and...Senior$136k - $218.5k
NVIDIA is seeking outstanding Senior Design Verification Engineers with a specialty in tools and automation to drive efficiency and collaboration... ...graphics to artificial intelligence to self-driving cars and supercomputers. Our DV infrastructure and methodology team automates,...Senior$179k - $218k
...Senior Staff Data Center Operations Engineer, GPU Hardware Architecture Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we own and operate each layer...SeniorTemporary work- NVIDIA Gruppe in Santa Clara is looking for a motivated Senior Timing Engineer (Circuits) to join its Circuit Solutions Group. The candidate will work on timing analysis and signoff for innovative processor designs. Ideal candidates should have over 6 years of experience...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior GPU Supercomputer Scheduler Engineer. Be the first to apply!
- medical scheduler no experience Santa Clara, CA
- bilingual medical scheduler Santa Clara, CA
- administrative assistant scheduler - remote Santa Clara, CA
- senior game producer Santa Clara, CA
- senior manager process engineering Santa Clara, CA
- senior manufacturing engineer Santa Clara, CA
- senior manager clinical operations Santa Clara, CA
- senior optical engineer Santa Clara, CA
- senior lead project manager Santa Clara, CA
- senior manager quality engineering Santa Clara, CA

