Senior GPU Clusters Platform & EngOps Engineer
NVIDIA Gruppe
NVIDIA Gruppe is seeking highly motivated EngOps and Platform Engineers to develop automated tools for managing large GPU clusters. This position requires strong expertise in high-performance computing and deep learning. The ideal applicants have a BS or MS in a relevant field and 8+ years of experience in cluster administration. A deep knowledge of automation tools such as Ansible and Python is also required. Join NVIDIA to help redefine the tech landscape with cutting-edge artificial intelligence solutions. #J-18808-Ljbffr
- ...Computing and Visualization. The GPU, our invention, serves as... ...Join our team of innovative engineers who develop and maintain... ...looking for highly motivated EngOps and Platform Engineers to boost execution... ...managing and maintaining large GPU clusters interconnected via NVLink...Senior
$152k - $287.5k
A leading technology company is seeking a Senior Software Engineer to develop solutions for GPU clusters aimed at enhancing machine learning innovation. The ideal candidate will have over 5 years of experience in software engineering with significant involvement in ML...Senior- ...NVIDIA Gruppe in Santa Clara is seeking a technical leader for the GPU AI/HPC Infrastructure team. You will design and implement cutting-edge GPU compute clusters, focusing on deep learning and high-performance computing. The ideal candidate will have at least 5+ years...Senior
$152k - $287.5k
...NVIDIA Gruppe, based in Santa Clara, is seeking a Senior Software Engineer to accelerate the development of machine learning innovations. In this role, you'll design and implement solutions for GPU clusters, enabling researchers to optimize their work. Strong expertise...Senior- ...NVIDIA Gruppe seeks a skilled HPC Cluster Engineer to design, deploy, and operate GPU Compute Clusters for high-performance computing workloads. This role involves collaboration with various teams to ensure effective and reliable cluster performance. Key responsibilities...Senior
$184k - $356.5k
NVIDIA Gruppe is seeking an experienced engineer to lead GPU cluster design and support for AI and HPC deployments in Santa Clara, California. The ideal candidate will have over 8 years of experience with large-scale GPU infrastructure and a strong ability to communicate...Senior- ...black.ai is looking for a skilled platform engineer in Palo Alto to enhance our AWS infrastructure... ...engineering, DevOps practices, and GPU workloads. As a platform engineer, you... ...workflows, ensure the reliability of GPU clusters, and own CI/CD pipelines, facilitating...Senior
$160k - $322k
...NVIDIA Gruppe in Santa Clara is seeking a Senior Technical Marketing Engineer focused on GPUs and scale-up architecture. The role involves showcasing NVIDIA's GPU architecture and server-level platforms, aiming to maximize performance for AI applications. The ideal candidate...Senior- ...NVIDIA Gruppe is looking for an experienced GPU Deployment Engineer to tackle end-to-end AI deployment challenges on the NVIDIA RTX AI platform. The role involves analyzing GPU-accelerated applications, improving user experiences, and collaborating with teams to influence...Senior
- ...Introduction We are hiring a senior engineer to design and deliver a BYOC (Bring Your Own Cloud) platform for Presto SaaS across Azure... ...strong plus), with a focus on GPU-enabled infrastructure. This role... .... • Experience with multi-cluster/multi-region platform design....Senior
- ...NVIDIA Gruppe is seeking a Principal AI and ML Infra Software Engineer to join our Hardware Infrastructure team in Santa Clara, CA. In... ...efficiency by addressing infrastructure deficiencies for GPU Clusters, fostering innovations in AI/ML research. The ideal candidate...
$272k - $431.25k
...NVIDIA Corporation seeks a Principal AI and ML Infra Software Engineer in Santa Clara, California, to enhance the efficiency of AI/ML research on GPU Clusters. The role involves collaboration with various teams, monitoring infrastructure performance, and implementing...$200k - $400k
...A dedicated research lab is seeking a Network Engineer to design and optimize low-latency, high-bandwidth networking solutions for AI supercomputing clusters. You will work on cutting-edge technologies in collaboration with world-class researchers. The ideal candidate...Senior$184k - $356.5k
...NVIDIA Corporation is seeking a Senior Software Engineer in Santa Clara to enhance the performance and reliability of large-scale AI infrastructures... ...distributed training workloads across NVIDIA’s GPU platforms. Ideal candidates should have extensive experience in large...Senior$272k - $431.25k
...Principal Ai And Ml Infra Software Engineer, Gpu Clusters We are seeking a Principal AI and ML Infra Software Engineer, GPU Clusters at NVIDIA... ..., Go, Bash, as well as familiarity with cloud computing platforms (e.g., AWS, GCP, Azure) in addition to experience with...- ...A leading technology firm is seeking a Senior Engineer to design and deliver a BYOC (Bring Your Own Cloud) platform for Presto SaaS across Azure and AWS. The ideal candidate... ...for production-grade deployments, focusing on GPU-enabled infrastructure and Kubernetes/OpenShift...Senior
$184k - $356.5k
...developer to design and implement systems for GPU based Client products. This role... ...in UEFI/BIOS development on X86 or ARM platforms, along with a strong background in C/C++... ...experience and a Bachelor’s Degree in Electrical Engineering or Computer Science. The compensation...Senior$160k - $200k
...PlusAI, based in Silicon Valley, is seeking a Senior ML Infrastructure Engineer to design scalable architectures for machine learning models. This... ...role involves building robust data pipelines, managing GPU clusters, and collaborating with cross-functional teams....Senior- ...Job Description We are hiring a Senior Platform Engineer to join the Autonomous Vehicle (AV) Cloud Engineering... ...the lifecycle of production‑grade clusters. Strong proficiency in software... ...offs between hardware‑level performance (GPU passthrough) and clean cloud...SeniorWork experience placementLocal area
$152k - $241.5k
..., and Visualization. Our invention—the GPU—functions as the visual cortex of modern... ...vehicles. We are now looking for a ML Platform Engineer to help accelerate the next era of machine... ...across large-scale, distributed GPU clusters. Apply SRE principles to diagnose,...Senior- ...hire a deeply technical, creative, and Senior AI Platform Engineer to build, support, and maintain the... ...ML infrastructure across cloud‑native clusters and on‑premises hardware. Design and implement... ..., including MLOps, model serving, and GPU‑accelerated environments. Experience...Senior
- ...NVIDIA Gruppe is seeking experienced Senior Software Engineers to join their production engineering team in Santa Clara, California. The role... ...involves building automation and operational systems for GPU clusters, with a focus on Kubernetes and reliability practices....Senior
- ...Crusoe is seeking a Virtualization Validation Engineer in Sunnyvale, California, responsible for the end-to-end validation of large-scale GPU clusters. The role involves executing multi-node scaling tests, validating high-speed interconnects, and benchmarking collective...Senior
- ...NVIDIA Gruppe in Santa Clara is looking for a Senior HPC Architect to support the deployment of large-scale GPU compute clusters. You will provide engineering solutions for GPU computing products, ensuring technical relationships with teams and assisting in creative solutions...Senior
- ...NVIDIA Corporation is seeking a Senior HPC Architect to enhance GPU compute clusters. This role involves designing solutions for operationalizing NVIDIA products and collaborating closely with engineering teams. Ideal candidates should have over 8 years of experience...Senior
- ...NVIDIA Gruppe is seeking a Data Analyst to join their GPU-accelerated cluster team. In this role, you will analyze complex datasets to drive application and platform improvements while applying machine learning and deep learning techniques to derive actionable insights...Senior
- ...Sanas, located in Palo Alto, California, is seeking an experienced Platform Engineer to build and operate a hybrid infrastructure for advanced AI/ML research and product development. You will architect and maintain the computing platform using Kubernetes and AWS, lead...Senior
$148k - $287.5k
...Labs in Santa Clara, California, is seeking a motivated Performance Engineer to advance communication libraries for deep learning and HPC. You will conduct in-depth performance analysis on multi-GPU clusters, collaborate with dynamic teams, and evaluate proof-of-concepts....Senior$152k - $241.5k
NVIDIA Gruppe is seeking an experienced engineer to join the Scheduling team to design and enhance GPU compute clusters for AI/ML workloads. Candidates should have a Bachelor's degree in Computer Science and 5+ years of relevant experience in system programming and batch...Senior$152k - $241.5k
NVIDIA Gruppe is seeking a talented individual to optimize and benchmark GenAI inference using the latest acceleration technologies. The role involves driving industry benchmark results and architecting distributed inference systems. Required qualifications include a relevant...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior GPU Clusters Platform & EngOps Engineer. Be the first to apply!
- client platform engineer Santa Clara, CA
- platform engineer Santa Clara, CA
- senior platform engineer Santa Clara, CA
- platform engineering manager Santa Clara, CA
- data platform engineer Santa Clara, CA
- platform developer Santa Clara, CA
- senior cost analyst Santa Clara, CA
- senior computer engineer Santa Clara, CA
- senior development engineer Santa Clara, CA
- senior manager quality engineering Santa Clara, CA

