Principal Software Engineer, E2E Performance and Goodput — CSP Engagements
$272k - $431.25kNVIDIA Gruppe
We're looking for a Principal Engineer to join our CSP Engagements team as the technical focal point for end-to-end performance, working directly with engineering teams of key CSP/hyperscale customers to ensure they achieve various performance targets on NVIDIA platforms. In this role, you will augment NVIDIA's performance and benchmark teams with a dedicated CSP-facing focus. You will drive work streams with CSP engineering teams to build shared understanding of platform performance characteristics, gather and incorporate their workload-specific feedback into NVIDIA's optimization priorities, and validate that performance targets are met in customer-representative configurations. Your cross-CSP visibility enables you to identify patterns and drive systemic improvements in documentation, configuration guidance, and tooling. What you'll be doing: Drive performance characterization work streams with engineering teams of key CSP/hyperscale customers — ensuring they understand platform performance expectations, profiling methodology, and tuning options for their specific workloads Gather and synthesize CSP performance feedback — identify gaps between expected and actual throughput, and champion optimization priorities back into NVIDIA's CUDA, NCCL, driver, and firmware teams Ensure key open-source performance and stress tools (STREAM, GPU Burn, GPU BLAST) are updated and validated for the latest NVIDIA rack-scale systems, GPU architectures, and CPU platforms — so customers and internal teams have reliable baseline measurements from day one Work closely with CSPs to ensure their own performance and validation tooling reflects the latest GPU capabilities, memory hierarchy changes, and platform-specific tuning parameters Conduct cross-CSP performance comparison and pattern analysis — identify configuration, software, or workload differences that explain performance gaps between deployments Collaborate with CSPs to ensure performance-related integration work (profiling infrastructure, benchmark harnesses, config validation) is ready ahead of deployment milestones Define test strategies and tooling requirements for performance validation — both for NVIDIA internal certification and customer acceptance What we need to see: 15+ years of experience in systems performance engineering, ideally in GPU/HPC/ML infrastructure. BS or MS in Computer Science, Computer Engineering, or related field (or equivalent experience) Proficiency in GPU workload profiling: nsight systems, nsight compute, DCGM metrics, or equivalent instrumentation Understanding of distributed training performance dynamics: computation/communication overlap, pipeline bubbles, memory bandwidth utilization, collective efficiency Statistical methods for performance analysis: regression detection, confidence intervals, A/B comparison at scale Understanding of how the full software stack impacts performance: driver overhead, collective algorithm selection, memory allocation, scheduling, firmware power management Strong data analysis and visualization skills (Python, pandas, dashboards). Customer obsession — genuine passion for understanding why customers aren't achieving expected performance and driving solutions Ability to communicate performance findings to both deep technical audiences and executive leadership Demonstrated success influencing multiple engineering teams to prioritize performance improvements Ways to stand out from the crowd: Experience profiling and optimizing distributed training at 1000+ GPU scale (Megatron‑LM, DeepSpeed, FSDP) Background in ML infrastructure performance at a CSP/hyperscaler Familiarity with NVIDIA platforms (DGX, HGX, NVLink topology) and profiling tools Experience building automated performance regression detection systems for production environments Understanding of inference workload performance dynamics (vLLM, TensorRT‑LLM, SGLang, continuous batching) Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 272,000 USD - 431,250 USD. You will also be eligible for equity and benefits. Applications for this job will be accepted at least until June 30, 2026. NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. #J-18808-Ljbffr NVIDIA Gruppe
$272k - $431.25k
Overview We're looking for a Principal Engineer to join our CSP Engagements team as the technical focal point for end-to-end performance, working directly with engineering teams of key... ...pattern analysis—identify configuration, software, or workload differences that explain...Performance$272k - $431.25k
We're looking for a Principal Software Engineer to join our CSP Engagements team as the technical focal point for GPU firmware and GPU system software, working... ...tenancy isolation, secure boot, attestation), and performance — and champion those priorities into NVIDIA's GPU...Performance$272k - $431.25k
We're looking for a Principal Software Engineer to join our CSP Engagements team as the technical focal point for fleet-scale reliability, working directly... ...groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our...Performance$272k - $431.25k
What you’ll be doing: Drive system software architecture alignment and technical deep dives, acting as the primary software engineering contact for NPI projects with key customers... ...experience in designing scalable, high‑performance server systems at the SW/HW interface....PerformanceShift work$272k - $431.25k
Overview We're looking for a Principal Software Engineer to join our CSP Engagements team as the technical focal point for rack-scale system software and firmware. In this role, you will collaborate with NVIDIA's cross-functional rack-scale system software and firmware...SuggestedShift work$184k - $287.5k
NVIDIA is seeking a Senior Software Engineer, NCCL and CUDA specialization to join our Cloud Service Provider (CSP) Engagements team, focusing on ML software stack functionality and performance for datacenter products such as GB300 and Vera Rubin. This role involves working...Performance$184k - $287.5k
Senior Software Engineer, Cloud-Native Stack - CSP Engagements page is loaded Senior Software Engineer, Cloud-Native Stack - CSP Engagements Apply locations US... ...of the CSP engagements team. What you’ll be doing: Perform deep-dive debugging of multi-rack, multi-tenant...PerformanceFull time$184k - $287.5k
## Lead Systems Software Test Engineer - CSP EngagementsApplylocations: US, CA, Santa Claratime type:... ...join our Cloud Service Provider (CSP) Engagements team, focusing on ML software stack... ...providers with next-generation high-performance training and inference platforms....PerformanceLocal area$184k - $287.5k
NVIDIA is seeking a Senior Firmware Engineer to join our CSP Engagements team, focusing on system software for Datacenter products such as GB200. This role combines... ...NVIDIA GPU firmware issues, power management, performance, and thermal control problems for data center...Performance- Overview NVIDIA is seeking a Senior Software Engineer to join our CSP Engagements team, focusing on system software for datacenter products such as GB200.... ...and applications focusing on AI/ML and HPC workloads. Perform advanced system debugging, root cause analysis, and performance...Performance
$272k - $431.25k
...world. At NVIDIA, as a Principal Rack Scale Systems Infrastructure Engineer, you will build and guide the development of software systems. These systems... ...internal deployments and CSP environments. Bridge... ...silicon, or other high‑performance computing systems. Expertise...PerformanceShift work$184k - $356.5k
NVIDIA Corporation is looking for a Lead Systems Software Test Engineer for its CSP Engagements team in Santa Clara, California. This role demands deep... ...engage with cloud service providers to ensure high-performance training and inference platforms, working at the intersection...Performance$168k - $258.75k
...Technical Program Manager to join the CSP Engagements team, focused on deep technical... ...systems and embedded software leaders—including software engineering managers, technical leads, or senior... ...technical topics, including bring up, performance, reliability, observability, and...Performance$200k
...management platforms for enterprise customer engagement. Trusted by the world’s most... ..., and scale are non‑negotiable. The Principal Software Engineer role exists to help us continue... ...Participate hands‑on in coding, debugging, performance optimization, and production issue...PerformanceShift work$272k - $431.25k
...new AI-powered application. We seek a Principal Software Engineer - AI Inference to advance open-source... ...inference runtime architecture, GPU performance engineering, and distributed systems.... ...and land PRs or equivalent experience, engage in development discussions, help...Performance$272k - $431.25k
...graphics, and accelerated computing. As a Principal Software Engineer, you will lead the transformation of... ...to manage complex customer engagements and help develop our product and architecture... ...design‑in, coding, bring‑up, performance tuning, failure analysis, and production...Performance$272k - $431.25k
...across multi-node distributed environments. Built in Rust for performance and Python for extensibility, Dynamo orchestrates GPU... ...of cutting-edge LLM workloads. We are seeking a Principal Systems Engineer to define the vision and roadmap for memory management of...PerformanceLocal areaRemote work- ...edge and cybersecurity to software-defined networking—for... ...commercialization success. As our Principal Distributed Systems Research Engineer, you won’t just be... ...real-time and/or high-performance applications. 7+ years... ...successfully actively engage with a highly distributed...PerformanceWork from homeHome officeFlexible hours
- ...Job Summary In the Layer-7 Security Software team, we are responsible for at least one... ...Application Identification and Content Inspection Engine runs on hardware, virtualized, container... ...firewall in terms of functionality and performance, working on GlobalProtect Own and be...Performance
$184k - $356.5k
NVIDIA Corporation is looking for a Senior Software Engineer specializing in NCCL and CUDA to join our Cloud Service Provider Engagements team in Santa Clara, California. You... ...with customers to address functional and performance challenges in our ML software stack for...Performance$272k - $431.25k
...Overview We are hiring senior engineers to work on the CUDA driver, a core component... ...programming model. Design and maintain performance and precision modeling. Write... ...experience). ~15+ years of relevant systems software development experience. ~ Strong C...Performance$231.4k - $331.8k
...digital world! What You'll Do As a Principal Software Engineer for the Silicon One Customer... ...local team in North America. You will engage with various internal teams within Cisco... ...policies. Employees on sales plans earn performance-based incentive pay on top of their...PerformanceFull timeTemporary workLocal areaWorldwideFlexible hours- NVIDIA Gruppe is looking for a Senior Firmware Engineer to join the CSP Engagements team. This role focuses on system software for data center products and requires deep technical expertise you will use to develop firmware solutions for server management and observability...
$172k - $349k
## Principal Software Engineer, Systems/Solutions TestApplylocations: Sunnyvale, California, United States of Americatime type: Full timeposted... ...and ensure reliability, scalability, resiliency, and performance across highly complex network environments.As part of Product...PerformanceWork experience placementWork at office2 days per week$248k - $391k
...Principal Software Development Engineer Specializing In Solid State Drives (Ssd) Are you ready to push the boundaries of what's possible? At nvidia... ...own the solid state drive selection process, storage performance optimization and ensuring operational excellence in...Performance$175k - $245k
...support the delivery of our new platform. Maintain the existing software components, OS related. Requirements: B.S./M.S. with 8+... ...-on experience with the Linux kernel, debugging, development, performance tuning, etc. Detailed knowledge of Linux kernel, scheduling...PerformanceFull timeWorldwide$100k - $150k
...Technologies is a forward-thinking software development company... ...'re looking for a skilled Principal Software Engineer to join our dynamic team... ...Bright Vision Technologies SOW engagement (no third-party client or... ...in system design, performance engineering, reliability architecture...PerformanceFull timeH1bLocal areaImmediate startRemote workVisa sponsorshipWork visa$165.8k - $307.9k
...is responsible for ensuring a software product meets its specified requirements and performs as intended by rigorously testing... ...lifecycle. As a Principal Software Developer in Test, you... ...role, you will represent quality engineering and verification on behalf of...PerformanceWork at officeLocal areaRelocation package$231.4k - $331.8k
...focused on bridging hardware and software-delivering seamless, high-performance networking solutions. Our team... ...role in coordinating across multiple engineering and business units, ensuring... ...with customer needs. We actively engage with customers to brainstorm new features...PerformanceFull timeTemporary workLocal areaFlexible hours- ...precision that drives great outcomes. Job Summary As a Principal Software Engineer within the Engineering team, you will drive the technical... .... Proven experience designing and developing high‑performance, high‑scale distributed software applications in a cloud...PerformanceFull timeWork at office
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Principal Software Engineer, E2E Performance and Goodput — CSP Engagements. Be the first to apply!
- principal software engineer Santa Clara, CA
- senior principal software engineer Santa Clara, CA
- principal Santa Clara, CA
- principal data scientist Santa Clara, CA
- principal cloud computing engineer Santa Clara, CA
- senior principal cloud computing engineer Santa Clara, CA
- principal architect Santa Clara, CA
- senior principal scientist Santa Clara, CA
- embedded software Santa Clara, CA
- software sales Santa Clara, CA


