Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Lead Software Systems Engineer - GPU Performance

$170k - $300k
Full-time

Nebius

About Nebius: Nebius is leading a new era in cloud infrastructure for the global AI economy. We are building a full-stack AI cloud platform that supports developers and enterprises from data and model training through to production deployment, without the cost and complexity of building large in-house AI/ML infrastructure. Built by engineers, for engineers. From large-scale GPU orchestration to inference optimization, we own the hard problems across compute, storage, networking and applied AI. Listed on Nasdaq (NBIS) and headquartered in Amsterdam, we have a global footprint with R&D hubs across Europe, the UK, North America and Israel. Our team of 1,500+ includes hundreds of engineers with deep expertise across hardware, software and AI R&D. We are looking for a Lead Software Systems Engineer - GPU Performance to play a key role in building our hyperscaler platform, working across its core components while analyzing and optimizing the performance of large-scale GPU clusters at the intersection of hardware and software. You will operate across the full stack—from hardware and system software to networking (InfiniBand/RoCE), virtualization (KVM/QEMU), and distributed communication layers (e.g., MPI, NCCL). In this role you will * Focus on understanding system behavior across multiple layers, identifying performance bottlenecks, and driving improvements that shape how our clusters are built, operated, tuned, and validated. * Investigate and troubleshoot performance issues of GPU cluster under real workloads (training and inference) * Evaluate and integrate new hardware, system configurations and tuning approaches through software stack * Support complex performance-related escalations from internal teams and customers * Work closely with infrastructure, software engineering and hardware vendor teams (e.g. NVIDIA, Mellanox, Intel) * Contribute to hardware and cluster qualification (acceptance), ensuring systems meet performance expectations We expect you to have: * 5+ years of professional experience in system-level software development (focused on performance optimization, low-level programming). * 3+ years of hands-on experience with Linux systems (administration, troubleshooting, and performance tuning). * In-depth understanding of server architecture, including PCIe devices, NICs, Linux OS/Kernel, and high-performance computing (HPC) systems. * Strong proficiency in one or more performance-oriented programming languages (C/C++, Go, Python). We conduct coding interviews as part of the process. Key employee benefits: * Health insurance: 100% company-paid medical, dental and vision coverage for employees and families.

  • 401(k) plan: Up to 4% company match with immediate vesting.
  • Parental leave: 20 weeks paid for primary caregivers, 12 weeks for secondary
caregivers.
  • Remote work reimbursement: Up to $85/month for mobile and internet.
  • Disability & life insurance: Company-paid short-term, long-term and life
insurance coverage. Compensation We offer competitive salaries ranging from $170k-$300k OTE + equity based on your experience. Pay Transparency We offer competitive compensation and benefits packages. Actual compensation will be determined based on job-related factors, including experience, skills, qualifications, the level at which the candidate is hired, and geographic location, consistent with applicable law. Base Compensation Range

$170,000—$300,000 USD

Benefits & Perks:
  • Competitive compensation
  • Career growth and learning opportunities
  • Flexibility and ownership
  • Collaborative and innovative culture
  • Opportunity to work on impactful AI projects
  • International environment and talented teams
What's it like to work at Nebius: Fast moving - Bold thinking - Constant growth - Meaningful impact - Trust and real ownership - Opportunity to shape the future of AI Equal Opportunity Statement: Nebius is an equal opportunity employer. We are committed to fostering an inclusive and diverse workplace and to providing equal employment opportunities in all aspects of employment. We do not discriminate on the basis of race, color, religion, sex (including pregnancy), national origin, ancestry, age, disability, genetic information, marital status, veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by applicable law. Applicants must be authorized to work in the country in which they apply and will be required to provide proof of employment eligibility as a condition of hire. If you need accommodations during the application process, please let us know.

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Lead Software Systems Engineer - GPU Performance in United States vacancy
  • $152k - $157k

     ...can thrive. Company Overview A Lead Software Engineer (LSE) is recognized as an expert in their...  ...software components for large-scale systems with minimal oversight, This...  ...higher versions. Optimize application performance and scalability through effective database... 
    Performance
    Full time
    Work experience placement
    Seasonal work

    Dollar General

    Goodlettsville, TN
    more than 2 months ago
  • $150k - $300k

     ...Hudson River Trading (HRT) is looking for GPU Systems Engineers to help scale and evolve our...  ...scope, from HPC/AI cluster design and performance tuning, to troubleshooting and automation...  ...Test and deploy new hardware and software, and partner with vendors to resolve... 
    Performance
    Work at office
    Local area
    Immediate start

    Hudson River Trading

    New York, NY
    3 days ago
  • $100k - $150k

     ...Vision Technologies is a forward-thinking software development company dedicated to...  ...to grow, we’re looking for a skilled GPU Systems Engineer (CUDA) to join our dynamic team and contribute...  ..., GPU architecture, and high-performance computing to design and optimize compute... 
    Performance
    Full time
    H1b
    Local area
    Immediate start
    Remote work
    Visa sponsorship
    Work visa

    Bright Vision Technologies

    Norcross, GA
    5 days ago
  • Bright Vision Technologies is looking for a skilled GPU Systems Engineer (CUDA) to join their team remotely. This role focuses on designing and optimizing workloads for AI and high-performance computing using CUDA. The ideal candidate will have at least six years of experience... 
    Performance
    Remote job

    Bright Vision Technologies

    Norcross, GA
    3 days ago
  •  ...professional in New York to design and operate large-scale GPU infrastructure for model inference and reinforcement...  ...role demands several years of experience in deploying GPU systems, optimizing model performance, and working with frameworks like SGLang and Megatron. The... 
    Performance

    Reflection

    New York, NY
    5 days ago
  • NVIDIA in Santa Clara is looking for software engineers to join their Performance Lab. You will create cutting-edge GPU-accelerated workloads for the financial services industry, utilizing deep learning and benchmarking models on HPC clusters. The ideal candidate should... 
    Performance

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $124k - $195.5k

    NVIDIA Corporation in Santa Clara, CA is seeking a Hands-On Systems Engineer to ensure the performance and long-term health of their next-generation GPU platforms. You will collaborate with engineering teams to stabilize early production hardware and optimize system configurations... 
    Performance

    NVIDIA Corporation

    Santa Clara, CA
    5 days ago
  • NVIDIA Corporation, located in Santa Clara, CA, is seeking a Senior Systems Software Engineer focused on GPU Performance at Scale. This role entails leading performance practices in large-scale GPU infrastructure and aligning AI workloads with next-generation datacenter... 
    Performance

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  •  ...platform development engineering is seeking a highly driven GPU/CPU Platform System Engineer at the Principal...  ...engineers who lead the development and day...  ..., system integration, performance testing and characterization...  ...internal hardware and software development teams to... 
    Performance

    Ll Oefentherapie

    Seattle, WA
    4 days ago
  • $100k - $120k

     ...Robotics is looking for an experienced engineer to join their founding team,...  ...have substantial experience in systems programming (C/C++, assembly), expertise in GPU optimizations, and familiarity...  ...internals. Responsibilities include leading engineering teams, integrating... 
    Performance

    Coda Robotics

    San Francisco, CA
    2 days ago
  • $135.2k - $306.4k

     ...platform development engineering is seeking a highly driven GPU/CPU Platform System Engineer at the Principal...  ...engineers who lead the development and day...  ..., system integration, performance testing and characterization...  ...internal hardware and software development teams to... 
    Performance
    Temporary work
    Work experience placement
    Remote work
    Flexible hours

    Oracle

    Seattle, WA
    4 days ago
  •  ...edge AI infrastructure startup is seeking a Kubernetes DevOps Engineer to join their innovative team in San Francisco. The role...  ...Kubernetes clusters across various environments, focusing on high-performance GPU workloads. Ideal candidates will have deep Kubernetes... 
    Performance

    Jack & Jill/External ATS

    San Francisco, CA
    3 days ago
  • Apple Inc. in Cambridge, Massachusetts, is seeking a Software Engineer focused on GPU performance. In this role, you will develop the infrastructure for Apple GPUs, conduct performance analyses, and define driver software for enhanced GPU introspection capabilities. A... 
    Performance

    Apple Inc.

    Cambridge, MA
    2 days ago
  •  ...Group, LLP is seeking a Machine Learning Engineer in Bala Cynwyd, PA. This role focuses...  ...inference optimization for high-performance model serving systems. You will collaborate with researchers...  ...performance, evaluate frameworks, and debug GPU memory issues while managing... 
    Performance

    Susquehanna International Group

    Bala Cynwyd, PA
    5 days ago
  • $168k - $322k

    NVIDIA Gruppe is looking for a System Design Engineer to join the Graphics Product Team in Santa Clara...  ...In this role, you will develop NVIDIA GPU/Tegra based products while...  ...SW engineers to balance product cost, performance, and schedule. Candidates should have... 
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $121k - $194k

     ...immediate career opening for a Lead Systems Engineer. This opening is located at...  ...to manage its High Performance Computing (HPC) resources,...  ...significant experience with CPU/GPU based systems, high-performance...  ...systems; install software to support research; ensure... 
    Performance
    Immediate start

    The Center for Communications Research - CCR-P: Princeton

    Princeton, NJ
    4 days ago
  •  ...graduate with a B.S. or M.S. in Electrical Engineering for a role involving the development of system hardware products around GPU & Tegra SoC. This position requires strong...  ...across teams to balance product cost and performance, drive testing efforts, create schematics,... 
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $138k - $172.5k

     ...Lead Software Engineer, Agentic AI Systems Lehi, UT | Plano, TX At Collective Health, we're transforming how employers and their people engage with...  ...agents handle claims logic with high precision, performing Supervised Fine-Tuning on Gemini models to improve domain... 
    Performance
    Work at office
    Flexible hours
    Weekday work

    Collective Health

    Plano, TX
    1 day ago
  • $220k - $292k

     ..., Anduril is changing how military systems are designed, built and sold. Anduril...  ...contracts while simultaneously performing Robot-as-a-Service (RaaS) AUV operations...  ...THE JOB We are looking for a Lead Mission Systems Software Engineer to join our rapidly growing... 
    Performance
    Full time
    Work experience placement
    Work at office
    Immediate start
    Remote work
    Flexible hours

    Anduril Industries

    Boston, MA
    1 day ago
  • $87.1k - $157.45k

    Leidos, located in Bethesda, MD, seeks a Mid-Career Systems Engineer specializing in HPC & GPU Infrastructure. This on-site position involves designing...  ...in Linux systems, hardware architecture, and performance optimization. Strong communication skills and relevant... 
    Performance

    Leidos

    Bethesda, MD
    5 days ago
  • $200k - $322k

     ...self‑motivated senior engineer for the Aerial Omniverse...  ...devices, across systems of potentially thousands...  ...design and implement GPU kernels that apply time...  ...need to see: PhD in high‑performance computing, computer...  ...RAN platforms, L1/L2 software stacks, or channel emulators... 
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $160k - $253k

     ...accelerated computing is the engine of artificial...  ...platforms integrate high performance compute, networking, and a full-stack software ecosystem to power AI at...  ...in showcasing NVIDIA's GPU architecture, server-level...  ...accelerating AI workloads. System Architecture: Demonstrate... 
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $229.9k - $262.4k

     ...Senior Lead Software Engineer, Distributed Systems (Golang + Python on Kubernetes) Do you love building and pioneering in the technology space? Do you...  ...salary information is solely for candidates hired to perform work within one of these locations, and refers to the... 
    Performance
    Full time
    Part time
    Internship
    Local area

    Capital One Financial Corp

    Cambridge, MA
    6 days ago
  • A leading semiconductor company in Austin, Texas, is seeking a System Application Engineer to support Data Center GPU customers. This role involves interacting with OEM partners and internal...  ...skills, a passion for high-performance computing, and experience in Data Center... 
    Performance

    Advanced Micro Devices

    Austin, TX
    1 day ago
  • $161.8k - $242.6k

     ...Technologies, Inc. Job Area: Engineering Group, Engineering Group GPU ASICS Engineering...  ...microarchitecture and workload for performance and power optimizations...  ...field and 4+ years of Software Engineering, Hardware Engineering, Systems Engineering, or related work... 
    Performance
    Work experience placement
    Work from home

    Qualcomm

    San Diego, CA
    10 hours ago
  • $152k - $241.5k

     ...Gruppe is seeking an experienced engineer to join the Scheduling team to design and enhance GPU compute clusters for AI/ML...  ...years of relevant experience in system programming and batch scheduling...  ...cutting-edge technology, focusing on performance optimizations and automation... 
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $105k - $154k

     ...Electrical Critical Power Solutions Division is hiring a Lead Embedded Software Engineer - Real-Time Systems to join our growing team in Raleigh, NC. We offer...  ...development requirements factoring in cost, performance, and schedule. Establish system & sub-system level... 
    Performance
    Work experience placement
    Work at office
    Local area
    Remote work
    Relocation package

    Eaton

    Raleigh, NC
    6 days ago
  •  ...Manufacturing Co is seeking a Principal Software Engineer to join their team in Santa...  ...define technical direction, lead architectural reviews, and...  ...You will also focus on both performance and cost optimization to...  ...robust, efficient software systems. Join us in reshaping the... 
    Performance

    Dormont Manufacturing Co

    Santa Monica, CA
    3 days ago
  • Krämer IT Solutions GmbH sucht einen AI Engineer / DevOps für unsere Saar-Cloud in Deutschland. Du baust den Maschinenraum für die KI von morgen und optimierst unsere GPU-Cluster für bestmögliche Performance. Du hast Erfahrung mit Docker und Kubernetes, und deine Aufgaben... 
    Performance
    Remote job
    Flexible hours

    Server Eye

    New Bremen, OH
    4 days ago
  • Advanced Micro Devices in Santa Clara is seeking a senior software engineer committed to enhancing AI performance on GPUs. You will work on cutting-edge software...  ...software and hardware collaboration to optimize GPU operations. The ideal candidate brings substantial experience... 
    Performance

    Advanced Micro Devices

    Santa Clara, CA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Lead Software Systems Engineer - GPU Performance. Be the first to apply!