Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Lead Software Systems Engineer - GPU Performance

$170k - $300k

Nebius

About Nebius:

Nebius is leading a new era in cloud infrastructure for the global AI economy. We are building a full-stack AI cloud platform that supports developers and enterprises from data and model training through to production deployment, without the cost and complexity of building large in-house AI/ML infrastructure.

Built by engineers, for engineers. From large-scale GPU orchestration to inference optimization, we own the hard problems across compute, storage, networking and applied AI.

Listed on Nasdaq (NBIS) and headquartered in Amsterdam, we have a global footprint with R&D hubs across Europe, the UK, North America and Israel. Our team of 1,500+ includes hundreds of engineers with deep expertise across hardware, software and AI R&D.

We are looking for a Lead Software Systems Engineer - GPU Performance to play a key role in building our hyperscaler platform, working across its core components while analyzing and optimizing the performance of large-scale GPU clusters at the intersection of hardware and software.

You will operate across the full stack-from hardware and system software to networking (InfiniBand/RoCE), virtualization (KVM/QEMU), and distributed communication layers (e.g., MPI, NCCL).

In this role you will
  • Focus on understanding system behavior across multiple layers, identifying performance bottlenecks, and driving improvements that shape how our clusters are built, operated, tuned, and validated.
  • Investigate and troubleshoot performance issues of GPU cluster under real workloads (training and inference)
  • Evaluate and integrate new hardware, system configurations and tuning approaches through software stack
  • Support complex performance-related escalations from internal teams and customers
  • Work closely with infrastructure, software engineering and hardware vendor teams (e.g. NVIDIA, Mellanox, Intel)
  • Contribute to hardware and cluster qualification (acceptance), ensuring systems meet performance expectations
We expect you to have:
  • 5+ years of professional experience in system-level software development (focused on performance optimization, low-level programming).
  • 3+ years of hands-on experience with Linux systems (administration, troubleshooting, and performance tuning).
  • In-depth understanding of server architecture, including PCIe devices, NICs, Linux OS/Kernel, and high-performance computing (HPC) systems.
  • Strong proficiency in one or more performance-oriented programming languages (C/C++, Go, Python).
We conduct coding interviews as part of the process.

Key employee benefits:
  • Health insurance: 100% company-paid medical, dental and vision coverage for employees and families.
  • 401(k) plan: Up to 4% company match with immediate vesting.
  • Parental leave : 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers.
  • Remote work reimbursement: Up to $85/month for mobile and internet.
  • Disability & life insurance: Company-paid short-term, long-term and life insurance coverage.
Compensation

We offer competitive salaries ranging from $170k-$300k OTE + equity based on your experience.

Pay Transparency

We offer competitive compensation and benefits packages. Actual compensation will be determined based on job-related factors, including experience, skills, qualifications, the level at which the candidate is hired, and geographic location, consistent with applicable law.

Base Compensation Range

$170,000-$300,000 USD

Benefits & Perks:
  • Competitive compensation
  • Career growth and learning opportunities
  • Flexibility and ownership
  • Collaborative and innovative culture
  • Opportunity to work on impactful AI projects
  • International environment and talented teams

What's it like to work at Nebius:

Fast moving - Bold thinking - Constant growth - Meaningful impact - Trust and real ownership - Opportunity to shape the future of AI


Equal Opportunity Statement:

Nebius is an equal opportunity employer. We are committed to fostering an inclusive and diverse workplace and to providing equal employment opportunities in all aspects of employment. We do not discriminate on the basis of race, color, religion, sex (including pregnancy), national origin, ancestry, age, disability, genetic information, marital status, veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by applicable law.

Applicants must be authorized to work in the country in which they apply and will be required to provide proof of employment eligibility as a condition of hire.


If you need accommodations during the application process, please let us know.
Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Lead Software Systems Engineer - GPU Performance in United States vacancy
  • NVIDIA Corporation, located in Santa Clara, CA, is seeking a Senior Systems Software Engineer focused on GPU Performance at Scale. This role entails leading performance practices in large-scale GPU infrastructure and aligning AI workloads with next-generation datacenter... 
    Performance

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  •  ...GPU Systems Engineer (CUDA) Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses...  ...CUDA programming, GPU architecture, and high-performance computing to design and optimize compute-... 
    Performance
    Full time
    H1b
    Immediate start
    Remote work
    Visa sponsorship

    Bright Vision Technologies

    United States
    12 hours ago
  •  ...ID: 2684 Standard Title: Senior GPU Systems Engineer Required Security Clearance: Top Secret...  ...define and optimize architectures for performance, power efficiency, and required...  ...improve efficiency across hardware and software layers. Build and maintain debugging... 
    Performance
    Hourly pay
    Contract work
    Temporary work
    Immediate start
    Flexible hours
    Shift work

    Base2 Solutions

    Bethesda, MD
    12 hours ago
  • $200k - $300k

     ...our own, taking pride in the systems we build and the trust we...  ...About the Role As a System Engineer, GPU Fleet, you will manage, operate...  .... Ensure high availability, performance, and reliability of GPU server...  ..., and application teams Lead post‑incident reviews, document... 
    Performance
    Local area

    Fluidstack

    Seattle, WA
    12 hours ago
  • $181k - $248.5k

     ...propulsion, manufacturing, software, avionics, or a...  ...exploration across our solar system. Its mission is to...  ...the Role: Own the GPU compute environment for...  ..., job scheduling, and performance optimization —...  ...Science or Electrical Engineering and 5+ years of relevant... 
    Performance
    Shift work

    Relativity Space

    Long Beach, CA
    26 days ago
  • $100k - $120k

     ...Robotics is looking for an experienced engineer to join their founding team,...  ...have substantial experience in systems programming (C/C++, assembly), expertise in GPU optimizations, and familiarity...  ...internals. Responsibilities include leading engineering teams, integrating... 
    Performance

    Coda Robotics

    San Francisco, CA
    12 hours ago
  • $195.2k - $292.8k

     ...Technologies, Inc. Job Area: Engineering Group, Engineering Group GPU ASICS Engineering General Summary: GPU System Driver Team are looking for talented software engineers to develop in-...  ...to verify GPU function and performance on simulator/emulator, and... 
    Performance
    Work experience placement

    Qualcomm

    Nacogdoches, TX
    1 day ago
  • $195k - $255k

     ...Senior Systems Engineer (Virtualization / GPU Infrastructure) Laurel, MD We're seeking a Senior Systems Engineer to support our U.S. Government...  ...virtualization and infrastructure requirements Perform lab configuration management activities and maintain system... 
    Performance
    Immediate start
    Remote work
    Flexible hours

    EnDepth Solutions LLC

    Laurel, MD
    3 days ago
  • $160k - $230k

     ...Systems Research Engineer, GPU Programming San Francisco About the Role As...  ...architecture to enhance the performance and efficiency of our AI...  ...Collaborating with the hardware and software teams, you will contribute...  .... We have contributed to leading open-source research,... 
    Performance
    Full time
    Remote work

    Together AI

    San Francisco, CA
    2 days ago
  • $140k - $225k

     ...Systems Engineer - Graphics Processing Unit (GPU) Absolute Business Solutions Corp (ABSC) is not just another tech...  ...-site position. All work must be performed at the customer site in Bethesda...  ...efficiency across hardware and software layers. Tooling and Automation... 
    Performance
    Contract work

    Absolute Business Solutions Corp

    Bethesda, MD
    3 days ago
  • $165k - $180k

     ...automated 3D ultrasound system. To succeed...  ...The Imaging Engineer at iSono Health will...  ...intersection of hardware and software and AI (can learn)...  ...excellent field performance, high reliability,...  ...domains. Lead hands-on...  ...Nvidia and other GPU platforms (e.g., Jetson... 
    Performance

    iSono Health

    Sunnyvale, CA
    3 days ago
  • $160k - $240k

     ...rest data in a single AI-native engine, combining the speed and...  ...We're looking for talented Software Engineers to join our team and...  ...development of StarRocks, our high-performance SQL engine purpose-built for...  ...developing advanced database systems and enjoy solving challenging... 
    Performance
    Remote work

    CelerData, Inc.

    Menlo Park, CA
    4 days ago
  •  ...Lead Software Engineer - Content Systems Technology New York Full-Time Fully Remote #WeAreParamount on a mission to unleash the power of content…...  ...responsibilities, including: Design, build, and maintain high-performance, scalable APIs that power content management, metadata... 
    Performance
    Full time
    Contract work
    Remote work

    Paramount Global Services

    United States
    3 days ago
  •  ...brands optimize warehouse performance while supporting...  ...mechanical design, resilient software and a deep...  ...means contributing to the systems used daily in warehouses...  ...Software Integration team, leading technical training and...  ...in a similar software engineering or software... 
    Performance
    Work at office
    Local area

    Exotec

    Atlanta, GA
    12 hours ago
  • $175k - $220k

     ...Lead Software Engineer - Retail Systems San Ramon, California, United States Mindful movement. It's at the core of why we do what we do at ALO...  ...including but not limited to location, experience, and performance. As such, on occasion and when applicable, there is... 
    Performance

    ALO Yoga

    San Ramon, CA
    1 day ago
  • $138k - $172.5k

     ...Lead Software Engineer, Agentic AI Systems Collective Health is the leading health benefits platform that brings together medical, dental, vision,...  ...ensure agents handle claims logic with high precision, performing Supervised Fine-Tuning on Gemini models to improve domain... 
    Performance
    Work at office
    Flexible hours
    Weekday work

    Softbank Investment Advisers

    Lehi, UT
    1 day ago
  • $161.8k - $242.6k

     ...Technologies, Inc. Job Area: Engineering Group, Engineering Group GPU ASICS Engineering...  ...microarchitecture and workload for performance and power optimizations...  ...field and 4+ years of Software Engineering, Hardware Engineering, Systems Engineering, or related work... 
    Performance
    Work experience placement
    Work from home

    Qualcomm

    San Diego, CA
    12 hours ago
  • $158.1k - $213.9k

     ...future with us. The Boeing Company is looking for a Lead Software Systems Engineer to join our Space Mission Systems (SMS) team, focused...  ...requirements and models that meet customer, operational and performance requirements and have clear traceability to design, code... 
    Performance
    Permanent employment
    Relocation
    Visa sponsorship
    Work visa
    Relocation package
    Flexible hours
    Shift work
    Day shift

    The Boeing Company

    Chantilly, Loudoun County, VA
    4 days ago
  • $121.5k - $224.88k

     ...Lead Software Engineer, Engine Systems We need you The minions of hell are growing stronger… Join us as we continue to shape the Diablo universe...  ..., including streaming, asset loading, job scheduling, performance systems, platform support, and key runtime systems... 
    Performance
    Full time
    Temporary work
    Part time
    Local area
    Remote work
    Relocation package
    Flexible hours

    Blizzard Entertainment

    Albany, NY
    12 hours ago
  •  ...help build the platform engineers turn to to ship AI...  ...the global operating system for distributed, heterogeneous...  ...engineers to lead our GPU Networking efforts,...  ...configuration to architect the software fabric that unifies...  ...validate networking performance on bleeding-edge... 
    Performance
    Flexible hours

    Baseten

    New York, NY
    1 day ago
  • $116.2k - $343.6k

     ...a talented and enthusiastic Lead Engine System Engineer to join our studio...  ...engine systems and focus on performance and optimization Work with...  ...tools to identify CPU and GPU performance issues Evolve...  ...performing in a senior or principal software engineering role or... 
    Performance
    Relocation package

    LightSpeed Studios

    Irvine, CA
    7 days ago
  • $116.2k - $343.6k

     ...Lead Engine System Engineer LightSpeed LA is seeking a talented and enthusiastic...  ...engine systems and focus on performance and optimization Work...  ...tools to identify CPU and GPU performance issues Evolve...  ...in a senior or principal software engineering role or equivalent... 
    Performance
    Relocation package

    Lightspeed Studios

    Irvine, CA
    12 hours ago
  • Krämer IT Solutions GmbH sucht einen AI Engineer / DevOps für unsere Saar-Cloud in Deutschland. Du baust den Maschinenraum für die KI von morgen und optimierst unsere GPU-Cluster für bestmögliche Performance. Du hast Erfahrung mit Docker und Kubernetes, und deine Aufgaben... 
    Performance
    Remote job
    Flexible hours

    Server Eye

    New Bremen, OH
    2 days ago
  • $244.8k

     ...developer infrastructure engineering team at ByteDance. Our...  ...-quality features and systems to our users. We aim...  ...systems enabling software development streamline...  ...Identify and resolve performance and scalability issues...  ...with great people. We lead with curiosity, humility... 
    Performance
    Temporary work
    Local area
    Remote work

    ByteDance

    San Jose, CA
    3 days ago
  • $272k - $431.25k

    NVIDIA Gruppe is seeking software engineers in Santa Clara to develop next-generation high-speed...  ...the lifecycle of GPUs and high-performance computing servers. The ideal candidate...  ...skills, and knowledge of networking and systems software. A base salary between 272,00... 
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  •  ...to PCs, gaming and embedded systems. Grounded in a culture of innovation...  ...Graphics Processing Units (GPU’s). Our team plays a major...  ...essential to maximize the performance from our GPU’s in an efficient...  ...performance CPUs/GPUs/APUs. The engineer will work with cross‑... 
    Performance

    Advanced Micro Devices

    Austin, TX
    1 day ago
  • $105k - $154k

     ...Lead Embedded Software Engineer – Real-Time Systems Eaton's Electrical Critical Power Solutions Division is hiring a Lead Embedded Software Engineer –...  ...product development requirements factoring in cost, performance, and schedule. Establish system & sub-system level... 
    Performance
    Work experience placement
    Work at office
    Remote work
    Relocation package

    Eaton Plc

    Raleigh, NC
    12 hours ago
  •  ...artificial intelligence, and software-defined networking to...  ...prestigious awards, such as Best Engineering Team, Best Company for...  ...highest standards of quality and performance in everything we do....  ...looking for world-class Senior/Lead Network Systems software engineers. Network... 
    Performance
    Work experience placement

    Arista Networks Inc

    Austin, TX
    3 days ago
  • $160k - $322k

     ...Santa Clara is seeking a Senior Technical Marketing Engineer focused on GPUs and scale-up architecture. The role involves showcasing NVIDIA's GPU architecture and server-level platforms, aiming to maximize performance for AI applications. The ideal candidate will have at... 
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $240k - $280k

     ...Lead Principal Software Engineer/Systems Architect New York SoundCloud empowers artists and fans to connect and share through music. Founded in 2...  ...leadership to shape the architecture, reliability, and performance of systems used by millions of fans, creators, and partner... 
    Performance
    Work at office
    Work from home
    Worldwide
    Flexible hours

    SoundCloud

    New York, NY
    6 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Lead Software Systems Engineer - GPU Performance. Be the first to apply!