Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Software Engineer, AI Inference Systems

$184k - $287.5k

Dormont Manufacturing Co

We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-performance inference stacks, optimize GPU kernels and compilers, drive industry benchmarks, and scale workloads across multi-GPU, multi-node, and multi-cloud environments. You’ll collaborate across inference, compiler, scheduling, and performance teams to push the frontier of accelerated computing for AI. What you’ll be doing: Contribute features to vLLM that empower the newest models with the latest NVIDIA GPU hardware features; profile and optimize the inference framework (vLLM) with methods like speculative decoding, data/tensor/expert/pipeline-parallelism, prefill-decode disaggregation. Develop, optimize, and benchmark GPU kernels (hand-tuned and compiler-generated) using techniques such as fusion, autotuning, and memory/layout optimization; build and extend high-level DSLs and compiler infrastructure to boost kernel developer productivity while approaching peak hardware utilization. Define and build inference benchmarking methodologies and tools; contribute both new benchmark and NVIDIA’s submissions to the industry-leading MLPerf Inference benchmarking suite. Architect the scheduling and orchestration of containerized large-scale inference deployments on GPU clusters across clouds. Conduct and publish original research that pushes the pareto frontier for the field of ML Systems; survey recent publications and find a way to integrate research ideas and prototypes into NVIDIA’s software products. What we need to see: Bachelor’s degree (or equivalent expeience) in Computer Science (CS), Computer Engineering (CE) or Software Engineering (SE) with 7+ years of experience; alternatively, Master’s degree in CS/CE/SE with 5+ years of experience; or PhD degree with the thesis and top-tier publications in ML Systems, GPU architecture, or high-performance computing. Strong programming skills in Python and C/C++; experience with Go or Rust is a plus; solid CS fundamentals: algorithms & data structures, operating systems, computer architecture, parallel programming, distributed systems, deep learning theories. Knowledgeable and passionate about performance engineering in ML frameworks (e.g., PyTorch) and inference engines (e.g., vLLM and SGLang). Familiarity with GPU programming and performance: CUDA, memory hierarchy, streams, NCCL; proficiency with profiling/debug tools (e.g., Nsight Systems/Compute). Experience with containers and orchestration (Docker, Kubernetes, Slurm); familiarity with Linux namespaces and cgroups. Excellent debugging, problem-solving, and communication skills; ability to excel in a fast-paced, multi-functional setting. Ways to stand out from the crowd Experience building and optimizing LLM inference engines (e.g., vLLM, SGLang). Hands‑on work with ML compilers and DSLs (e.g., Triton, TorchDynamo/Inductor, MLIR/LLVM, XLA), GPU libraries (e.g., CUTLASS) and features (e.g., CUDA Graph, Tensor Cores). Experience contributing to containerization/virtualization technologies such as containerd/CRI-O/CRIU. Experience with cloud platforms (AWS/GCP/Azure), infrastructure as code, CI/CD, and production observability. Contributions to open‑source projects and/or publications; please include links to GitHub pull requests, published papers and artifacts. At NVIDIA, we believe artificial intelligence (AI) will fundamentally transform how people live and work. Our mission is to advance AI research and development to create groundbreaking technologies that enable anyone to harness the power of AI and benefit from its potential. Our team consists of experts in AI, systems and performance optimization. Our leadership includes world-renowned experts in AI systems who have received multiple academic and industry research awards. If you’re excited to build systems, kernels, and tools that make large-scale AI faster, more efficient, and easier to deploy, we’d love to hear from you. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5. You will also be eligible for equity and benefits. Applications for this job will be accepted at least until February 28, 2026. This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. #J-18808-Ljbffr Dormont Manufacturing Co

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Senior Software Engineer, AI Inference Systems in California, MO vacancy
  • $184k - $287.5k

    Dormont Manufacturing Co is seeking an outstanding AI systems engineer to develop groundbreaking technologies in the inference systems software stack. You'll innovate and develop new AI systems technologies while collaborating closely with other engineers at NVIDIA across... 
    Senior

    Dormont Manufacturing Co

    California, MO
    1 day ago
  • $184k - $287.5k

    Dormont Manufacturing Co is seeking highly skilled software engineers to join their team in California. The role involves developing and optimizing AI inference systems for large-scale models, collaborating with cross-functional teams to enhance system performance. The... 
    Senior

    Dormont Manufacturing Co

    California, MO
    2 days ago
  • $152k - $287.5k

    Dormont Manufacturing Co is seeking a Senior System Software Engineer to join its fast-paced team working on the Dynamo-Triton Inference Server. The ideal candidate will develop GPU-accelerated AI software, contribute to feature development, and drive customer adoption... 
    Senior

    Dormont Manufacturing Co

    California, MO
    1 day ago
  • $152k - $241.5k

    We are looking for a Senior System Software Engineer to work on Dynamo‑Triton Inference Server. NVIDIA is hiring software engineers for its GPU‑accelerated deep learning...  ...the world are using GPUs to power a revolution in AI, enabling breakthroughs in problems from image... 
    Senior

    Dormont Manufacturing Co

    California, MO
    2 days ago
  • $180k - $280k

     ...builds autonomous aerospace systems for the real world. At the...  ...real time, safety critical software stack that has to work under...  ..., read on. The Role As a Senior Embedded Software Engineer, you will own software that...  ...simulation and test runners Modern AI coding tools used as... 
    Senior

    Elroy Air

    California, MO
    2 days ago
  • $184k - $287.5k

     ...deep learning ignited modern AI — the next era of computing —...  ...test case reports Providing software quality reports based on static...  ...(or higher) in Electrical Engineering or Computer Science or equivalent...  ...kernel. OOB or In-band System Management experience with exposure... 
    Senior

    Dormont Manufacturing Co

    California, MO
    2 days ago
  • $152k - $241.5k

     ...We are now looking for a Senior Deep Learning Software Engineer, TensorRT Performance! NVIDIA...  ...performance of NVIDIA’s inference ecosystem. NVIDIA is rapidly...  ...in areas like Generative AI, Recommenders and Vision...  ...latency, resource-constrained systems or embedded AI pipelines (... 
    Senior

    NVIDIA

    California, MO
    2 days ago
  • $184k - $287.5k

    Joining NVIDIA’s DGX Cloud AI Efficiency Team means...  ..., post‑training, inference. Our objective is to deliver...  ...an AI infrastructure software engineer to join our team. You’...  ...software and systems engineering practices...  ...of AI systems. As a senior DGX Cloud AI Infrastructure... 
    Senior

    Dormont Manufacturing Co

    California, MO
    2 days ago
  • $184k - $287.5k

    A leading tech company is seeking a Senior Performance Engineer in California to enhance AI system performance and datacenter applications. The role requires extensive experience in accelerated computing, deep learning frameworks, and cloud/container architecture. Applicants... 
    Senior

    NVIDIA

    California, MO
    2 days ago
  • $184k - $287.5k

    We’re looking for outstanding AI systems engineers to develop groundbreaking technologies in the inference systems software stack! We build innovative AI systems software to accelerate for AI inference. As a member of the team, you’ll develop libraries, code generators,... 
    Senior

    Dormont Manufacturing Co

    California, MO
    2 days ago
  •  ...Developer located in California, Missouri. The role involves developing and optimizing high-speed L2/L3 protocol stacks for AI training and inference server infrastructure. The ideal candidate will have 6+ years of development experience with a strong proficiency in C,... 
    Senior

    QFocus Technologies LLC

    California, MO
    3 days ago
  • $152k - $241.5k

    NVIDIA Solutions Engineering team is searching for engineers to help develop and bring NVIDIA...  ...their best work. We are looking for a System Software Engineer with expertise in embedded...  ...for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA... 
    Senior

    Dormont Manufacturing Co

    California, MO
    2 days ago
  • $152k - $241.5k

     ...looking for a motivated Performance engineer to influence the roadmap of...  ...understanding of computer system architecture, HW-SW interactions...  ...principles (aka systems software fundamentals) Implement micro...  ...existing vacancy. NVIDIA uses AI tools in its recruiting processes... 
    Senior

    Dormont Manufacturing Co

    California, MO
    2 days ago
  • Dormont Manufacturing Co is seeking a Compiler Architect with a focus on cloud-based AI inference. The role requires 12+ years of experience in compiler and systems software development, emphasizing ML inference. You will design and implement a scalable MLIR-based compiler... 

    Dormont Manufacturing Co

    California, MO
    1 day ago
  • $184k - $287.5k

    We are hiring senior engineers to work on the CUDA driver and runtime, core...  ...investigates bottlenecks in software or hardware and delivers features...  ...model across a range of system configurations and hardware capabilities...  ...vacancy. NVIDIA uses AI tools in its recruiting... 
    Senior

    Dormont Manufacturing Co

    California, MO
    2 days ago
  • $184k - $287.5k

    NVIDIA is seeking a Senior Software Engineer to join our CSP Engagements team, focusing on system software for Datacenter products such as GB200. This role combines deep technical...  ...OS, middleware, and applications with focus on AI/ML and HPC workloads. Perform advanced system... 
    Senior

    Dormont Manufacturing Co

    California, MO
    2 days ago
  • Compa is seeking a Senior Software Engineer in San Francisco to take ownership of software systems. You will work closely with Product and Design teams to ship impactful features. The ideal candidate will have 5+ years of experience and proficiency in modern web languages... 
    Senior

    Compa

    California, MO
    7 days ago
  • Quiet Capital is seeking a software development professional for their Revenue Cycle Management team focused on claims submission. You...  ...a key role in transforming financial workflows into automated systems to streamline operations for healthcare providers. The ideal candidate... 
    Senior

    Quiet Capital

    California, MO
    1 day ago
  • $184k - $287.5k

     ...developing advanced multi-rack, multi-tenant AI/ML datacenters with NVIDIA GB200, and upcoming GB300 GPUs. NVIDIA seeks a Senior Software Engineer for our CSP (Cloud Service Provider)...  ...development experience in distributed systems (Go, Rust, C/C++ or Python for tooling).... 
    Senior

    Dormont Manufacturing Co

    California, MO
    1 day ago
  •  ...offerings. You will drive Engineering projects that deliver...  ...functionality. At IBM Software, we transform client challenges...  ...the world’s leading AI-powered, cloud-native...  ...We’re looking for Senior Engineers with a deep backend...  ...roadmap and improve system effectiveness across... 
    Senior

    Cacheflow

    California, MO
    4 days ago
  • $184k - $287.5k

     ...grounded in real‑world scenarios. Do you obsess about software engineering? So do we! We are looking for a strong System Engineer to develop and maintain NVIDIA’s software...  ...team and help us build the future of Physical AI! What you’ll be doing: Implement, validate,... 
    Senior

    Dormont Manufacturing Co

    California, MO
    1 day ago
  • $152k - $241.5k

    We are looking for a world-class System Software Automation engineer to join the Build & Release DRIVE OS team. NVIDIA DRIVE™ OS is a foundational software...  ...in Python (or similar). Curiosity about LLMs, NLP, or AI‑driven developer tools. Passion for “it just works”... 
    Senior

    Dormont Manufacturing Co

    California, MO
    1 day ago
  • $168k - $270.25k

    NVIDIA GPU Architecture Group is seeking a senior software engineer to automate and optimize performance analysis workflows for AI training and inference workloads. You will not only perform analysis but also reshape how it's done, building tools and workflows that scale... 
    Senior
    Work experience placement

    NVIDIA

    California, MO
    16 hours ago
  •  ...exciting opportunity to work on cutting-edge NAND Flash technology for AI infrastructure. The role involves working with cross-functional...  ...in a relevant field and have extensive experience in NAND Flash systems, along with strong leadership and technical skills. A... 
    Senior

    6AM City, LLC

    California, MO
    16 hours ago
  • Introduction At IBM Software, we transform client challenges into solutions...  ...Building the world’s leading AI-powered, cloud-native...  ...Management, and cross-functional engineering teams to align on priorities...  ..., and participate in system-wide debugging, maintenance,... 
    Senior

    Cacheflow

    California, MO
    3 days ago
  • $141.3k - $226k

    Senior Software Engineer - Cloud Native Storage Design and implement scalable distributed storage control...  ...clusters. Resiliency in Distributed Systems: Building robust storage control-plane...  ...delivery of product features. Utilize AI tools for specification-driven development... 
    Senior
    Local area

    jobs.frontdoordefense.com - Jobboard

    California, MO
    4 days ago
  • Overview NVIDIA is looking for an outstanding senior software engineer to join our endeavor to accelerate...  ...familiarity with the use of agentic AI to boost the speed and quality of delivered...  ...response properties, larger molecular systems, or Gaussian integral technology.... 
    Senior

    NVIDIA Gruppe

    California, MO
    16 hours ago
  • About the Role Senior Software Engineer What You'll Do Design, build, and maintain reliable microservices...  ...that benefit multiple teams. Use AI as a force multiplier. Experiment with...  ...experience, including work on production systems. Strong experience with Go for backend... 
    Senior

    Bonterra

    California, MO
    4 days ago
  • Bonterra is seeking a Senior Software Engineer to design, build, and maintain microservices and scalable Angular applications. You'll work with...  ...to the architectural decisions of a large-scale distributed system. The ideal candidate has 3+ years of experience in full-... 
    Senior

    Bonterra

    California, MO
    4 days ago
  • $152k - $241.5k

    NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA’s rapidly growing enterprise and...  ...NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We are hiring Sr. Software Engineer who will help build simulators for our DGX Server... 
    Senior
    Work experience placement

    Dormont Manufacturing Co

    California, MO
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Software Engineer, AI Inference Systems. Be the first to apply!