Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Software Engineer, AI Inference Systems

$184k - $287.5k

Dormont Manufacturing Company

We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-performance inference stacks, optimize GPU kernels and compilers, drive industry benchmarks, and scale workloads across multi-GPU, multi-node, and multi-cloud environments. You’ll collaborate across inference, compiler, scheduling, and performance teams to push the frontier of accelerated computing for AI. What you’ll be doing: Contribute features to vLLM that empower the newest models with the latest NVIDIA GPU hardware features; profile and optimize the inference framework (vLLM) with methods like speculative decoding, data/tensor/expert/pipeline-parallelism, prefill-decode disaggregation. Develop, optimize, and benchmark GPU kernels (hand-tuned and compiler-generated) using techniques such as fusion, autotuning, and memory/layout optimization; build and extend high-level DSLs and compiler infrastructure to boost kernel developer productivity while approaching peak hardware utilization. Define and build inference benchmarking methodologies and tools; contribute both new benchmark and NVIDIA’s submissions to the industry-leading MLPerf Inference benchmarking suite. Architect the scheduling and orchestration of containerized large-scale inference deployments on GPU clusters across clouds. Conduct and publish original research that pushes the pareto frontier for the field of ML Systems; survey recent publications and find a way to integrate research ideas and prototypes into NVIDIA’s software products. What we need to see: Bachelor’s degree (or equivalent expeience) in Computer Science (CS), Computer Engineering (CE) or Software Engineering (SE) with 7+ years of experience; alternatively, Master’s degree in CS/CE/SE with 5+ years of experience; or PhD degree with the thesis and top-tier publications in ML Systems, GPU architecture, or high-performance computing. Strong programming skills in Python and C/C++; experience with Go or Rust is a plus; solid CS fundamentals: algorithms & data structures, operating systems, computer architecture, parallel programming, distributed systems, deep learning theories. Knowledgeable and passionate about performance engineering in ML frameworks (e.g., PyTorch) and inference engines (e.g., vLLM and SGLang). Familiarity with GPU programming and performance: CUDA, memory hierarchy, streams, NCCL; proficiency with profiling/debug tools (e.g., Nsight Systems/Compute). Experience with containers and orchestration (Docker, Kubernetes, Slurm); familiarity with Linux namespaces and cgroups. Excellent debugging, problem-solving, and communication skills; ability to excel in a fast-paced, multi-functional setting. Ways to stand out from the crowd Experience building and optimizing LLM inference engines (e.g., vLLM, SGLang). Hands‑on work with ML compilers and DSLs (e.g., Triton, TorchDynamo/Inductor, MLIR/LLVM, XLA), GPU libraries (e.g., CUTLASS) and features (e.g., CUDA Graph, Tensor Cores). Experience contributing to containerization/virtualization technologies such as containerd/CRI-O/CRIU. Experience with cloud platforms (AWS/GCP/Azure), infrastructure as code, CI/CD, and production observability. Contributions to open‑source projects and/or publications; please include links to GitHub pull requests, published papers and artifacts. At NVIDIA, we believe artificial intelligence (AI) will fundamentally transform how people live and work. Our mission is to advance AI research and development to create groundbreaking technologies that enable anyone to harness the power of AI and benefit from its potential. Our team consists of experts in AI, systems and performance optimization. Our leadership includes world-renowned experts in AI systems who have received multiple academic and industry research awards. If you’re excited to build systems, kernels, and tools that make large-scale AI faster, more efficient, and easier to deploy, we’d love to hear from you. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5. You will also be eligible for equity and benefits. Applications for this job will be accepted at least until February 28, 2026. This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. #J-18808-Ljbffr

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Senior Software Engineer, AI Inference Systems in California, MO vacancy
  • $184k - $287.5k

     ...Dormont Manufacturing Co is seeking an outstanding AI systems engineer to develop groundbreaking technologies in the inference systems software stack. You'll innovate and develop new AI systems technologies while collaborating closely with other engineers at NVIDIA across... 
    Senior

    Dormont Manufacturing Company

    California, MO
    3 days ago
  • $152k - $287.5k

     ...Dormont Manufacturing Co is seeking a Senior System Software Engineer to join its fast-paced team working on the Dynamo-Triton Inference Server. The ideal candidate will develop GPU-accelerated AI software, contribute to feature development, and drive customer adoption... 
    Senior

    Dormont Manufacturing Company

    California, MO
    3 days ago
  • $184k - $287.5k

     ...We are now looking for a Systems Software Engineer. Do you like to think creatively and enjoy solving challenges that require innovation? If so,...  ...patterns on Silicon, failure analysis, and yield learning Enable AI applications to optimize all aspects of our software and... 
    Senior

    Dormont Manufacturing Company

    California, MO
    3 days ago
  • $152k - $241.5k

     ...NVIDIA is seeking a driven Software Engineer to enhance its neural graphics capabilities. This position focuses on design...  ...experience with a robust background in distributed systems, Python, C++, and modern inference platforms. The role offers a base salary ranging from... 
    Senior

    Dormont Manufacturing Company

    California, MO
    3 days ago
  • $152k - $288k

     ...Senior Software Engineer – TensorRT Edge-LLM Feb 12, 2026 $152K - $288K Are you...  ...real-time large language model inference? Join NVIDIA’s TensorRT Edge...  ...the next generation of edge AI for automotive and robotics....  ...autoregressive LLM serving systems, including speculative decoding... 
    Senior

    Dormont Manufacturing Company

    California, MO
    3 days ago
  • $152k - $241.5k

     ...We are looking for a driven Software Engineer to bring ground breaking models...  ...the future of Physical AI! What you’ll be doing: Design...  ...and optimize containerized inference execution for the latest 3D...  ...production-grade AI distributed systems, backend services,... 
    Senior

    Dormont Manufacturing Company

    California, MO
    4 days ago
  • $184k - $287.5k

     ...Joining NVIDIA’s DGX Cloud AI Efficiency Team means...  ..., post‑training, inference. Our objective is to deliver...  ...an AI infrastructure software engineer to join our team. You’...  ...software and systems engineering practices...  ...availability of AI systems. As a senior DGX Cloud AI... 
    Senior

    Dormont Manufacturing Company

    California, MO
    4 days ago
  • $184k - $287.5k

     ...We’re looking for outstanding AI systems engineers to develop groundbreaking technologies in the inference systems software stack! We build innovative AI systems software to accelerate for AI inference. As a member of the team, you’ll develop libraries, code generators... 
    Senior

    Dormont Manufacturing Company

    California, MO
    4 days ago
  • $152k - $241.5k

     ...We are now looking for a Senior Deep Learning Software Engineer, TensorRT Performance! NVIDIA...  ...performance of NVIDIA’s inference ecosystem. NVIDIA is rapidly...  ...in areas like Generative AI, Recommenders and Vision...  ...latency, resource-constrained systems or embedded AI pipelines (... 
    Senior

    NVIDIA

    California, MO
    5 days ago
  • $184k - $287.5k

    A leading tech company is seeking a Senior Performance Engineer in California to enhance AI system performance and datacenter applications. The role requires extensive experience in accelerated computing, deep learning frameworks, and cloud/container architecture. Applicants... 
    Senior

    NVIDIA

    California, MO
    5 days ago
  •  ...Developer located in California, Missouri. The role involves developing and optimizing high-speed L2/L3 protocol stacks for AI training and inference server infrastructure. The ideal candidate will have 6+ years of development experience with a strong proficiency in C,... 
    Senior

    QFocus Technologies LLC

    California, MO
    3 days ago
  • $152k - $241.5k

     ...looking for a motivated Performance engineer to influence the roadmap of...  ...understanding of computer system architecture, HW-SW interactions...  ...principles (aka systems software fundamentals) Implement micro-...  ...existing vacancy. NVIDIA uses AI tools in its recruiting processes... 
    Senior

    Dormont Manufacturing Company

    California, MO
    4 days ago
  • $184k - $287.5k

     ...We are hiring senior engineers to work on the CUDA driver and runtime, core...  ...investigates bottlenecks in software or hardware and delivers features...  ...model across a range of system configurations and hardware capabilities...  ...vacancy. NVIDIA uses AI tools in its recruiting... 
    Senior

    Dormont Manufacturing Company

    California, MO
    4 days ago
  •  ...City, LLC in California is seeking a skilled backend engineer to develop secure services while optimizing systems for data security and efficiency. The ideal...  ...service development. Join a dynamic team focused on AI technologies and privacy solutions. You will be crucial... 
    Senior

    6AM City

    California, MO
    3 days ago
  • $184k - $287.5k

     ...developing advanced multi-rack, multi-tenant AI/ML datacenters with NVIDIA GB200, and upcoming GB300 GPUs. NVIDIA seeks a Senior Software Engineer for our CSP (Cloud Service Provider)...  ...development experience in distributed systems (Go, Rust, C/C++ or Python for tooling).... 
    Senior

    Dormont Manufacturing Company

    California, MO
    3 days ago
  • $184k - $287.5k

     ...grounded in real‑world scenarios. Do you obsess about software engineering? So do we! We are looking for a strong System Engineer to develop and maintain NVIDIA’s software...  ...team and help us build the future of Physical AI! What you’ll be doing: Implement, validate, release... 
    Senior

    Dormont Manufacturing Company

    California, MO
    3 days ago
  • $224k - $356.5k

     ...NVIDIA is hiring engineers to scale up the introduction of next generation architecture...  ...introductions (NPIs), distributed systems, familiarity with software testing and deployment, and excellent...  ...stand out from the crowd: Developing ML/AI infrastructure. Developing bare metal... 
    Senior

    Dormont Manufacturing Company

    California, MO
    3 days ago
  •  ...Introduction At IBM Software, we transform client challenges into...  ...Building the world’s leading AI-powered, cloud-native products...  ...Management, and cross-functional engineering teams to align on priorities...  ..., and participate in system-wide debugging, maintenance,... 
    Senior

    Cacheflow

    California, MO
    4 days ago
  • $184k - $357k

     ...Senior Software Engineer, CUDA UMD - Graphs and GPU Sharing Mar 6, 2026 $184K - $357K NVIDIA’s invention...  ..., GPU deep learning ignited modern AI — the next era of computing — with the...  ...technological advancement. Are you a motivated system software engineer with a deep... 
    Senior

    Dormont Manufacturing Company

    California, MO
    3 days ago
  •  ...offerings. You will drive Engineering projects that deliver...  ...functionality. At IBM Software, we transform client challenges...  ...the world’s leading AI-powered, cloud-native...  ...We’re looking for Senior Engineers with a deep backend...  ...roadmap and improve system effectiveness across... 
    Senior

    Cacheflow

    California, MO
    3 days ago
  • $184k - $287.5k

     ...NVIDIA is hiring senior engineers to develop its AI platform and more specifically its performance optimizations...  ...programming skills and excellent system design skills. You should have...  ...architectures, numeric libraries, modular software design. Highly motivated with... 
    Senior

    Dormont Manufacturing Company

    California, MO
    4 days ago
  • $183k - $250k

     ...Crusoe Energy is searching for a Senior / Staff Software Engineer to join our Bitcoin mining operations team. You will play a critical role in developing scalable, fault-tolerant distributed systems while collaborating with product teams across global data centers. With... 
    Senior

    6AM City

    California, MO
    3 days ago
  • $152k - $241.5k

     ...NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA’s rapidly growing enterprise...  ...NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We are hiring Sr. Software Engineer who will help build simulators for our DGX Server... 
    Senior
    Work experience placement

    Dormont Manufacturing Company

    California, MO
    3 days ago
  •  ...industry. Our growing suite of AI solutions spans ambient AI...  ...$10 billion flows through our systems and we support over 100 million...  ...deployment, and observability Mentor engineers across the team, raising the...  ...4+ years of professional software engineering experience, ideally... 
    Senior

    Commure

    California, MO
    3 days ago
  •  ...Introduction At IBM Software, we transform client challenges...  ...the world’s leading AI-powered, cloud-native...  ...world. On the HashiCorp engineering team, we build the...  ...responsibilities We’re looking for Senior Engineers with a deep...  ..., and participate in system-wide debugging,... 
    Senior
    Remote work

    Cacheflow

    California, MO
    4 days ago
  •  ...About the Role Senior Software Engineer What You'll Do Design, build, and maintain reliable microservices...  ...that benefit multiple teams. Use AI as a force multiplier. Experiment with...  ...experience, including work on production systems. Strong experience with Go for backend... 
    Senior

    Bonterra

    California, MO
    4 days ago
  •  ...intelligence. We are looking for highly motivated Senior Software Engineers to work on our GPU Fabric Networking...  ...doing: Design, implement and maintain system software that enables communication...  ...for an existing vacancy. NVIDIA uses AI tools in its recruiting processes.... 
    Senior

    Dormont Manufacturing Company

    California, MO
    3 days ago
  •  ...Bonterra is seeking a Senior Software Engineer to design, build, and maintain microservices and scalable Angular applications. You'll work with...  ...to the architectural decisions of a large-scale distributed system. The ideal candidate has 3+ years of experience in full-... 
    Senior

    Bonterra

    California, MO
    3 days ago
  •  ...company for Bitcoin mining and AI cloud. Bitdeer is committed...  ...responsible for: Cloud Platform Software Development Design, develop,...  ..., fault‑tolerant distributed systems that handle high‑throughput...  ...documented code following software engineering best practices (CI/CD, code... 
    Senior
    Local area

    Bitdeer Group

    California, MO
    4 days ago
  • $169k - $195k

     ...learning, generative, and agentic AI.Aerospike powers millions of...  ...stores, recommendation engines,and other use cases. At Aerospike...  .... We’re looking for a Senior Backend Engineer to join our...  ...of what real-time, always-on systems can achieve. What You'll Do... 
    Senior
    Remote work

    Aerospike

    California, MO
    22 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Software Engineer, AI Inference Systems. Be the first to apply!