Senior Software Engineer, AI Inference Systems
$184k - $287.5k
Dormont Manufacturing Co
We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-performance inference stacks, optimize GPU kernels and compilers, drive industry benchmarks, and scale workloads across multi-GPU, multi-node, and multi-cloud environments. You’ll collaborate across inference, compiler, scheduling, and performance teams to push the frontier of accelerated computing for AI.

What you’ll be doing:
- Contribute features to vLLM that empower the newest models with the latest NVIDIA GPU hardware features; profile and optimize the inference framework (vLLM) with methods like speculative decoding, data/tensor/expert/pipeline parallelism, and prefill-decode disaggregation.
- Develop, optimize, and benchmark GPU kernels (hand-tuned and compiler-generated) using techniques such as fusion, autotuning, and memory/layout optimization; build and extend high-level DSLs and compiler infrastructure to boost kernel developer productivity while approaching peak hardware utilization.
- Define and build inference benchmarking methodologies and tools; contribute both new benchmarks and NVIDIA’s submissions to the industry-leading MLPerf Inference benchmarking suite.
- Architect the scheduling and orchestration of containerized large-scale inference deployments on GPU clusters across clouds.
- Conduct and publish original research that pushes the Pareto frontier for the field of ML Systems; survey recent publications and find ways to integrate research ideas and prototypes into NVIDIA’s software products.

What we need to see:
- Bachelor’s degree (or equivalent experience) in Computer Science (CS), Computer Engineering (CE), or Software Engineering (SE) with 7+ years of experience; alternatively, a Master’s degree in CS/CE/SE with 5+ years of experience; or a PhD with a thesis and top-tier publications in ML Systems, GPU architecture, or high-performance computing.
- Strong programming skills in Python and C/C++; experience with Go or Rust is a plus; solid CS fundamentals: algorithms and data structures, operating systems, computer architecture, parallel programming, distributed systems, and deep learning theory.
- Knowledgeable and passionate about performance engineering in ML frameworks (e.g., PyTorch) and inference engines (e.g., vLLM and SGLang).
- Familiarity with GPU programming and performance: CUDA, memory hierarchy, streams, NCCL; proficiency with profiling/debug tools (e.g., Nsight Systems/Compute).
- Experience with containers and orchestration (Docker, Kubernetes, Slurm); familiarity with Linux namespaces and cgroups.
- Excellent debugging, problem-solving, and communication skills; ability to excel in a fast-paced, multi-functional setting.

Ways to stand out from the crowd:
- Experience building and optimizing LLM inference engines (e.g., vLLM, SGLang).
- Hands-on work with ML compilers and DSLs (e.g., Triton, TorchDynamo/Inductor, MLIR/LLVM, XLA), GPU libraries (e.g., CUTLASS), and features (e.g., CUDA Graph, Tensor Cores).
- Experience contributing to containerization/virtualization technologies such as containerd/CRI-O/CRIU.
- Experience with cloud platforms (AWS/GCP/Azure), infrastructure as code, CI/CD, and production observability.
- Contributions to open-source projects and/or publications; please include links to GitHub pull requests, published papers, and artifacts.

At NVIDIA, we believe artificial intelligence (AI) will fundamentally transform how people live and work. Our mission is to advance AI research and development to create groundbreaking technologies that enable anyone to harness the power of AI and benefit from its potential. Our team consists of experts in AI, systems, and performance optimization. Our leadership includes world-renowned experts in AI systems who have received multiple academic and industry research awards.

If you’re excited to build systems, kernels, and tools that make large-scale AI faster, more efficient, and easier to deploy, we’d love to hear from you.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5. You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until February 28, 2026. This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
