Senior GPU AI Inference Engineer - Triton & Dynamo
NVIDIA
A leading technology company is seeking a Senior System Software Engineer to develop GPU-accelerated AI inference serving software. The ideal candidate will have over 5 years of experience with deep learning software, strong skills in Rust and C++, and a collaborative approach in a fast-paced environment. This role offers competitive salary options based on experience, along with other incentives. Join us in shaping the AI landscape and contribute to important projects in a diverse and inclusive team. #J-18808-Ljbffr NVIDIA Corporation
$152k - $241.5k
...We are looking for a Senior System Software Engineer to work on Dynamo-Triton Inference Server ( . NVIDIA is hiring software engineers for its GPU-accelerated deep learning software team. Academic... ...using GPUs to power a revolution in AI, enabling breakthroughs in problems...Senior$184k - $356.5k
NVIDIA Corporation is seeking a Senior Deep Learning Software Engineer specializing in Inference to join their growing team in Santa Clara, CA. The role involves optimizing GPU-accelerated software for advanced AI applications, including developing high-performance deep...Senior- ...leading technology company is seeking a Senior AI Software Engineer to join their team in Santa Clara, California... ...groundbreaking AI systems software for inference applications including deep learning framework optimizations and GPU kernel technologies. You will closely...Senior
$152k - $241.5k
...platform upon which every new AI‑powered application is built. We are seeking a Senior Software Engineer – AI Inference to advance open‑source... ...work. Improve multi‑GPU inference performance and... ...to vLLM, SGLang, PyTorch, Triton, NCCL, Dynamo or adjacent serving/...Senior- ...Senior AI/ML DevOps Engineer Join Cisco's CX AI Incubation Team as a Senior AI/ML... ...observable AI services, optimizing inference performance from CPU and small GPUs to large multi-GPU servers, including air-... ...and GPU profiling (vLLM, Triton, TensorRT-LLM, llama.cpp)....Senior
$170.5k - $240.71k
Intel Corporation is seeking an experienced AI Software Development Engineer to drive optimization of AI inference workloads. Responsibilities include optimizing Large Language Models on GPUs and developing efficient graph-based compilation flows. Candidates should have...$184k - $287.5k
...We're looking for outstanding AI systems engineers to develop groundbreaking technologies in the inference systems software stack! We build... ..., code generators, and GPU kernel technologies for NVIDIA... ...especially using CUDA C/C++, cuTile, Triton, or similar) Open source...SeniorRemote work$152k - $241.5k
...unlimited potential of AI to define the next era... ...computing. An era in which our GPU acts as the brains of... ...Deep Learning Compiler Engineer. NVIDIA is hiring... ...the backbone of NVIDIA’s inference engine, spanning across... ...e.g., MLIR, LLVM, XLA, Triton, etc.). ~ Excellent C...Senior- Advanced Micro Devices is seeking a principal software developer to join the ROCm GPU-compute team in Santa Clara, California. The ideal candidate will have over 10 years of software development experience in C/C++, Python, and GPU technologies. This role involves developing...Senior
- ...leading technology company is looking for a Principal AI Performance Engineer to optimize AI inference performance on GPUs. In this role, you will lead a team... .... Ideal candidates possess extensive experience in GPU computing, strong analytical skills, and a background...
- ...computing experiences-from AI and data centers, to... ...in enhancing GPU kernel performance, accelerating... ...SOTA LLM and Multimodal inference at scale across multi-... ...PERSON: Skilled engineer with strong technical... ...in GPGPU C++, Triton, TileLang or DSL development...Senior
$184k - $287.5k
...NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key... ...build, and optimize the GPU-accelerated software that... ...'s most sophisticated AI applications. Our team... ...including CUTLASS, OAI Triton, NCCL, and CUDA kernels...SeniorRemote work$165k - $242k
...Senior Software Engineer II, Inference Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers... ...-per-token analytics, GPU resource isolation).... ...inference frameworks (vLLM, Triton, TensorRT-LLM, Ray Serve...SeniorPermanent employmentTemporary workCasual workWork at officeRemote workFlexible hoursShift work- Advanced Micro Devices, Inc. is seeking a Senior Staff Software Developer who will play a pivotal role in shaping the future of AI and improving performance in key applications.... ...expertise in high-performance C++ programming and GPU technologies, with experience optimizing AI...Senior
$163.5k - $212.4k
NIO is seeking a Senior AI Inference Infrastructure Software Engineer in San Jose, CA, specializing in building scalable inference systems for large language and... ...and strong skills in performance optimization and GPU programming. The position offers a competitive salary...Senior$139k - $204k
...Senior Software Engineer I, Inference Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers... ...-per-token analytics, GPU resource isolation).... ...inference frameworks (vLLM, Triton, TensorRT-LLM, Ray Serve...SeniorPermanent employmentTemporary workCasual workWork at officeRemote workFlexible hoursShift work- A leading technology company is seeking a skilled engineer to optimize deep learning frameworks and enhance GPU kernel performance. The ideal candidate excels in collaborative... ...with a focus on innovative solutions and advancing AI technologies. #J-18808-Ljbffr Advanced Micro...Senior
$184k - $287.5k
...highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale... ...performance inference stacks, optimize GPU kernels and compilers, drive... ...with ML compilers and DSLs (e.g., Triton, TorchDynamo/Inductor, MLIR/LLVM, XLA...Senior$152k - $287.5k
A leading technology company is seeking a Senior Software Engineer to develop solutions for GPU clusters aimed at enhancing machine learning innovation. The ideal candidate will have over 5 years of experience in software engineering with significant involvement in ML infrastructure...Senior- A leading technology company in Santa Clara is seeking a Senior Deep Learning Software Engineer to design and build automated inference solutions. The ideal candidate will have extensive experience with deep learning techniques and software engineering. Key responsibilities...Senior
$128.7k - $261.3k
...Team The Model Deployment & Inference Solutions team in GM AV... ...currently performed manually by engineers. Build the developer experience... ...Familiarity with the NVIDIA GPU stack at the integration... ...(CUDA-aware Python,TensorRT, Triton inference server,torch.compile...SeniorLocal areaRemote workWork from homeRelocation packageFlexible hoursShift work$155.42k - $205.9k
...the Team: The ML Inference Platform is part of the... ...that powers GM's AI efforts. We're proud... ...committed to maximizing GPU utilization across platforms... ...We are seeking a Senior ML Infrastructure engineer to help build and... ...serving frameworks (triton, rayserve, vLLM etc)....SeniorLocal areaRemote workWork from homeRelocationRelocation packageFlexible hours$209k
...preprocessing, feature engineering, and dataset versioning... ...performance LLM training GPU infrastructure and... ...Understand the auto scale for inference service and multi-... ...and resource-efficient AI workloads across multi-... ...specific kernels (e.g., CUDA, Triton); • Systems...SeniorWork at officeRemote work1 day per week- A leading tech company in California is seeking a Principal Software Engineer for the Dynamo platform, specializing in scalable AI inference in distributed environments. Candidates should possess over 15 years of experience and strong skills in Rust, C++, and Python, alongside...
$272k - $431.25k
...platform for every new AI-powered application.... ...a Principal Software Engineer - AI Inference to advance open-source... ...runtime architecture, GPU performance engineering... ...community. Mentor senior engineers, raise the technical... ..., SGLang, PyTorch, Triton, NCCL, or related GPU/...Remote work$152k - $241.5k
...driving advancements in AI and machine learning to... ...talented and motivated engineers to join our TensorRT... ...-leading deep learning inference software for NVIDIA AI accelerators. As a Senior Software Engineer in the... ...deep learning experts and GPU architects throughout...Senior$272k - $431.25k
...NVIDIA Dynamo is an innovative, open-source platform... ...on efficient, scalable inference for large language and... ...models in distributed GPU environments. By bringing... ...achieves high-performance AI inference for demanding... ...we’re searching for engineers enthusiastic about building...$152k - $241.5k
...built in the age of Generative AI? Join NVIDIA’s TensorRT team... ...entry point for out-of-framework inference globally. We are moving beyond... ...are a systems-thinking C++ engineer who wants to help scale out an... ...and optimization (CPU and/or GPU), including using tooling to drive...Senior$152k - $241.5k
...We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep... ...with teams of deep learning experts, GPU architects and DevOps engineers... ...an existing vacancy. NVIDIA uses AI tools in its recruiting processes....Senior$152k - $241.5k
...highly motivated, creative engineers to join the Platform... ...: debug and root-cause GPU bottlenecks and issues... ...for gaming, creator, and AI workload, validate BSP... ...engineering levels and senior management. Strong C... ..., LLM training and inference, and Arm architecture performance...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior GPU AI Inference Engineer - Triton & Dynamo. Be the first to apply!
- machine learning ai engineer Santa Clara, CA
- senior ai engineer Santa Clara, CA
- ai engineer remote Santa Clara, CA
- ai ml engineer Santa Clara, CA
- ai engineer Santa Clara, CA
- ai developer Santa Clara, CA
- ai prompt engineer Santa Clara, CA
- senior development executive Santa Clara, CA
- senior technical manager Santa Clara, CA
- senior software development engineer in test Santa Clara, CA

