Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior GPU AI Inference Engineer - Triton & Dynamo

NVIDIA

A leading technology company is seeking a Senior System Software Engineer to develop GPU-accelerated AI inference serving software. The ideal candidate will have over 5 years of experience with deep learning software, strong skills in Rust and C++, and a collaborative approach in a fast-paced environment. This role offers competitive salary options based on experience, along with other incentives. Join us in shaping the AI landscape and contribute to important projects in a diverse and inclusive team. #J-18808-Ljbffr NVIDIA Corporation

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Senior GPU AI Inference Engineer - Triton & Dynamo in Santa Clara, CA vacancy
  • $152k - $241.5k

     ...We are looking for a Senior System Software Engineer to work on Dynamo-Triton Inference Server ( . NVIDIA is hiring software engineers for its GPU-accelerated deep learning software team. Academic...  ...using GPUs to power a revolution in AI, enabling breakthroughs in problems... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $184k - $356.5k

    NVIDIA Corporation is seeking a Senior Deep Learning Software Engineer specializing in Inference to join their growing team in Santa Clara, CA. The role involves optimizing GPU-accelerated software for advanced AI applications, including developing high-performance deep... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    13 hours ago
  •  ...leading technology company is seeking a Senior AI Software Engineer to join their team in Santa Clara, California...  ...groundbreaking AI systems software for inference applications including deep learning framework optimizations and GPU kernel technologies. You will closely... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    13 hours ago
  • $152k - $241.5k

     ...platform upon which every new AI‑powered application is built. We are seeking a Senior Software Engineer – AI Inference to advance open‑source...  ...work. Improve multi‑GPU inference performance and...  ...to vLLM, SGLang, PyTorch, Triton, NCCL, Dynamo or adjacent serving/... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  •  ...Senior AI/ML DevOps Engineer Join Cisco's CX AI Incubation Team as a Senior AI/ML...  ...observable AI services, optimizing inference performance from CPU and small GPUs to large multi-GPU servers, including air-...  ...and GPU profiling (vLLM, Triton, TensorRT-LLM, llama.cpp).... 
    Senior

    Webex Events (formerly Socio)

    San Jose, CA
    2 days ago
  • $170.5k - $240.71k

    Intel Corporation is seeking an experienced AI Software Development Engineer to drive optimization of AI inference workloads. Responsibilities include optimizing Large Language Models on GPUs and developing efficient graph-based compilation flows. Candidates should have... 

    Intel Corporation

    Santa Clara, CA
    13 hours ago
  • $184k - $287.5k

     ...We're looking for outstanding AI systems engineers to develop groundbreaking technologies in the inference systems software stack! We build...  ..., code generators, and GPU kernel technologies for NVIDIA...  ...especially using CUDA C/C++, cuTile, Triton, or similar) Open source... 
    Senior
    Remote work

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $152k - $241.5k

     ...unlimited potential of AI to define the next era...  ...computing. An era in which our GPU acts as the brains of...  ...Deep Learning Compiler Engineer. NVIDIA is hiring...  ...the backbone of NVIDIA’s inference engine, spanning across...  ...e.g., MLIR, LLVM, XLA, Triton, etc.). ~ Excellent C... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  • Advanced Micro Devices is seeking a principal software developer to join the ROCm GPU-compute team in Santa Clara, California. The ideal candidate will have over 10 years of software development experience in C/C++, Python, and GPU technologies. This role involves developing... 
    Senior

    Advanced Micro Devices

    Santa Clara, CA
    13 hours ago
  •  ...leading technology company is looking for a Principal AI Performance Engineer to optimize AI inference performance on GPUs. In this role, you will lead a team...  .... Ideal candidates possess extensive experience in GPU computing, strong analytical skills, and a background... 

    Advanced Micro Devices

    San Jose, CA
    13 hours ago
  •  ...computing experiences-from AI and data centers, to...  ...in enhancing GPU kernel performance, accelerating...  ...SOTA LLM and Multimodal inference at scale across multi-...  ...PERSON: Skilled engineer with strong technical...  ...in GPGPU C++, Triton, TileLang or DSL development... 
    Senior

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    1 day ago
  • $184k - $287.5k

     ...NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key...  ...build, and optimize the GPU-accelerated software that...  ...'s most sophisticated AI applications. Our team...  ...including CUTLASS, OAI Triton, NCCL, and CUDA kernels... 
    Senior
    Remote work

    NVIDIA

    Santa Clara, CA
    5 days ago
  • $165k - $242k

     ...Senior Software Engineer II, Inference Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers...  ...-per-token analytics, GPU resource isolation)....  ...inference frameworks (vLLM, Triton, TensorRT-LLM, Ray Serve... 
    Senior
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours
    Shift work

    CoreWeave

    Sunnyvale, CA
    2 days ago
  • Advanced Micro Devices, Inc. is seeking a Senior Staff Software Developer who will play a pivotal role in shaping the future of AI and improving performance in key applications....  ...expertise in high-performance C++ programming and GPU technologies, with experience optimizing AI... 
    Senior

    Advanced Micro Devices, Inc.

    Santa Clara, CA
    3 days ago
  • $163.5k - $212.4k

    NIO is seeking a Senior AI Inference Infrastructure Software Engineer in San Jose, CA, specializing in building scalable inference systems for large language and...  ...and strong skills in performance optimization and GPU programming. The position offers a competitive salary... 
    Senior

    nio.com

    San Jose, CA
    1 day ago
  • $139k - $204k

     ...Senior Software Engineer I, Inference Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers...  ...-per-token analytics, GPU resource isolation)....  ...inference frameworks (vLLM, Triton, TensorRT-LLM, Ray Serve... 
    Senior
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours
    Shift work

    CoreWeave

    Sunnyvale, CA
    2 days ago
  • A leading technology company is seeking a skilled engineer to optimize deep learning frameworks and enhance GPU kernel performance. The ideal candidate excels in collaborative...  ...with a focus on innovative solutions and advancing AI technologies. #J-18808-Ljbffr Advanced Micro... 
    Senior

    Advanced Micro Devices

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

     ...highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale...  ...performance inference stacks, optimize GPU kernels and compilers, drive...  ...with ML compilers and DSLs (e.g., Triton, TorchDynamo/Inductor, MLIR/LLVM, XLA... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $152k - $287.5k

    A leading technology company is seeking a Senior Software Engineer to develop solutions for GPU clusters aimed at enhancing machine learning innovation. The ideal candidate will have over 5 years of experience in software engineering with significant involvement in ML infrastructure... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • A leading technology company in Santa Clara is seeking a Senior Deep Learning Software Engineer to design and build automated inference solutions. The ideal candidate will have extensive experience with deep learning techniques and software engineering. Key responsibilities... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • $128.7k - $261.3k

     ...Team The Model Deployment & Inference Solutions team in GM AV...  ...currently performed manually by engineers. Build the developer experience...  ...Familiarity with the NVIDIA GPU stack at the integration...  ...(CUDA-aware Python,TensorRT, Triton inference server,torch.compile... 
    Senior
    Local area
    Remote work
    Work from home
    Relocation package
    Flexible hours
    Shift work

    General Motors

    Sunnyvale, CA
    2 days ago
  • $155.42k - $205.9k

     ...the Team: The ML Inference Platform is part of the...  ...that powers GM's AI efforts. We're proud...  ...committed to maximizing GPU utilization across platforms...  ...We are seeking a Senior ML Infrastructure engineer to help build and...  ...serving frameworks (triton, rayserve, vLLM etc).... 
    Senior
    Local area
    Remote work
    Work from home
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    13 hours ago
  • $209k

     ...preprocessing, feature engineering, and dataset versioning...  ...performance LLM training GPU infrastructure and...  ...Understand the auto scale for inference service and multi-...  ...and resource-efficient AI workloads across multi-...  ...specific kernels (e.g., CUDA, Triton); • Systems... 
    Senior
    Work at office
    Remote work
    1 day per week

    Zoom Video Communications

    San Jose, CA
    2 days ago
  • A leading tech company in California is seeking a Principal Software Engineer for the Dynamo platform, specializing in scalable AI inference in distributed environments. Candidates should possess over 15 years of experience and strong skills in Rust, C++, and Python, alongside... 

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • $272k - $431.25k

     ...platform for every new AI-powered application....  ...a Principal Software Engineer - AI Inference to advance open-source...  ...runtime architecture, GPU performance engineering...  ...community. Mentor senior engineers, raise the technical...  ..., SGLang, PyTorch, Triton, NCCL, or related GPU/... 
    Remote work

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $152k - $241.5k

     ...driving advancements in AI and machine learning to...  ...talented and motivated engineers to join our TensorRT...  ...-leading deep learning inference software for NVIDIA AI accelerators. As a Senior Software Engineer in the...  ...deep learning experts and GPU architects throughout... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $272k - $431.25k

     ...NVIDIA Dynamo is an innovative, open-source platform...  ...on efficient, scalable inference for large language and...  ...models in distributed GPU environments. By bringing...  ...achieves high-performance AI inference for demanding...  ...we’re searching for engineers enthusiastic about building... 

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $152k - $241.5k

     ...built in the age of Generative AI? Join NVIDIA’s TensorRT team...  ...entry point for out-of-framework inference globally. We are moving beyond...  ...are a systems-thinking C++ engineer who wants to help scale out an...  ...and optimization (CPU and/or GPU), including using tooling to drive... 
    Senior

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $152k - $241.5k

     ...We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep...  ...with teams of deep learning experts, GPU architects and DevOps engineers...  ...an existing vacancy. NVIDIA uses AI tools in its recruiting processes.... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $152k - $241.5k

     ...highly motivated, creative engineers to join the Platform...  ...: debug and root-cause GPU bottlenecks and issues...  ...for gaming, creator, and AI workload, validate BSP...  ...engineering levels and senior management. Strong C...  ..., LLM training and inference, and Arm architecture performance... 
    Senior

    NVIDIA

    Santa Clara, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior GPU AI Inference Engineer - Triton & Dynamo. Be the first to apply!