Senior AI Inference Kernel & Runtime Engineer

$184k - $287.5k

NVIDIA

NVIDIA is seeking an experienced AI systems engineer to innovate and develop cutting-edge technologies in AI inference systems. You will design and optimize kernel technologies to accelerate workloads for NVIDIA's hardware architecture.

The ideal candidate holds a Master’s degree and has over 6 years of experience in ML/DL systems. A competitive salary is offered, ranging from 184,000 USD to 287,500 USD, along with equity and benefits.

Vacancy posted 2 days ago
Similar jobs that could be interesting for you
Based on the Senior AI Inference Kernel & Runtime Engineer in Santa Clara, CA vacancy
  • $184k - $287.5k

    NVIDIA Gruppe is seeking talented AI systems engineers to advance innovative technologies in AI inference systems software. This role involves developing cutting-edge libraries, code generators, and kernel technologies for NVIDIA's architecture, emphasizing high-impact... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  •  ...computing experiences—from AI and data centers, to...  .... THE ROLE As a senior member of the LLM inference framework team, you...  ...inference runtimes for large language models...  ...intersection of inference engines, distributed systems...  ...and GPU runtime and kernel backends. THE... 
    Senior

    Advanced Micro Devices, Inc.

    Santa Clara, CA
    5 days ago
  •  ...leading technology company is seeking a Senior AI Software Engineer to join their team in Santa Clara,...  ...groundbreaking AI systems software for inference applications including deep learning framework optimizations and GPU kernel technologies. You will closely collaborate... 
    Senior

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $184k - $287.5k

     ...NVIDIA Gruppe in Santa Clara is seeking an AI Systems Engineer to innovate and develop cutting-edge technologies in the AI inference software stack. Candidates should hold a Master...  .... The role involves building efficient kernels and compilers for AI workloads while actively... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  •  ...Advanced Micro Devices in Santa Clara seeks a Senior ML Engineer focused on optimizing large language model inference runtimes. The role involves architecting distributed systems...  ...field. Join us to push the boundaries of AI and shape impactful technology. We offer a culture... 
    Senior

    Advanced Micro Devices, Inc.

    Santa Clara, CA
    3 days ago
  •  ...in Sunnyvale is seeking a Sr. Software Engineer for its Cloud Runtime Protection team. You will design and...  ...performance features to secure cloud-native and AI workloads. The position demands over 1...  ...with C/C++ on Linux and expertise in kernel modules and eBPF. This hybrid role... 
    Senior
    2 days per week
    3 days per week

    CrowdStrike Holdings, Inc.

    Sunnyvale, CA
    3 days ago
  • $152k - $241.5k

     ...deep learning ignited modern AI — the next era of computing —...  ...an AI & Deep Learning Compiler Engineer. NVIDIA is hiring software engineers...  ...been the backbone of NVIDIA’s inference engine, spanning across data...  ...such as PyTorch, JAX. GPU kernel authoring and performance... 
    Senior

    NVIDIA

    Santa Clara, CA
    6 days ago
  • $184k - $287.5k

     ...We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency...  ...high-performance inference stacks, optimize GPU kernels and compilers, drive industry benchmarks, and scale workloads... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $152k - $241.5k

     ...eager to work on cutting-edge AI technology for safety-...  ...NVIDIA's TensorRT team as a Senior Software Engineer, and be at the forefront of...  ...enabling high-performance AI inference solutions for automotive safety...  ...into TensorRT's compiler and runtime for specialized and constrained... 
    Senior

    NVIDIA

    Santa Clara, CA
    3 days ago
  • d-Matrix, based in Santa Clara, CA, is seeking a Staff Runtime Systems Engineer to lead the development of runtime software for AI inference platforms. You'll be responsible for architecting and developing firmware for multiprocessor systems-on-chip, collaborating with... 
    Senior
    3 days per week

    d-Matrix

    Santa Clara, CA
    4 days ago
  •  ...computing experiences-from AI and data centers, to...  ...AMD is looking for a Senior Staff AI Infra Engineer who is passionate about...  ...accelerate LLM training and inference on AMD GPUs, improving kernel, communication, and end...  ...GPU, network, and runtime layers. • Drive technical... 

    Advanced Micro Devices, Inc.

    Santa Clara, CA
    2 days ago
  •  ...A leading technology firm is seeking a Senior Software Engineer for Quantized Inference to implement quantized recipes for advanced model optimization. This role demands strong skills in Python and C++, alongside experience in ML accelerators and software engineering... 
    Senior

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $184k - $356.5k

     ...NVIDIA Gruppe is looking for a Senior Software Engineer specializing in Deep Learning Inference in Santa Clara, California. You will design and optimize GPU-accelerated software critical for advanced AI applications, contributing to libraries like vLLM and SGLang. Ideal... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

     ...We're looking for outstanding AI systems engineers to develop groundbreaking technologies in the inference systems software stack! We...  ...libraries, code generators, and GPU kernel technologies for NVIDIA's...  ..., new LLM inference runtimes components, and kernel code generators... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $152k - $241.5k

     ...platform upon which every new AI‑powered application is built. We are seeking a Senior Software Engineer – AI Inference to advance open‑source LLM...  ..., designing pragmatic runtime improvements, and shipping...  ...orchestration to C++/CUDA kernels—using data to guide optimization... 

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $152k - $287.5k

     ...NVIDIA is seeking a Senior Software Engineer for Quantized Inference in Santa Clara, CA. This role involves implementing quantized and sparse recipes in inference engines, ensuring efficient model export pipelines, and optimizing throughput for large language models.... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  •  ...Advanced Micro Devices is seeking a strategic software engineering lead in Santa Clara, California. This role involves improving application...  .... Key responsibilities include developing techniques for inference optimization and supporting the ROCm ecosystem expansion. A Bachelor... 
    Senior

    Advanced Micro Devices, Inc.

    Santa Clara, CA
    2 days ago
  • $152k - $241.5k

     ...We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make...  ...well as Background in GPU kernel programming using CUDA...  ...PyTorch, TensorFlow, ONNX Runtime or other ML frameworks....  ...vacancy. NVIDIA uses AI tools in its recruiting... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $152k - $241.5k

     ...NVIDIA is hiring an AI & Deep Learning Compiler Engineer for its Deep Learning & AI Compiler (DLC) team. What you’ll be doing Analyzing deep learning...  ...algorithms and frameworks, such as PyTorch and JAX. GPU kernel authoring and performance analysis using tools such as Nsight... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  •  ...computing experiences-from AI and data centers, to...  ...in enhancing GPU kernel performance, accelerating...  ...SOTA LLM and Multimodal inference at scale across multi-GPU...  ...PERSON: Skilled engineer with strong technical...  ...Integrate and optimize runtime execution through graph... 
    Senior

    Advanced Micro Devices, Inc.

    Santa Clara, CA
    4 days ago
  • $184k - $287.5k

    A leading technology company is seeking a Senior Software Engineer for AI and DL Kernel Libraries in Santa Clara, CA. The role involves designing and optimizing kernels for high-impact AI workloads and collaborating with engineers on innovative solutions. Candidates should... 
    Senior
    Remote job

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • $150k - $225k

     ...PlusAI is a Physical AI company pioneering AI-based virtual driver...  ...closely with our autonomy and runtime teams to improve our redundant...  ...with Linux system and basic kernel tuning, network tuning, device...  ...with CV pipeline and model inference on edge platforms. Experience... 
    Senior

    PlusAI, Inc.

    Santa Clara, CA
    3 days ago
  •  ...leading technology company is seeking a skilled engineer to optimize deep learning frameworks and enhance GPU kernel performance. The ideal candidate excels in collaborative...  ...a focus on innovative solutions and advancing AI technologies. Advanced Micro... 
    Senior

    Advanced Micro Devices

    Santa Clara, CA
    5 days ago
  • $184k - $287.5k

     ...NVIDIA Corporation is seeking a Senior Formal Verification Engineer for GPU Kernels in Santa Clara, CA. In this role, you will develop and deliver verification tools for GPU kernels, integrating AI into verification workflows. The ideal candidate has an MS or PhD in Computer... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  • NVIDIA Gruppe is looking for a senior engineer to join their Math Libraries team in Santa Clara,...  ...software on GPUs, with a strong focus on kernel generation. The ideal candidate has over...  ...opportunity to be part of cutting-edge AI and data center technologies. 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $184k - $356.5k

    NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • NVIDIA Gruppe is seeking a Senior Formal Verification Engineer for GPU Kernels, focused on creating verification tools that ensure correct behavior in various...  ...role involves designing verification tools, integrating AI into workflows, and participating in innovative... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  • $140k - $215k

     ...Sr. Software Engineer - Sensor - Cloud Runtime Protection (Hybrid)...  ...next generation of cloud-native and AI workloads. Leveraging cutting-edge technologies...  ...standards). Experience developing kernel modules for Linux. Experience... 
    Senior
    Work experience placement
    Work at office
    Local area
    2 days per week
    3 days per week

    CrowdStrike Holdings, Inc.

    Sunnyvale, CA
    2 days ago
  •  ...A forward-thinking AI infrastructure company is seeking a Staff AI Runtime Engineer to lead the design and optimization of their AI compute platform. In this leadership role, you'll enhance AI training and inference capabilities. Successful candidates will have over 8... 
    Senior

    FlexAI

    Santa Clara, CA
    2 days ago
  • $272k - $431.25k

     ...platform for every new AI-powered application....  ...a Principal Software Engineer - AI Inference to advance open-source...  ...intersection of inference runtime architecture, GPU...  ...orchestration down to C++/CUDA kernels—using profiling and...  ...the community. Mentor senior engineers, raise the... 

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
