Senior Deep Learning Software Engineer, LLM Performance
$184k - $287.5k
NVIDIA Corporation
* locations: US, CA, Santa Clara
* time type: Full time
* posted on: Posted Today
* job requisition id: JR2016389

We are now looking for a Senior Deep Learning Software Engineer, LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about analyzing and improving the performance of LLM inference! NVIDIA is rapidly growing our research and development for Deep Learning Inference and is seeking excellent Software Engineers at all levels of expertise to join our team. Companies around the world are using NVIDIA GPUs to power a revolution in deep learning, enabling breakthroughs in areas like LLMs, Generative AI, Recommenders and Vision that have put DL into every software solution. Join the team that builds the software to enable the performance optimization, deployment and serving of these DL solutions. We specialize in developing GPU-accelerated deep learning software like TensorRT, DL benchmarking software and performant solutions to deploy and serve these models.

Collaborate with the deep learning community to implement the latest algorithms for public release in TensorRT LLM, vLLM, SGLang and LLM benchmarks. Identify performance opportunities and optimize SoTA LLM models across the spectrum of NVIDIA accelerators, from datacenter GPUs to edge SoCs. Implement LLM inference, serving and deployment algorithms and optimizations using TensorRT LLM, vLLM, SGLang, Triton and CUDA kernels. Work and collaborate with a diverse set of teams involving performance modeling, performance analysis, kernel development and inference software development.

**What you'll be doing:**

* Performance optimization, analysis, and tuning of LLM, VLM and GenAI models for DL inference, serving and deployment in NVIDIA/OSS LLM frameworks.
* Scale performance of LLM models across different architectures and types of NVIDIA accelerators.
* Scale performance for maximum throughput, minimum latency and throughput under latency constraints.
* Contribute features and code to NVIDIA/OSS LLM frameworks, inference benchmarking frameworks, TensorRT, and Triton.
* Work with cross-collaborative teams across generative AI, automotive, image understanding, and speech understanding to develop innovative solutions.

**What we need to see:**

* Bachelor's, Master's, PhD, or equivalent experience in relevant fields (Computer Engineering, Computer Science, EECS, AI).
* At least 8 years of relevant software development experience.
* Excellent Python/C/C++ programming, software design and software engineering skills.
* Experience with a DL framework like PyTorch, JAX, TensorFlow.

**Ways to stand out from the crowd:**

* Prior experience with an LLM framework or a DL compiler in inference, deployment, algorithms, or implementation.
* Prior experience with performance modeling, profiling, debugging, and code optimization of a DL/HPC/high-performance application.
* Architectural knowledge of CPU and GPU.
* GPU programming experience (CUDA or OpenCL).

GPU deep learning has provided the foundation for machines to learn, perceive, reason and solve problems posed using human language. The GPU started out as the engine for simulating human imagination, conjuring up the amazing virtual worlds of video games and Hollywood films. Now, NVIDIA's GPU runs deep learning algorithms, simulating human intelligence, and acts as the brain of computers, robots and self-driving cars that can perceive and understand the world. Just as human imagination and intelligence are linked, computer graphics and artificial intelligence come together in our architecture. Two modes of the human brain, two modes of the GPU. This may explain why NVIDIA GPUs are used broadly for deep learning, and NVIDIA is increasingly known as “the AI computing company.” Come, join our DL Architecture team, where you can help build the real-time, cost-effective computing platform driving our success in this exciting and quickly growing field.

#LI-Hybrid

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and .

Applications for this job will be accepted at least until April 20, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

