Senior Deep Learning Software Engineer, LLM Performance
NVIDIA Gruppe
We are now looking for a Senior Deep Learning Software Engineer, LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about analyzing and improving the performance of LLM inference. This role focuses on designing and optimizing GPU-accelerated software for large language model deployment and serving. What you'll be doing: Optimize performance, analysis, and tuning of LLM, VLM, and GenAI models for DL inference, serving, and deployment in NVIDIA/OSS LLM frameworks. Scale performance of LLM models across different architectures and NVIDIA accelerators from datacenter GPUs to edge SoCs. Achieve maximum throughput and minimum latency, meeting throughput under latency constraints. Contribute features and code to NVIDIA/OSS LLM frameworks, inference benchmarking frameworks, TensorRT, and Triton. Collaborate with cross‑functional teams in generative AI, automotive, image understanding, and speech understanding to develop innovative solutions. What we need to see: Bachelor’s, Master’s, PhD, or equivalent experience in Computer Engineering, Computer Science, EECS, AI. At least 8 years of relevant software development experience. Excellent Python, C, and C++ programming, software design, and engineering skills. Experience with a deep learning framework such as PyTorch, JAX, or TensorFlow. Ways to stand out from the crowd: Prior experience with an LLM framework or a deep learning compiler in inference, deployment, algorithms, or implementation. Prior experience with performance modeling, profiling, debugging, and code optimization of a deep learning, HPC, or high‑performance application. Architectural knowledge of CPU and GPU systems. GPU programming experience (CUDA or OpenCL). Compensation & Benefits Base salary determined by location, experience, and comparable roles: $184,000 – $287,500 for Level4; $224,000 – $356,500 for Level5. Eligible for equity and benefits. EEO Statement NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law. #J-18808-Ljbffr NVIDIA Gruppe
- ...THE ROLE: As a senior member of the LLM inference framework... ...runtime layer, driving performance, scalability, and... ...intersection of inference engines, distributed systems... ...projects Deep Learning Integration ~... ...development Software Engineering ~ Expertise...SeniorPerformance
$224k - $356.5k
...We are looking for a Senior Deep Learning Software Engineer to design and build our automated inference and... ...designing and implementing a high-performance execution environment, low-level GPU... ...inference software solutions (TRT, TRT-LLM, TRT Model Optimizer) can maintain...SeniorPerformance$152k - $241.5k
...AI and machine learning to solve some of... ...talented and motivated engineers to join our... ...-leading deep learning inference software for NVIDIA AI accelerators... .... As a Senior Software Engineer... ...TensorRT and TensorRT-LLM to supercharge... ...close-to-metal performance analysis,...SeniorPerformance$152k - $241.5k
...NVIDIA's high-performance computing platforms are powering... .... Within our software stack, CUTLASS stands... ...and related math and deep learning computations on NVIDIA... ...Computer Science, Computer Engineering, or related field (or... ..., SGLang, vLLM, TRT-LLM, or others....SeniorPerformance$152k - $241.5k
...NVIDIA's TensorRT Edge-LLM team and help shape... ...robotics. We build the software stack that enables Large... ...to deliver high-performance, production-ready solutions... ..., Electrical/Computer Engineering, or a closely related... ...development experience. ~ Deep understanding of...SeniorPerformanceRemote work$184k - $287.5k
...outstanding AI systems engineers to develop... ...the inference systems software stack! We build innovative... ...implementations, new LLM inference runtimes components... ...engineers at NVIDIA across deep learning frameworks, libraries,... ...development and performance optimizations (especially...SeniorPerformanceRemote work- ...optimizing and developing deep learning frameworks for AMD... ...enhancing GPU kernel performance, accelerating deep learning... ...RL training and SOTA LLM and Multimodal... ...collaborate across internal GPU software teams and engage with... ...: Skilled engineer with strong technical...SeniorPerformance
$184k - $287.5k
...Artificial Intelligence, High Performance Computing and Visualization... ...looking for a motivated Deep Learning engineer to bring advanced CUDA features... ..., including PyTorch, TRT-LLM, vLLM, SGLang, JAX, etc.... ...systems principles (aka systems software fundamentals) ~...SeniorPerformanceRemote work$152k - $241.5k
...advanced computer vision and deep learning. Our team builds large-... ...are looking for a seasoned software engineer to help build video analytics... ...focusing on usability, accuracy, performance, and scalability. This is... ...and AutoML for vision and LLM/VLM model enhancement....SeniorPerformanceWorldwide$152k - $241.5k
...NVIDIA seeks a senior software engineer to join the AI Networking co-design... ...productizing machine learning tools. These include... ...working on distributed Deep Learning, particularly within LLM training and inference... ...(ML) for comprehensive performance analysis and optimization...SeniorPerformance- We are now looking for a Senior Deep Learning Software Engineer, TensorRT Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about analyzing... ...and inference libraries (e.g., TensorRT, TensorRT‑LLM, vLLM, SGLang, FlashInfer). Experience with...SeniorPerformance
$184k - $287.5k
...end‑to‑end model performance for production deployments... ...‑caching. Conduct deep, layer‑by‑layer... .... Architect the software interface to... ...Science, Computer Engineering, or a related technical... ...similar machine learning frameworks. Sophisticated... ...with modern LLM/VLM inference stacks...SeniorPerformance- ...Senior AI Engineer – LLM, RAG Bright.AI is a high-growth Physical AI company... ...evaluation strategies to measure performance, accuracy, and user... ...Computer Science, AI, Machine Learning, or a related field, with specialization in NLP or deep learning. Strong research...SeniorPerformanceImmediate start
- Overview Reinforcement learning post‑training is... ...an RL Frameworks engineering team to develop... ...team spans the full software stack, from... ...their need optimizing deep learning frameworks... ...CPUs, and LPUs for performance where it matters,... ...Reinforcement learning for LLM post‑training (...SeniorPerformance
$184k - $287.5k
...highly skilled and motivated software engineers to join us and build AI... ...architect and implement high-performance inference stacks, optimize... ...programming, distributed systems, deep learning theories. Knowledgeable... ...building and optimizing LLM inference engines (e.g.,...SeniorPerformance$229.9k - $262.4k
...Senior Lead AI Engineer (LLM Gateway, FM Hosting) Overview: At... ...leader in using machine learning to create real-time,... ...— along with our deep experience in... ...and scalable, high-performance AI infrastructure. At... ...deploy, and support AI software components including...SeniorPerformanceFull timePart timeLocal area$184k - $287.5k
NVIDIA is looking for a Senior Deep Learning Software Engineer in Santa Clara, California. This role involves analyzing and improving LLM inference performance using NVIDIA GPUs. Candidates should have extensive software development experience, strong skills in Python/C++...SeniorPerformance- NVIDIA Gruppe is seeking a Senior Deep Learning Software Engineer focused on LLM performance in Santa Clara. You will optimize GPU-accelerated software for large language model deployment, working on performance tuning for various models. The ideal candidate has over 8...SeniorPerformance
$152k - $287.5k
NVIDIA Gruppe is seeking a highly motivated Senior Software Engineer to join our communication libraries and network software team... ...for complex computing systems used in High Performance Computing and Deep Learning. The ideal candidate should have a M.S./Ph.D. degree...SeniorPerformance$152k - $241.5k
...developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU... ...are looking for a highly motivated senior software engineer for an exciting role in our... ...products in High Performance Computing and Deep Learning. What you will be doing:...SeniorPerformance$152k - $241.5k
...NVIDIA's high-performance computing platforms are powering the AI... ...and industries. Within our software stack, CUTLASS stands out as... ...GEMM) and related math and deep learning computations on NVIDIA GPUs... ...Computer Science, Computer Engineering, or related field (or equivalent...SeniorPerformance$184k - $287.5k
...That position depends on software as much as hardware, and compiler engineering is a big part of what... ...work. We're hiring senior software engineers for... ...team within NVIDIA's deep learning software organization.... ...stack, targeting high-performance kernel generation for...SeniorPerformanceWork experience placement$152k - $241.5k
NVIDIA Gruppe is seeking a Senior Software Engineer - AI Inference in Santa Clara, California. This role involves enhancing open-source LLM serving optimizations and implementing high-performance runtime capabilities. Candidates should have 5+ years of experience in building...SeniorPerformance$152k - $241.5k
...We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep Learning by helping build... ...be scaled to multiple platforms for functionality and performance Develop components of TensorRT, NVIDIA’s SDK for high...SeniorPerformance$152k - $241.5k
..., container, GPU, and systems engineers. When useful, you will apply machine learning and deep learning techniques for categorization... .../prediction) inside existing software workflows. What we need to... ...and HPC / large-scale or performance-sensitive environments...SeniorPerformance$152k - $241.5k
...NVIDIA is seeking outstanding senior engineers to work on the CUDA driver,... .... You will join a versatile software engineering team that... ...unlock the full potential and performance of NVIDIA hardware across diverse workloads like deep learning, scientific research, autonomous...SeniorPerformance$184k - $287.5k
...building AI/ML and HPC software solutions at scale. As... ...in understanding performance aspects related to tasks like large scale LLM training and inference... ...in Electrical/Computer Engineering, Computer Science, Physics... ...-on experience with Deep Learning frameworks (PyTorch, JAX...SeniorPerformance$168k - $270.25k
...Senior Engineer For Factory Infrastructure And Automation... ...optimizes and serves performant inferencing for every... .... You will apply your deep technical expertise to... ...intersecting our prowess in deep learning and computing, with... ...hardware and software environments. You will...SeniorPerformance$184k - $287.5k
...Autonomous Driving Systems Engineer Today, NVIDIA is tapping... ...where everyone is motivated to perform at their highest level. Come... ..., data analysis, and software architecture. ~ Strong software... ...trade-offs between End-to-End deep learning approaches, classical...SeniorPerformance$224k - $356.5k
...System Software Engineer For Vision Ai NVIDIA's technology is at the heart of the AI revolution... ...using advanced computer vision and deep learning. Our team builds large-scale... ..., you will develop and optimize high-performance vision systems that turn massive streams...SeniorPerformance
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Deep Learning Software Engineer, LLM Performance. Be the first to apply!
- software engineer full time Santa Clara, CA
- startup software engineer Santa Clara, CA
- rust software engineer Santa Clara, CA
- work from home software developer Santa Clara, CA
- software developer Santa Clara, CA
- software development engineer aws Santa Clara, CA
- software qa engineer Santa Clara, CA
- ngo software engineer Santa Clara, CA
- software engineer staff Santa Clara, CA
- software engineer Santa Clara, CA

