Senior Deep Learning Software Engineer, LLM Performance
NVIDIA Gruppe
We are now looking for a Senior Deep Learning Software Engineer, LLM Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about analyzing and improving the performance of LLM inference. This role focuses on designing and optimizing GPU-accelerated software for large language model deployment and serving.

What you'll be doing:
- Analyze, optimize, and tune the performance of LLM, VLM, and GenAI models for DL inference, serving, and deployment in NVIDIA/OSS LLM frameworks.
- Scale the performance of LLMs across different architectures and NVIDIA accelerators, from datacenter GPUs to edge SoCs.
- Achieve maximum throughput and minimum latency, and meet throughput targets under latency constraints.
- Contribute features and code to NVIDIA/OSS LLM frameworks, inference benchmarking frameworks, TensorRT, and Triton.
- Collaborate with cross‑functional teams in generative AI, automotive, image understanding, and speech understanding to develop innovative solutions.

What we need to see:
- Bachelor’s, Master’s, or PhD degree, or equivalent experience, in Computer Engineering, Computer Science, EECS, or AI.
- At least 8 years of relevant software development experience.
- Excellent Python, C, and C++ programming, software design, and engineering skills.
- Experience with a deep learning framework such as PyTorch, JAX, or TensorFlow.

Ways to stand out from the crowd:
- Prior experience with an LLM framework or a deep learning compiler in inference, deployment, algorithms, or implementation.
- Prior experience with performance modeling, profiling, debugging, and code optimization of a deep learning, HPC, or high‑performance application.
- Architectural knowledge of CPU and GPU systems.
- GPU programming experience (CUDA or OpenCL).

Compensation & Benefits
Base salary is determined by location, experience, and comparable roles: $184,000 – $287,500 for Level 4; $224,000 – $356,500 for Level 5. The role is also eligible for equity and benefits.

EEO Statement
NVIDIA is committed to fostering a diverse work environment and is an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.

