Deep Learning Software Engineer, TensorRT Performance - New College Grad 2026
$124k - $195.5kNVIDIA
We are now looking for a Deep Learning Software Engineer, TensorRT Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about analyzing and improving the performance of NVIDIA’s inference ecosystem! NVIDIA is rapidly growing our research and development for Deep Learning Inference and is seeking excellent Software Engineers at all levels of expertise to join our team. Companies around the world are using NVIDIA GPUs to power a revolution in deep learning, enabling breakthroughs in areas like Generative AI, Recommenders and Vision that have put DL into every software solution. Join the team that builds the software to enable the performance optimization, deployment and serving of these DL inference solutions. We specialize in developing GPU-accelerated deep learning inference software like TensorRT, DL benchmarking software and performant solutions to deploy and serve these models.
Collaborate with the deep learning community to integrate TensorRT into OSS frameworks like TensorRT-EdgeLLM and PyTorch. Identify performance opportunities and optimize SoTA models across the spectrum of NVIDIA accelerators, from datacenter GPUs to edge SoCs. Implement graph compiler algorithms, frontend operators and code generators across NVIDIA’s inference ecosystem. Work and collaborate with a diverse set of teams involving workflow improvements, performance modeling, performance analysis, kernel development and inference software development.
What you'll be doing:
Establish groundbreaking performance benchmarking methodologies and analysis workflows and identify performance issues and opportunities for NVIDIA’s inference ecosystem (e.g. TensorRT/TensorRT-EdgeLLM/Torch-TensorRT)
Contribute features and code to NVIDIA/OSS inference frameworks including but not limited to TensorRT/TensorRT-EdgeLLM/Torch-TensorRT.
Develop new model pipelines for NVIDIA’s inference ecosystem with optimized performance including but not limited to areas like quantization, scheduling, memory management, and distributed inference to set the gold standard for Gen AI performance.
Work with cross-collaborative teams inside and outside of NVIDIA across generative AI, automotive, robotics, image understanding, and speech understanding to set directions and develop innovative inference solutions.
Scale performance of deep learning models across different architectures and types of NVIDIA accelerators.
What we need to see:
Bachelors, Masters, PhD, or equivalent experience in relevant fields (Computer Science, Computer Engineering, EECS, AI).
2 years of relevant software development experience.
Strong C++, Python programming and software engineering skills
Experience with DL frameworks (e.g. PyTorch, JAX, TensorFlow, ONNX) and inference libraries (e.g. TensorRT, TensorRT-LLM, vLLM, SGLang, FlashInfer).
Experience with performance analysis and performance optimization
Ways to stand out from the crowd:
Strong foundation and architectural knowledge of GPUs.
Deep understanding of modern deep learning models and workloads (e.g. Transformers, Recommenders, ASR, TTS, Visual Understanding).
Proficiency in one of the deep learning programming domain specific languages (e.g. CUDA/TileIR/CuTeDSL/cutlass/Triton).
Prior contributions to major LLM inference frameworks (e.g. vLLM) or prior experience with graph compilers in deep learning inference (e.g. TorchDynamo/TorchInductor).
Prior experience optimizing performance for low-latency, resource-constrained systems or embedded AI pipelines (e.g. Jetson systems or other edge AI accelerators).
GPU deep learning has provided the foundation for machines to learn, perceive, reason and solve problems posed using human language. The GPU started out as the engine for simulating human imagination, conjuring up the amazing virtual worlds of video games and Hollywood films. Now, NVIDIA's GPU runs deep learning algorithms, simulating human intelligence, and acts as the brain of computers, robots and self-driving cars that can perceive and understand the world. Just as human imagination and intelligence are linked, computer graphics and artificial intelligence come together in our architecture. Two modes of the human brain, two modes of the GPU. This may explain why NVIDIA GPUs are used broadly for deep learning, and NVIDIA is increasingly known as “the AI computing company.” Come, join our DL Architecture team, where you can help build a real-time, cost-effective computing platform driving our success in this exciting and quickly growing field.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 124,000 USD - 195,500 USD for Level 2, and 152,000 USD - 241,500 USD for Level 3.
You will also be eligible for equity and benefits ( .
Applications for this job will be accepted at least until April 7, 2026.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
$124k - $195.5k
...driving innovation in deep learning and eager to work... ...? Join NVIDIA's TensorRT team as a Software Engineer, and be at the forefront... ...to high‑performance AI inference solutions... ...understand and leverage new technologies to improve... ...until June 2, 2026 EEO Statement NVIDIA...New gradPerformanceInternship$124k - $195.5k
...outstanding AI systems engineers to develop... ...inference systems software stack! We build innovative... ...things like new abstractions, efficient... ...at NVIDIA across deep learning frameworks,... ...kernel development and performance optimizations (... ...least until June 19, 2026. This posting...New gradPerformance$124k - $195.5k
...intelligence, running deep learning algorithms and... ...Developer Technology Engineer to push the limits of performance at the intersection... ...architectures, system software, libraries, and... ...production. Experience with TensorRT, TensorRT-LLM, and... ...until April 13, 2026. This posting is...New gradPerformanceInternship$124k - $195.5k
Deep Learning Kernel Software Performance Architect - New College Grad 2026 page is loaded## Deep Learning Kernel Software Performance Architect - New College Grad 2... ...Master's or PhD in Computer Science, Electrical Engineering or Computer Engineering, or equivalent...New gradPerformanceWork experience placement$100k - $166.75k
NVIDIA’s deep learning platform is widely used across leading academic institutions... ...people to help us explore new GPU application opportunities... ...space. What you’ll be doing: Perform system design reviews for... ...BS, MS, or PhD in Electrical Engineering, Computer Engineering,...New gradPerformance$116k - $189.75k
## Software Engineer, Hardware Tools and Methodology - New College Grad 2026Applylocations: US, CA, Santa Claratime type: Full timeposted... ...human intelligence, running deep learning algorithms and acting as the... ...at least until June 7, 2026.This posting is for an existing...New grad$116k - $189.75k
...motivated Circuit Design Engineer to join our dynamic... ...artificial intelligence and deep learning have ignited the era... ...evolves by seizing new opportunities and... ...teams to improve the performance, reliability and security... ...at least until May 31, 2026.This posting is for an...New gradPerformance$168k - $264.5k
...the world. We, the Human Performance and Experience (HPX)... ...interaction to shape perception, learning, and behavior. We... ...scientists and engineers from varied backgrounds... ...and interest in modern deep learning models, including... ...least until January 25, 2026. This posting is for an...New gradPerformance$124k - $195.5k
...are now looking for a Deep Learning Computer Architect!... ...hardware architects, and software engineers. Your day to day... ...learning methods, proposing new features to... ...level architecture; Performance analysis and optimization... ...least until June 5, 2026. This posting is for...New gradPerformance$124k - $195.5k
..., Data Processing - New College Grad 2026 page is loaded## Solutions... ...More recently, GPU deep learning ignited modern AI —... ...GPU-accelerate high performance database, ETL and... ...architectures, software, and programming models... ...Computer Science, Computer Engineering, or related...New gradPerformance$124k - $195.5k
## Systems Performance Engineer, Agentic AI Workloads - New College Grad 2026Applylocations: US, CA, Santa... ...NVIDIA is looking for a Deep Learning Architect to join our... ...with hardware, software, and research teams to... ...at least until June 7, 2026.This posting is for an...New gradPerformance- Performance Architect - Deep Learning Software We are seeking a Performance Architect to develop processor and system architectures that accelerate machine... ...Master’s or PhD in Computer Science, Electrical Engineering, Computer Engineering, or equivalent experience. Proven...New gradPerformanceWork experience placement
$116k - $189.75k
...is seeking highly motivated Engineers to help in the development and... ...their seamless and efficient performance. If you’re passionate about the... ...advancements in machine learning and AI to create state‑of‑the... ...accepted at least until April 4, 2026. This posting is for an...New gradPerformance$116k - $189.75k
...parallel computing, machine learning, and novel algorithms in... ...seeking an innovative EDA Software R&D Engineer with particular interest in... ...doing: Invent and develop new algorithms for RTL synthesis... ...into production. Explore high performance algorithms for clustering,...New gradPerformance$152k - $241.5k
Senior Software Engineer - Deep Learning Inference What you’ll be doing: Craft and develop robust... ...for functionality and performance Develop components of TensorRT, NVIDIA’s SDK for high-performance... ...accepted at least until March 21, 2026. NVIDIA is committed to fostering...Performance- ...speeds and empowers machine learning users to effortlessly run... ...About The Role As a New Graduate Software Engineer, you will collaborate with... ...systems that directly impact performance, scalability, reliability,... ...discipline (graduating in 2026). Proficiency in C/C++...New gradPerformanceInternship
- ...NVIDIA Corporation is seeking a Software Engineer for the TensorRT team in Santa Clara, California. This role will focus on developing cutting-edge... ..., requiring strong C++ skills and an understanding of deep learning models. The ideal candidate will have a Master's or PhD...New grad
$168k - $264.5k
...Gaming! NVIDIA is seeking world‑class deep learning researchers to build the next... ...time graphics quality, robustness, performance, and latency, and enable new interactive experiences such as world... ..., spanning synthetic data, in‑engine captures, and real‑world content....New gradPerformance$116k - $189.75k
...human intelligence, running deep learning algorithms and acting as the... ...Our team builds AI-driven software systems for circuit design,... ...PhD in Electrical or Computer Engineering, Computer Science, or Applied... ...accepted at least until May 1, 2026. This posting is for an...New grad- NVIDIA Gruppe in Santa Clara is seeking a Deep Learning Software Engineer focused on improving performance of deep learning inference software like TensorRT. The ideal candidate will have a strong foundation in C++ and Python, relevant experience with deep learning frameworks...Performance
- NVIDIA is looking for a Senior Deep Learning Software Engineer to enhance its TensorRT performance and inference ecosystem. The role focuses on optimizing performance across various NVIDIA accelerators and developing groundbreaking methodologies. Applicants should have...Performance
$168k - $264.5k
...the ability to scale process to increase performance and reduce power, we must rely more and... ...will be accepted at least until April18,2026. What you’ll be doing: Explore circuit... ...or equivalent experience in Electrical Engineering, Computer Science/Engineering, or related...New gradPerformance$152k - $241.5k
...redefining how software is built in... ...Join NVIDIA’s TensorRT team to help lead... ...cycles with a new framework... ...produce high-performance, high-quality,... ...‑thinking C++ engineer who wants to help... ...state‑of‑the‑art deep learning breakthroughs,... ...April 25, 2026 . Equal Opportunity...Performance$116k - $189.75k
...Signal & Power Integrity Engineer! NVIDIA has... ...computing. More recently, GPU deep learning ignited modern AI — the... ...by adapting to new opportunities that are... ...integrity simulations of high-performance AI systems, graphic... ...at least until May 16, 2026. This posting is for...New gradPerformance$100k - $166.75k
...implement, and deliver fully verified and high‑performance RTL to achieve design targets. What we... ...see Bachelors or Masters in Electrical Engineering or Computer Engineering (or equivalent... ...job will be accepted until April 5, 2026. Equal Opportunity Employer NVIDIA is committed...New gradPerformance$124k - $195.5k
...Architecture Group seeks new college graduates to join... ...GPU profiling and performance telemetry features... ..., performance engineering, drivers, and... ...extreme hardware-software codesign process that... ...across teams Develop deep knowledge of GPU... ...by summer 2026) Foundation in computer...New gradPerformanceSummer work$108k - $178.25k
...looking for creative architects and software developers to join and lead... ..., delivering the highest performance in the world for graphics processing... ...our leadership by developing new programming models, new... ...accepted at least until February 16, 2026. This posting is for an...New gradPerformance- ...and partners. Collaborate with software developers, program managers,... ..., Electronics, Software Engineering, or related fields. Proficient... ...remote debugging and Windows performance tracing tools (e.g., ETW). Knowledge... ...at least until June19,2026. This posting is for an existing...New gradPerformanceRemote work
$116k - $189.75k
...computing. More recently, GPU deep learning ignited modern AI — the... ...evolves by adapting to new opportunities that are... ...Clocks Verification Engineer. The team is... ...teams. Collaborate with Software and product group to debug... ...at least until May 4, 2026. This posting is for...New grad$168k - $264.5k
...searching for a world‑class new college grad PhD researcher to... ...Automation (EDA). Deep learning and GPU acceleration... ...techniques. Innovate in EDA software and algorithms, with... ...Electrical/Computer Engineering, or related field (or... ...least until June 7, 2026. This posting is for...New grad
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Deep Learning Software Engineer, TensorRT Performance - New College Grad 2026. Be the first to apply!
- software engineer amazon Santa Clara, CA
- agile software developer Santa Clara, CA
- rust software engineer Santa Clara, CA
- software developer positions Santa Clara, CA
- senior software design engineer Santa Clara, CA
- software developer Santa Clara, CA
- ngo software engineer Santa Clara, CA
- startup software engineer Santa Clara, CA
- software development engineer (robotics engineer) Santa Clara, CA
- scientific software engineer Santa Clara, CA

