Deep Learning Software Engineer, TensorRT Performance - New College Grad 2026

$124k - $195.5k

NVIDIA

We are now looking for a Deep Learning Software Engineer, TensorRT Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about analyzing and improving the performance of NVIDIA’s inference ecosystem! NVIDIA is rapidly growing our research and development for Deep Learning Inference and is seeking excellent Software Engineers at all levels of expertise to join our team. Companies around the world are using NVIDIA GPUs to power a revolution in deep learning, enabling breakthroughs in areas like Generative AI, Recommenders and Vision that have put DL into every software solution. Join the team that builds the software to enable the performance optimization, deployment and serving of these DL inference solutions. We specialize in developing GPU-accelerated deep learning inference software like TensorRT, DL benchmarking software and performant solutions to deploy and serve these models.

Collaborate with the deep learning community to integrate TensorRT into OSS frameworks like TensorRT-EdgeLLM and PyTorch. Identify performance opportunities and optimize SoTA models across the spectrum of NVIDIA accelerators, from datacenter GPUs to edge SoCs. Implement graph compiler algorithms, frontend operators and code generators across NVIDIA’s inference ecosystem. Work and collaborate with a diverse set of teams involving workflow improvements, performance modeling, performance analysis, kernel development and inference software development.

What you'll be doing:

  • Establish groundbreaking performance benchmarking methodologies and analysis workflows, and identify performance issues and opportunities for NVIDIA’s inference ecosystem (e.g. TensorRT/TensorRT-EdgeLLM/Torch-TensorRT).

  • Contribute features and code to NVIDIA/OSS inference frameworks including but not limited to TensorRT/TensorRT-EdgeLLM/Torch-TensorRT.

  • Develop new model pipelines for NVIDIA’s inference ecosystem with optimized performance including but not limited to areas like quantization, scheduling, memory management, and distributed inference to set the gold standard for Gen AI performance.

  • Work with cross-collaborative teams inside and outside of NVIDIA across generative AI, automotive, robotics, image understanding, and speech understanding to set directions and develop innovative inference solutions.

  • Scale performance of deep learning models across different architectures and types of NVIDIA accelerators.

What we need to see:

  • Bachelor's, Master's, or PhD degree, or equivalent experience, in a relevant field (Computer Science, Computer Engineering, EECS, AI).

  • 2 years of relevant software development experience.

  • Strong C++ and Python programming and software engineering skills.

  • Experience with DL frameworks (e.g. PyTorch, JAX, TensorFlow, ONNX) and inference libraries (e.g. TensorRT, TensorRT-LLM, vLLM, SGLang, FlashInfer).

  • Experience with performance analysis and performance optimization.

Ways to stand out from the crowd:

  • Strong foundation and architectural knowledge of GPUs.

  • Deep understanding of modern deep learning models and workloads (e.g. Transformers, Recommenders, ASR, TTS, Visual Understanding).

  • Proficiency in one of the deep learning domain-specific programming languages (e.g. CUDA/TileIR/CuTeDSL/cutlass/Triton).

  • Prior contributions to major LLM inference frameworks (e.g. vLLM) or prior experience with graph compilers in deep learning inference (e.g. TorchDynamo/TorchInductor).

  • Prior experience optimizing performance for low-latency, resource-constrained systems or embedded AI pipelines (e.g. Jetson systems or other edge AI accelerators).

GPU deep learning has provided the foundation for machines to learn, perceive, reason and solve problems posed using human language. The GPU started out as the engine for simulating human imagination, conjuring up the amazing virtual worlds of video games and Hollywood films. Now, NVIDIA's GPU runs deep learning algorithms, simulating human intelligence, and acts as the brain of computers, robots and self-driving cars that can perceive and understand the world. Just as human imagination and intelligence are linked, computer graphics and artificial intelligence come together in our architecture. Two modes of the human brain, two modes of the GPU. This may explain why NVIDIA GPUs are used broadly for deep learning, and NVIDIA is increasingly known as “the AI computing company.” Come, join our DL Architecture team, where you can help build a real-time, cost-effective computing platform driving our success in this exciting and quickly growing field.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 124,000 USD - 195,500 USD for Level 2, and 152,000 USD - 241,500 USD for Level 3.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until April 7, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Vacancy posted 3 days ago