Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Deep Learning Software Engineer, TensorRT Performance - New College Grad 2026

$124k - $195.5k

NVIDIA Gruppe

Deep Learning Software Engineer, TensorRT Performance We are now looking for a Deep Learning Software Engineer, TensorRT Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about analyzing and improving performance of NVIDIA’s inference ecosystem. NVIDIA is rapidly growing our research and development for Deep Learning Inference and is seeking excellent Software Engineers at all levels of expertise to join our team. Companies around the world are using NVIDIA GPUs to power a revolution in deep learning, enabling breakthroughs in areas like Generative AI, Recommenders and Vision that have put DL into every software solution. Join the team that builds the software to enable performance optimization, deployment and serving of these DL inference solutions. We specialize in developing GPU-accelerated deep learning inference software like TensorRT, DL benchmarking software and performant solutions to deploy and serve these models. Collaborate with the deep learning community to integrate TensorRT into OSS frameworks like TensorRT-EdgeLLM and PyTorch. Identify performance opportunities and optimize SoTA models across the spectrum of NVIDIA accelerators, from datacenter GPUs to edge SoCs. Implement graph compiler algorithms, frontend operators and code generators across NVIDIA’s inference ecosystem. Work and collaborate with a diverse set of teams involving workflow improvements, performance modeling, performance analysis, kernel development and inference software development. What you’ll be doing Establish groundbreaking performance benchmarking methodologies and analysis workflows and identify performance issues and opportunities for NVIDIA’s inference ecosystem (e.g. TensorRT/TensorRT-EdgeLLM/Torch-TensorRT) Contribute features and code to NVIDIA/OSS inference frameworks including but not limited to TensorRT/TensorRT-EdgeLLM/Torch-TensorRT. Develop new model pipelines for NVIDIA’s inference ecosystem with optimized performance including but not limited to areas like quantization, scheduling, memory management, and distributed inference to set the gold standard for Gen AI performance. Work with cross-collaborative teams inside and outside of NVIDIA across generative AI, automotive, robotics, image understanding, and speech understanding to set directions and develop innovative inference solutions. Scale performance of deep learning models across different architectures and types of NVIDIA accelerators. What we need to see Bachelors, Masters, PhD, or equivalent experience in relevant fields (Computer Science, Computer Engineering, EECS, AI). 2 years of relevant software development experience. Strong C++, Python programming and software engineering skills. Experience with DL frameworks (e.g. PyTorch, JAX, TensorFlow, ONNX) and inference libraries (e.g. TensorRT, TensorRT-LLM, vLLM, SGLang, FlashInfer). Experience with performance analysis and performance optimization. Ways to stand out from the crowd Strong foundation and architectural knowledge of GPUs. Deep understanding of modern deep learning models and workloads (e.g. Transformers, Recommenders, ASR, TTS, Visual Understanding). Proficiency in one of the deep learning programming domain specific languages (e.g. CUDA/TileIR/CuTeDSL/cutlass/Triton). Prior contributions to major LLM inference frameworks (e.g. vLLM) or prior experience with graph compilers in deep learning inference (e.g. TorchDynamo/TorchInductor). Prior experience optimizing performance for low-latency, resource-constrained systems or embedded AI pipelines (e.g. Jetson systems or other edge AI accelerators). Compensation Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 124,000 USD - 195,500 USD for Level 2, and 152,000 USD - 241,500 USD for Level 3. You will also be eligible for equity and benefits. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. #J-18808-Ljbffr NVIDIA Gruppe

Vacancy posted 5 days ago
Similar jobs that could be interesting for youBased on the Deep Learning Software Engineer, TensorRT Performance - New College Grad 2026 in Santa Clara, CA vacancy
  • $124k - $195.5k

    We are now looking for a Deep Learning Software Engineer, TensorRT Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about analyzing...  ...TensorRT/TensorRT‑EdgeLLM/Torch‑TensorRT. Develop new model pipelines for NVIDIA’s inference ecosystem with... 
    New grad
    Performance

    NVIDIA

    Santa Clara, CA
    5 days ago
  • $124k - $195.5k

     ...driving innovation in deep learning and eager to work...  ...? Join NVIDIA's TensorRT team as a Software Engineer, and be at the forefront...  ...to high-performance AI inference solutions...  ...understand and leverage new technologies to improve...  ...until June 2, 2026. This posting is... 
    New grad
    Performance
    Internship

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $124k - $195.5k

     ...intelligence, running deep learning algorithms and...  ...Developer Technology Engineer to push the limits of performance at the intersection...  ...architectures, system software, libraries, and...  ...production. Experience with TensorRT, TensorRT-LLM, and...  ...until April 13, 2026. This posting is... 
    New grad
    Performance
    Internship

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $124k - $195.5k

     ...the world. We are looking for a Performance Architect for Deep Learning Software! NVIDIA is seeking extraordinary...  ...PhD in Computer Science, Electrical Engineering or Computer Engineering, or...  ...accepted at least until April 20, 2026. This posting is for an existing... 
    New grad
    Performance
    Work experience placement

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $116k - $189.75k

    ## Software R&D Engineer, Digital Logic Synthesis - New College Grad 2026Applylocations: US, CA, Santa Clara:...  ...parallel computing, machine learning, and novel algorithms...  ....* Explore high performance algorithms for clustering...  ...least until May 19, 2026.This posting is for an... 
    New grad
    Performance

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $116k - $189.75k

     ...human intelligence, running deep learning algorithms and acting as the...  ...for a dedicated and motivated Software developer with particular interest...  ...degree or higher in Computer Engineer or Electrical Engineering (or...  ...at least until June 7, 2026. This posting is for an existing... 
    New grad
    Full time

    NVIDIA

    Santa Clara, CA
    18 hours ago
  • $168k - $264.5k

     ...the world. We, the Human Performance and Experience (HPX)...  ...interaction to shape perception, learning, and behavior. We...  ...scientists and engineers from varied backgrounds...  ...and interest in modern deep learning models, including...  ...least until January 25, 2026. This posting is for an... 
    New grad
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $124k - $195.5k

     ...enjoy researching new algorithms and...  ..., and optimizing performance of data intensive...  ...More recently, GPU deep learning ignited modern AI...  ...hardware architectures, software, and programming...  ..., Computer Engineering, or related computationally...  ...until April 28, 2026. This posting... 
    New grad
    Performance

    NVIDIA

    Santa Clara, CA
    26 days ago
  • $124k - $195.5k

     ...are now looking for a Deep Learning Computer Architect!...  ...hardware architects, and software engineers. Your day to day...  ...methods, proposing new features to...  ...architecture; ~ Performance analysis and optimization...  ...least until June 5, 2026. This posting is for... 
    New grad
    Performance
    Night shift

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $124k - $195.5k

     ...Deep Learning Architect NVIDIA is looking for a Deep Learning...  ...datacenters, we need engineers who can model,...  ...you have a passion for performance analysis, a strong quantitative...  ...with hardware, software, and research teams to...  ...at least until June 7, 2026. NVIDIA uses AI... 
    New grad
    Performance

    NVIDIA

    Santa Clara, CA
    13 hours ago
  • $120k - $165k

     ...cutting-edge perception and deep learning models, including...  ...2025 and June 2026. Strong programming...  ...odometry. Excellent software and computing fundamentals...  ...eligible for an annual performance bonus, equity, and a competitive...  ...interviewing for a new role is significant,... 
    New grad
    Performance

    WeRide.ai

    San Jose, CA
    1 day ago
  • $116k - $189.75k

    ## Power Architect - New College Grad 2026Applylocations:...  ...intelligence, running deep learning algorithms and...  ...with other Architects, Software Engineers, ASIC Design Engineers...  ...accurate power and performance models of our GPUs and...  ...least until May 15, 2026.This posting is for... 
    New grad
    Performance
    Internship

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • Performance Architect - Deep Learning Software We are seeking a Performance Architect to develop processor and system architectures that accelerate machine...  ...Master’s or PhD in Computer Science, Electrical Engineering, Computer Engineering, or equivalent experience. Proven... 
    New grad
    Performance
    Work experience placement

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $116k - $189.75k

     ...is seeking highly motivated Engineers to help in the development and...  ...their seamless and efficient performance. If you’re passionate about the...  ...advancements in machine learning and AI to create state‑of‑the...  ...accepted at least until April 4, 2026. This posting is for an... 
    New grad
    Performance

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

     ...now looking for a Senior Software Engineer for Deep Learning Inference! Would you like...  ...the Deep Learning Inference TensorRT software team. What you...  ...platforms for functionality and performance Develop components of...  ...at least until March 21, 2026. This posting is for... 
    Performance

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $124k - $195.5k

     ...position. Join forces with the performance marketing team to build...  ...achievements. Collaborate with engineers from AI startup companies...  ...AI. Contribute to deep learning software projects such as PyTorch, TRT...  ...in the field. Verify that new GPU product launches produce... 
    New grad
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  •  ...speeds and empowers machine learning users to effortlessly run...  ...About The Role As a New Graduate Software Engineer, you will collaborate with...  ...systems that directly impact performance, scalability, reliability,...  ...discipline (graduating in 2026). Proficiency in C/C++... 
    New grad
    Performance
    Internship

    Cerebras

    Sunnyvale, CA
    4 days ago
  • $124k - $195.5k

    Deep Learning Architect, LLM Inference NVIDIA is at the forefront...  ...on inference server performance optimization for Large...  ...of GPU hardware and software performance and...  ...achievements. Collaborate with engineers from AI startup...  ...the field. Verify that new GPU product launches... 
    New grad
    Performance

    NVIDIA

    Santa Clara, CA
    4 days ago
  • $168k - $264.5k

     ...Gaming! NVIDIA is seeking world‑class deep learning researchers to build the next...  ...time graphics quality, robustness, performance, and latency, and enable new interactive experiences such as world...  ..., spanning synthetic data, in‑engine captures, and real‑world content.... 
    New grad
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  •  ...for a motivated Circuit Design Engineer to join our dynamic and...  ...cutting edge processor design in deep submicron technologies. Work as...  ...functional teams to improve the performance, reliability and security of NVIDIA...  ...accepted at least until May31,2026. EEO Statement: NVIDIA is... 
    New grad
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $152k - $241.5k

     ...redefining how software is built in...  ...Join NVIDIA’s TensorRT team to help lead...  ...cycles with a new framework...  ...produce high-performance, high-quality,...  ...-thinking C++ engineer who wants to help...  ...state-of-the-art deep learning breakthroughs,...  ...April 25, 2026. This posting... 
    Performance

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $120k - $165k

     ..., onboard sensors, and learning-based methods to ensure...  ...December 2025 and June 2026 . Proficient in C++...  .... We develop diverse deep learning models using perceptual...  ...eligible for an annual performance bonus, equity, and a...  ...interviewing for a new role is significant,... 
    New grad
    Performance

    WeRide.ai

    San Jose, CA
    22 days ago
  •  ...is seeking a Power Architect for New College Grad 2026 in Santa Clara, CA. You will be...  ...GPU applications, including deep learning and gaming. Candidates should be...  ...or PhD in Electrical or Computer Engineering, with knowledge of performance simulators and programming tools... 
    New grad
    Performance

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • NVIDIA Gruppe in Santa Clara is seeking a Deep Learning Software Engineer focused on improving performance of deep learning inference software like TensorRT. The ideal candidate will have a strong foundation in C++ and Python, relevant experience with deep learning frameworks... 
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • NVIDIA is looking for a Senior Deep Learning Software Engineer to enhance its TensorRT performance and inference ecosystem. The role focuses on optimizing performance across various NVIDIA accelerators and developing groundbreaking methodologies. Applicants should have... 
    Performance

    NVIDIA

    Santa Clara, CA
    5 days ago
  • $168k - $264.5k

     ...the ability to scale process to increase performance and reduce power, we must rely more and...  ...will be accepted at least until April18,2026. What you’ll be doing: Explore circuit...  ...or equivalent experience in Electrical Engineering, Computer Science/Engineering, or related... 
    New grad
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $108k - $178.25k

     ...Scheduling Architect - New College Grad Today, NVIDIA...  ...in concert with software, hardware, and...  ...functional and performance models, often in...  ...activities Constantly learning and growing!...  ..., or Electrical Engineering (or related degree...  ...until May 14, 2026. This posting... 
    New grad
    Performance

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $100k - $166.75k

     ...implement, and deliver fully verified and high‑performance RTL to achieve design targets. What we...  ...see Bachelors or Masters in Electrical Engineering or Computer Engineering (or equivalent...  ...job will be accepted until April 5, 2026. Equal Opportunity Employer NVIDIA is committed... 
    New grad
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $108k - $178.25k

     ...looking for creative architects and software developers to join and lead...  ..., delivering the highest performance in the world for graphics processing...  ...our leadership by developing new programming models, new...  ...accepted at least until February 16, 2026. This posting is for an... 
    New grad
    Performance

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $124k - $195.5k

     ...highly motivated Compiler Software Engineer to join this dynamic...  ...deployment of high performance parallel applications...  ..., ranging from deep learning, scientific computation...  ...factoring in support for new GPU hardware...  ...least until May 30, 2026. This posting is for... 
    New grad
    Performance

    NVIDIA

    Santa Clara, CA
    8 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Deep Learning Software Engineer, TensorRT Performance - New College Grad 2026. Be the first to apply!