Senior Deep Learning Software Engineer, TensorRT Performance

$152k - $241.5k

NVIDIA

We are now looking for a Senior Deep Learning Software Engineer, TensorRT Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about analyzing and improving the performance of NVIDIA's inference ecosystem! NVIDIA is rapidly growing our research and development for Deep Learning Inference and is seeking excellent Software Engineers at all levels of expertise to join our team. Companies around the world are using NVIDIA GPUs to power a revolution in deep learning, enabling breakthroughs in areas like Generative AI, Recommenders and Vision that have put DL into every software solution. Join the team that builds the software to enable the performance optimization, deployment and serving of these DL inference solutions. We specialize in developing GPU-accelerated deep learning inference software like TensorRT, DL benchmarking software and performant solutions to deploy and serve these models.

Collaborate with the deep learning community to integrate TensorRT into OSS frameworks like TensorRT-EdgeLLM and PyTorch. Identify performance opportunities and optimize SoTA models across the spectrum of NVIDIA accelerators, from datacenter GPUs to edge SoCs. Implement graph compiler algorithms, frontend operators and code generators across NVIDIA's inference ecosystem. Work and collaborate with a diverse set of teams involving workflow improvements, performance modeling, performance analysis, kernel development and inference software development.

What you'll be doing:
  • Establish groundbreaking performance benchmarking methodologies and analysis workflows, and identify performance issues and opportunities for NVIDIA's inference ecosystem (e.g. TensorRT/TensorRT-EdgeLLM/Torch-TensorRT).
  • Contribute features and code to NVIDIA/OSS inference frameworks including but not limited to TensorRT/TensorRT-EdgeLLM/Torch-TensorRT.
  • Develop new model pipelines for NVIDIA's inference ecosystem with optimized performance including but not limited to areas like quantization, scheduling, memory management, and distributed inference to set the gold standard for Gen AI performance.
  • Work with cross-collaborative teams inside and outside of NVIDIA across generative AI, automotive, robotics, image understanding, and speech understanding to set directions and develop innovative inference solutions.
  • Scale performance of deep learning models across different architectures and types of NVIDIA accelerators.
What we need to see:
  • Bachelor's, Master's, PhD, or equivalent experience in relevant fields (Computer Science, Computer Engineering, EECS, AI).
  • At least 3 years of relevant software development experience.
  • Strong C++ and Python programming and software engineering skills.
  • Experience with DL frameworks (e.g. PyTorch, JAX, TensorFlow, ONNX) and inference libraries (e.g. TensorRT, TensorRT-LLM, vLLM, SGLang, FlashInfer).
  • Experience with performance analysis and performance optimization.
Ways to stand out from the crowd:
  • Strong foundation and architectural knowledge of GPUs.
  • Deep understanding of modern deep learning models and workloads (e.g. Transformers, Recommenders, ASR, TTS, Visual Understanding).
  • Proficiency in one of the domain-specific programming languages for deep learning (e.g. CUDA/TileIR/CuTeDSL/CUTLASS/Triton).
  • Prior contributions to major LLM inference frameworks (e.g. vLLM) or prior experience with graph compilers in deep learning inference (e.g. TorchDynamo/TorchInductor).
  • Prior experience optimizing performance for low-latency, resource-constrained systems or embedded AI pipelines (e.g. Jetson systems or other edge AI accelerators).

GPU deep learning has provided the foundation for machines to learn, perceive, reason and solve problems posed using human language. The GPU started out as the engine for simulating human imagination, conjuring up the amazing virtual worlds of video games and Hollywood films. Now, NVIDIA's GPU runs deep learning algorithms, simulating human intelligence, and acts as the brain of computers, robots and self-driving cars that can perceive and understand the world. Just as human imagination and intelligence are linked, computer graphics and artificial intelligence come together in our architecture. Two modes of the human brain, two modes of the GPU. This may explain why NVIDIA GPUs are used broadly for deep learning, and NVIDIA is increasingly known as "the AI computing company." Come, join our DL Architecture team, where you can help build a real-time, cost-effective computing platform driving our success in this exciting and quickly growing field.

#LI-Hybrid

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until March 26, 2026.

This posting is for an existing vacancy.


NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Vacancy posted 4 days ago