Senior Deep Learning Software Engineer, TensorRT Performance
$152k - $241.5k
NVIDIA
We are now looking for a Senior Deep Learning Software Engineer, TensorRT Performance! NVIDIA is seeking an experienced Deep Learning Engineer passionate about analyzing and improving the performance of NVIDIA's inference ecosystem! NVIDIA is rapidly growing our research and development for Deep Learning Inference and is seeking excellent Software Engineers at all levels of expertise to join our team. Companies around the world are using NVIDIA GPUs to power a revolution in deep learning, enabling breakthroughs in areas like Generative AI, Recommenders and Vision that have put DL into every software solution. Join the team that builds the software to enable the performance optimization, deployment and serving of these DL inference solutions. We specialize in developing GPU-accelerated deep learning inference software like TensorRT, DL benchmarking software and performant solutions to deploy and serve these models.
Collaborate with the deep learning community to integrate TensorRT into OSS frameworks like TensorRT-EdgeLLM and PyTorch. Identify performance opportunities and optimize SoTA models across the spectrum of NVIDIA accelerators, from datacenter GPUs to edge SoCs. Implement graph compiler algorithms, frontend operators and code generators across NVIDIA's inference ecosystem. Work and collaborate with a diverse set of teams involving workflow improvements, performance modeling, performance analysis, kernel development and inference software development.
What you'll be doing:
- Establish groundbreaking performance benchmarking methodologies and analysis workflows and identify performance issues and opportunities for NVIDIA's inference ecosystem (e.g. TensorRT/TensorRT-EdgeLLM/Torch-TensorRT).
- Contribute features and code to NVIDIA/OSS inference frameworks including but not limited to TensorRT/TensorRT-EdgeLLM/Torch-TensorRT.
- Develop new model pipelines for NVIDIA's inference ecosystem with optimized performance including but not limited to areas like quantization, scheduling, memory management, and distributed inference to set the gold standard for Gen AI performance.
- Work with cross-collaborative teams inside and outside of NVIDIA across generative AI, automotive, robotics, image understanding, and speech understanding to set directions and develop innovative inference solutions.
- Scale performance of deep learning models across different architectures and types of NVIDIA accelerators.
- Bachelor's, Master's, or PhD degree, or equivalent experience, in a relevant field (Computer Science, Computer Engineering, EECS, AI).
- At least 3 years of relevant software development experience.
- Strong C++ and Python programming and software engineering skills.
- Experience with DL frameworks (e.g. PyTorch, JAX, TensorFlow, ONNX) and inference libraries (e.g. TensorRT, TensorRT-LLM, vLLM, SGLang, FlashInfer).
- Experience with performance analysis and performance optimization.
- Strong foundation and architectural knowledge of GPUs.
- Deep understanding of modern deep learning models and workloads (e.g. Transformers, Recommenders, ASR, TTS, Visual Understanding).
- Proficiency in one of the domain-specific languages for deep learning programming (e.g. CUDA/TileIR/CuTeDSL/CUTLASS/Triton).
- Prior contributions to major LLM inference frameworks (e.g. vLLM) or prior experience with graph compilers in deep learning inference (e.g. TorchDynamo/TorchInductor).
- Prior experience optimizing performance for low-latency, resource-constrained systems or embedded AI pipelines (e.g. Jetson systems or other edge AI accelerators).
NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

