Senior Software Engineer - TensorRT Edge-LLM
$152k - $241.5kNVIDIA
Are you passionate about pushing the limits of real-time large language model inference? Join NVIDIA's TensorRT Edge-LLM team and help shape the next generation of edge AI for automotive and robotics. We build the software stack that enables Large Language, Vision-Language, and Multimodal (LLM/VLM/VLA) models to run efficiently on embedded and edge platforms - delivering cutting-edge generative AI experiences directly on-device.
What you'll be doing:- Develop and evolve a state-of-the-art inference framework in modern C++ that extends TensorRT with autoregressive model serving capabilities, including speculative decoding, LoRA, MoE, and KV cache management.
- Design and implement compiler and runtime optimizations tailored for transformer-based models running on constrained, real-time platforms.
- Collaborate with teams across CUDA, kernel libraries, compilers, and robotics to deliver high-performance, production-ready solutions.
- Contribute to CUDA kernel and operator development for critical transformer components such as attention, GEMM, and MoE.
- Benchmark, profile, and optimize inference performance across diverse embedded and automotive environments.
- Stay ahead of the rapidly evolving LLM/VLM ecosystem and bring emerging techniques into product-grade software.
- BS, MS, PhD, or equivalent experience in Computer Science, Electrical/Computer Engineering, or a closely related field.
- 4+ years of relevant software development experience.
- Deep understanding of transformer models and inference optimization techniques (e.g., quantization, tensor parallelism, or memory-efficient scheduling).
- Proficient programming ability with modern C++ (C++11/14/17 and beyond).
- Familiarity with popular LLM frameworks and libraries such as TensorRT, TensorRT-LLM, vLLM, SGLang, MLC-LLM, or FlashInfer.
- A track record of strong software design, execution, and collaboration across fields.
- Demonstrated development experience or open-source contributions to LLM inference frameworks and libraries, such as SGLang, vLLM, or FlashInfer.
- Proficiency with CUDA, including efficient kernel development, performance profiling, and GPU architecture fundamentals.
- Prior work on autoregressive LLM serving systems, including speculative decoding or KV cache management.
- Familiarity with compiler infrastructure for large language model inference.
- Exposure to robotics or embedded AI pipelines, including optimizing for low-latency, resource-constrained systems.
NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
$184k - $287.5k
...We are now looking for a Senior Deep Learning Software Engineer, LLM Performance! NVIDIA is seeking an experienced... ...accelerated Deep learning software like TensorRT, DL benchmarking software and... ...accelerators, from datacenter GPUs to edge SoCs. Implement LLM inference,...Senior$152k - $241.5k
...We are now looking for a Senior Deep Learning Software Engineer, TensorRT Performance! NVIDIA is seeking an experienced... ...accelerators, from datacenter GPUs to edge SoCs. Implement graph compiler... ...libraries (e.g. TensorRT, TensorRT-LLM, vLLM, SGLang, FlashInfer). Experience...SeniorRemote work$184k - $287.5k
Senior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous Vehicles page is loaded## Senior DL Software... ...SOTA algorithms to make LLM/VLM fast, lean, and reliable enough... ...automate deployment pipelines for TensorRT conversion.* Scale DL model performance...Senior- ...Senior Software Engineer, LLM Performance SF Bay Area (Hybrid) Parasail is redefining AI infrastructure by enabling seamless deployment across... ...to Kubernetes-based deployments. Contribute to cutting-edge open-source LLM engines such as vLLM or SGLang to extend...Senior
$40 - $100 per hour
...Remote Senior Software Engineer (LLM) - 34953 Remote Senior Software Engineer (LLM) - 34953 3 days ago Be among the first 25 applicants Get AI... ...across evaluations. Collaborate with the team to identify edge cases and ambiguities in model behavior. Required...SeniorFull timeContract workFor contractorsRemote work- ...Senior AI/ML DevOps Engineer Join Cisco's CX AI Incubation Team as... ...and help productionize LLM/SLM capabilities for... ...and intelligence on edge. You will collaborate... ...role requires strong software engineering, hands-on... ...(vLLM, Triton, TensorRT-LLM, llama.cpp). Exposure...Senior
$180k - $240k
...the next generation of powerful, meaningful products built with AI. Job Overview We’re seeking an exceptional Senior Software Engineer to join our LLM team. This role is focused on building and maintaining our LLM gateway service—a unified API platform that...SeniorFull timeRemote workEasy work- ...A cutting-edge technology company in the United States is seeking a Forward Deployed Engineer to lead software integration for robotic systems in real-time environments. This role requires a strong background in robotics software engineering, with hands-on experience...SeniorFlexible hours
- B Capital is seeking a backend Software Engineer to join the Einstein GPT Team in San Francisco. You will be building platform services for LLM technology supporting CRM cloud applications, focusing on performance, scalability, and efficiency. Ideal candidates have over...Senior
- ...healthcare technology company based in San Francisco is seeking a Senior Full-Stack Engineer to build and optimize core research infrastructure. This... ..., enhancing efficiency, and empowering doctors and patients by leveraging cutting-edge AI technologies. #J-18808-LjbffrSenior
- ...training pipelines, plus top AI researchers who specialize in software engineering, logical reasoning, STEM, multilinguality, multimodality, and... ...As a Software Engineering evaluator, you will create cutting-edge datasets for training, benchmarking, and advancing large...SeniorFor contractorsRemote workFlexible hours
- ...Texas Sports Academy Main is seeking a Software Engineer (AI-Forward) to build software for managing student records and AI tools. The role requires... ..., proficiency in AI coding tools, and involvement in LLM-powered features. You'll work closely with founders, moving features...SeniorRemote work
$152k - $241.5k
...We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep Learning by helping... ...Senior Engineering positions in the Deep Learning Inference TensorRT software team. What you’ll be doing: Craft and develop...Senior$100 per hour
...leader at the forefront of AI research and advanced system deployment, is looking for qualified Senior Software Engineers to assist in a one-time project to assist with their LLM training. Selected candidates will be asked to complete an AI interview, followed by...SeniorHourly payTemporary workRemote work$166k - $253k
...Anduril is committed to bringing cutting-edge autonomy, AI, computer vision, sensor... ...years. ABOUT THE TEAM Anduril Software Engineers in the Connected Warfare (CW) Division... ...delivery of winning capabilities. As a Senior Software Engineeron the ECC...SeniorFull timeFor contractorsWork experience placementImmediate start$196k - $294k
Nerdleveltech seeks a Senior Software Engineer to join their Trust & Safety team. Based in San Francisco, you'll protect millions of developers from... ...scale, using your skills in JavaScript, Python, and LLM methodologies. Responsibilities include analyzing threat actor...SeniorFlexible hours$168.1k - $227.4k
...deadline: May 27, 2026 AWS Edge Connectivity service enables... ...You'll architect and implement software security protocols across... ...standards Lead design and system engineers to perform tradeoffs and... ...-sharing and mentorship. Our senior members enjoy one-on-one mentoring...SeniorFull timeInternshipWork at officeFlexible hours$150k - $180k
A technology company in El Segundo is seeking a Senior Software Engineer to develop software systems for next-generation automated drilling rigs.... ...software with hardware, managing core infrastructure, and driving edge-computing strategies for analytics. The ideal candidate will...Senior$200k - $250k
...to offer a ticket to the millions of fans who browse our platform around the world. Join StubHub's Edge Services team as a Software/Security Engineer and help shape the global infrastructure that keeps millions of fans connected, secure, and engaged-right at...SeniorWork at officeRemote workWorldwideFlexible hours- ...Senior AI Engineer - RAG Systems Bright.AI is a high-growth Physical AI company transforming how... ...of real-world events-captured across edge devices, mobile sensors, and cloud infrastructure... ...are now hiring a Senior AI Engineer - LLM, RAG to lead the development of...SeniorImmediate start
$86.8k - $198k
...technology firm in McLean, Virginia is seeking a skilled Software Development Engineer to join their innovative engineering team. The ideal candidate... ...offers a dynamic work environment focused on cutting-edge technology and offers a competitive salary range of $86,80...Senior- ...provider is seeking an experienced AI Application Engineer to partner with business stakeholders and drive LLM application development. The role requires a Master... ...'s or Bachelor's in Computer Science, 8+ years of software development experience, and familiarity with AWS....Senior
$174k - $252k
Senior Software Engineer, ML Compilers, Edge TPU Location: Mountain View, CA, USA; Kirkland, WA, USA. Mid-level role focused on driving progress, solving problems, and mentoring more junior team members; requires deeper expertise and applied knowledge within the relevant...SeniorFull timeTemporary work$150k - $250k
...seeking a highly experienced GNC Engineer to design and optimize next-... ...for our real-time edge autonomous systems. About AION... ...Network! Description Senior Edge AI Perception Engineer... ...Leverage NVIDIA DeepStream, TensorRT, CUDA, and TAO Toolkit to create...SeniorTemporary work$166k - $253k
...strategic competition, Anduril is committed to bringing cutting‑edge autonomy, AI, computer‑vision, sensor fusion, and networking... ...the military in months, not years. ABOUT THE TEAM Anduril Software Engineers in the Connected Warfare (CW) Division build, deploy, integrate...SeniorFor contractorsRelocation package$150k - $165k
...world impact at the confluence of cutting edge robotic technology, agriculture, AI, and... .... The Role: Are you passionate about software development and eager to contribute in a... ...support our robotics platforms Software Engineering: Develop high-quality, modular, and efficient...SeniorRemote work- A leading company is seeking a Senior AI professional with expertise in Large Language Models... ...scalable applications, and integrate LLM capabilities into various projects. Proficiency... .... Join a dynamic team to leverage cutting-edge AI technologies and contribute to...Senior
- Hewlett Packard Enterprise Development LP in Sunnyvale, California, is seeking a Senior Platform Software Engineer to contribute to cutting-edge software development for next-generation HPE routers. This hybrid role involves working on platform software and collaborating...Senior
$272k - $431.25k
...resilient deployment of cutting-edge LLM workloads. We are seeking a Principal Systems Engineer to define the vision and... ...engines (such as vLLM, SGLang, TensorRT-LLM), with a focus on KV-cache... ...accelerators and memory pools. Mentor senior and junior engineers, set...Local areaRemote work$160k - $240k
...Senior Software Engineer - Edge Connectivity Platform Location New York Business Area Engineering and CTO Ref # 10050104 Description & Requirements Bloomberg Terminals operate across thousands of enterprise networks and the global Internet. The Network...SeniorTemporary workFor contractorsWork experience placementWorldwide
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Software Engineer - TensorRT Edge-LLM. Be the first to apply!
- graduate software developer United States
- rust software engineer United States
- senior software design engineer United States
- software engineer student United States
- software engineer amazon United States
- software developer positions United States
- software engineer full time United States
- software qa engineer United States
- new graduate software engineer United States
- junior software developer United States

