Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Software Engineer - TensorRT Edge-LLM

$152k - $241.5k

NVIDIA

Are you passionate about pushing the limits of real-time large language model inference? Join NVIDIA's TensorRT Edge-LLM team and help shape the next generation of edge AI for automotive and robotics. We build the software stack that enables Large Language, Vision-Language, and Multimodal (LLM/VLM/VLA) models to run efficiently on embedded and edge platforms - delivering cutting-edge generative AI experiences directly on-device.

What you'll be doing:
  • Develop and evolve a state-of-the-art inference framework in modern C++ that extends TensorRT with autoregressive model serving capabilities, including speculative decoding, LoRA, MoE, and KV cache management.
  • Design and implement compiler and runtime optimizations tailored for transformer-based models running on constrained, real-time platforms.
  • Collaborate with teams across CUDA, kernel libraries, compilers, and robotics to deliver high-performance, production-ready solutions.
  • Contribute to CUDA kernel and operator development for critical transformer components such as attention, GEMM, and MoE.
  • Benchmark, profile, and optimize inference performance across diverse embedded and automotive environments.
  • Stay ahead of the rapidly evolving LLM/VLM ecosystem and bring emerging techniques into product-grade software.
What we need to see:
  • BS, MS, PhD, or equivalent experience in Computer Science, Electrical/Computer Engineering, or a closely related field.
  • 4+ years of relevant software development experience.
  • Deep understanding of transformer models and inference optimization techniques (e.g., quantization, tensor parallelism, or memory-efficient scheduling).
  • Proficient programming ability with modern C++ (C++11/14/17 and beyond).
  • Familiarity with popular LLM frameworks and libraries such as TensorRT, TensorRT-LLM, vLLM, SGLang, MLC-LLM, or FlashInfer.
  • A track record of strong software design, execution, and collaboration across fields.
Ways to stand out from the crowd:
  • Demonstrated development experience or open-source contributions to LLM inference frameworks and libraries, such as SGLang, vLLM, or FlashInfer.
  • Proficiency with CUDA, including efficient kernel development, performance profiling, and GPU architecture fundamentals.
  • Prior work on autoregressive LLM serving systems, including speculative decoding or KV cache management.
  • Familiarity with compiler infrastructure for large language model inference.
  • Exposure to robotics or embedded AI pipelines, including optimizing for low-latency, resource-constrained systems.

NVIDIA is widely considered to be one of the technology world's most desirable employers. We hire some of the most brilliant and forward-thinking people in the world. If you thrive on innovation, autonomy, and technical excellence, come join us to shape the future of edge AI.

#LI-Hybrid

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 152,000 USD - 241,500 USD for Level 3, and 184,000 USD - 287,500 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until March 21, 2026.

This posting is for an existing vacancy.


NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Senior Software Engineer - TensorRT Edge-LLM in United States vacancy
  • $184k - $287.5k

     ...We are now looking for a Senior Deep Learning Software Engineer, LLM Performance! NVIDIA is seeking an experienced...  ...accelerated Deep learning software like TensorRT, DL benchmarking software and...  ...accelerators, from datacenter GPUs to edge SoCs. Implement LLM inference,... 
    Senior

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

     ...We are now looking for a Senior Deep Learning Software Engineer, TensorRT Performance! NVIDIA is seeking an experienced...  ...accelerators, from datacenter GPUs to edge SoCs. Implement graph compiler...  ...libraries (e.g. TensorRT, TensorRT-LLM, vLLM, SGLang, FlashInfer). Experience... 
    Senior
    Remote work

    NVIDIA

    United States
    3 days ago
  • $184k - $287.5k

    Senior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous Vehicles page is loaded## Senior DL Software...  ...SOTA algorithms to make LLM/VLM fast, lean, and reliable enough...  ...automate deployment pipelines for TensorRT conversion.* Scale DL model performance... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  •  ...Senior Software Engineer, LLM Performance SF Bay Area (Hybrid) Parasail is redefining AI infrastructure by enabling seamless deployment across...  ...to Kubernetes-based deployments. Contribute to cutting-edge open-source LLM engines such as vLLM or SGLang to extend... 
    Senior

    Parasail

    San Francisco, CA
    1 day ago
  • $40 - $100 per hour

     ...Remote Senior Software Engineer (LLM) - 34953 Remote Senior Software Engineer (LLM) - 34953 3 days ago Be among the first 25 applicants Get AI...  ...across evaluations. Collaborate with the team to identify edge cases and ambiguities in model behavior. Required... 
    Senior
    Full time
    Contract work
    For contractors
    Remote work

    Turing Inc

    New York, NY
    4 days ago
  •  ...Senior AI/ML DevOps Engineer Join Cisco's CX AI Incubation Team as...  ...and help productionize LLM/SLM capabilities for...  ...and intelligence on edge. You will collaborate...  ...role requires strong software engineering, hands-on...  ...(vLLM, Triton, TensorRT-LLM, llama.cpp). Exposure... 
    Senior

    Webex Events (formerly Socio)

    San Jose, CA
    1 day ago
  • $180k - $240k

     ...the next generation of powerful, meaningful products built with AI. Job Overview We’re seeking an exceptional Senior Software Engineer to join our LLM team. This role is focused on building and maintaining our LLM gateway service—a unified API platform that... 
    Senior
    Full time
    Remote work
    Easy work

    AssemblyAI

    New York, NY
    1 day ago
  •  ...A cutting-edge technology company in the United States is seeking a Forward Deployed Engineer to lead software integration for robotic systems in real-time environments. This role requires a strong background in robotics software engineering, with hands-on experience... 
    Senior
    Flexible hours

    Ditto

    New York, NY
    2 days ago
  • B Capital is seeking a backend Software Engineer to join the Einstein GPT Team in San Francisco. You will be building platform services for LLM technology supporting CRM cloud applications, focusing on performance, scalability, and efficiency. Ideal candidates have over... 
    Senior

    B Capital

    San Francisco, CA
    2 days ago
  •  ...healthcare technology company based in San Francisco is seeking a Senior Full-Stack Engineer to build and optimize core research infrastructure. This...  ..., enhancing efficiency, and empowering doctors and patients by leveraging cutting-edge AI technologies. #J-18808-Ljbffr
    Senior

    Sully

    San Francisco, CA
    4 days ago
  •  ...training pipelines, plus top AI researchers who specialize in software engineering, logical reasoning, STEM, multilinguality, multimodality, and...  ...As a Software Engineering evaluator, you will create cutting-edge datasets for training, benchmarking, and advancing large... 
    Senior
    For contractors
    Remote work
    Flexible hours

    Turing

    Denver, CO
    3 days ago
  •  ...Texas Sports Academy Main is seeking a Software Engineer (AI-Forward) to build software for managing student records and AI tools. The role requires...  ..., proficiency in AI coding tools, and involvement in LLM-powered features. You'll work closely with founders, moving features... 
    Senior
    Remote work

    Texas Sports Academy Main

    New York, NY
    2 days ago
  • $152k - $241.5k

     ...We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep Learning by helping...  ...Senior Engineering positions in the Deep Learning Inference TensorRT software team. What you’ll be doing: Craft and develop... 
    Senior

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $100 per hour

     ...leader at the forefront of AI research and advanced system deployment, is looking for qualified Senior Software Engineers to assist in a one-time project to assist with their LLM training. Selected candidates will be asked to complete an AI interview, followed by... 
    Senior
    Hourly pay
    Temporary work
    Remote work

    Aquent

    United States
    3 days ago
  • $166k - $253k

     ...Anduril is committed to bringing cutting-edge autonomy, AI, computer vision, sensor...  ...years. ABOUT THE TEAM Anduril Software Engineers in the Connected Warfare (CW) Division...  ...delivery of winning capabilities. As a Senior Software Engineeron the ECC... 
    Senior
    Full time
    For contractors
    Work experience placement
    Immediate start

    Anduril Industries

    Costa Mesa, CA
    5 days ago
  • $196k - $294k

    Nerdleveltech seeks a Senior Software Engineer to join their Trust & Safety team. Based in San Francisco, you'll protect millions of developers from...  ...scale, using your skills in JavaScript, Python, and LLM methodologies. Responsibilities include analyzing threat actor... 
    Senior
    Flexible hours

    Nerdleveltech

    San Francisco, CA
    2 days ago
  • $168.1k - $227.4k

     ...deadline: May 27, 2026 AWS Edge Connectivity service enables...  ...You'll architect and implement software security protocols across...  ...standards Lead design and system engineers to perform tradeoffs and...  ...-sharing and mentorship. Our senior members enjoy one-on-one mentoring... 
    Senior
    Full time
    Internship
    Work at office
    Flexible hours

    Amazon

    Arlington, VA
    3 days ago
  • $150k - $180k

    A technology company in El Segundo is seeking a Senior Software Engineer to develop software systems for next-generation automated drilling rigs....  ...software with hardware, managing core infrastructure, and driving edge-computing strategies for analytics. The ideal candidate will... 
    Senior

    Durin

    El Segundo, CA
    1 day ago
  • $200k - $250k

     ...to offer a ticket to the millions of fans who browse our platform around the world. Join StubHub's Edge Services team as a Software/Security Engineer and help shape the global infrastructure that keeps millions of fans connected, secure, and engaged-right at... 
    Senior
    Work at office
    Remote work
    Worldwide
    Flexible hours

    StubHub

    New York, NY
    10 hours ago
  •  ...Senior AI Engineer - RAG Systems Bright.AI is a high-growth Physical AI company transforming how...  ...of real-world events-captured across edge devices, mobile sensors, and cloud infrastructure...  ...are now hiring a Senior AI Engineer - LLM, RAG to lead the development of... 
    Senior
    Immediate start

    BrightAI Corporation

    Palo Alto, CA
    5 days ago
  • $86.8k - $198k

     ...technology firm in McLean, Virginia is seeking a skilled Software Development Engineer to join their innovative engineering team. The ideal candidate...  ...offers a dynamic work environment focused on cutting-edge technology and offers a competitive salary range of $86,80... 
    Senior

    Phase2 Technology

    Mc Lean, VA
    5 days ago
  •  ...provider is seeking an experienced AI Application Engineer to partner with business stakeholders and drive LLM application development. The role requires a Master...  ...'s or Bachelor's in Computer Science, 8+ years of software development experience, and familiarity with AWS.... 
    Senior

    Savvas Learning

    California, MO
    1 day ago
  • $174k - $252k

    Senior Software Engineer, ML Compilers, Edge TPU Location: Mountain View, CA, USA; Kirkland, WA, USA. Mid-level role focused on driving progress, solving problems, and mentoring more junior team members; requires deeper expertise and applied knowledge within the relevant... 
    Senior
    Full time
    Temporary work

    Google Inc.

    Mountain View, CA
    4 days ago
  • $150k - $250k

     ...seeking a highly experienced GNC Engineer to design and optimize next-...  ...for our real-time edge autonomous systems. About AION...  ...Network! Description Senior Edge AI Perception Engineer...  ...Leverage NVIDIA DeepStream, TensorRT, CUDA, and TAO Toolkit to create... 
    Senior
    Temporary work

    Aion Robotics Corporation

    Arvada, CO
    5 days ago
  • $166k - $253k

     ...strategic competition, Anduril is committed to bringing cutting‑edge autonomy, AI, computer‑vision, sensor fusion, and networking...  ...the military in months, not years. ABOUT THE TEAM Anduril Software Engineers in the Connected Warfare (CW) Division build, deploy, integrate... 
    Senior
    For contractors
    Relocation package

    Slope

    Costa Mesa, CA
    2 days ago
  • $150k - $165k

     ...world impact at the confluence of cutting edge robotic technology, agriculture, AI, and...  .... The Role: Are you passionate about software development and eager to contribute in a...  ...support our robotics platforms Software Engineering: Develop high-quality, modular, and efficient... 
    Senior
    Remote work

    Aigen Robotics

    Redmond, WA
    5 days ago
  • A leading company is seeking a Senior AI professional with expertise in Large Language Models...  ...scalable applications, and integrate LLM capabilities into various projects. Proficiency...  .... Join a dynamic team to leverage cutting-edge AI technologies and contribute to... 
    Senior

    TechDigital Group

    Charlotte, NC
    5 days ago
  • Hewlett Packard Enterprise Development LP in Sunnyvale, California, is seeking a Senior Platform Software Engineer to contribute to cutting-edge software development for next-generation HPE routers. This hybrid role involves working on platform software and collaborating... 
    Senior

    Hewlett Packard Enterprise Development LP

    Sunnyvale, CA
    2 days ago
  • $272k - $431.25k

     ...resilient deployment of cutting-edge LLM workloads. We are seeking a Principal Systems Engineer to define the vision and...  ...engines (such as vLLM, SGLang, TensorRT-LLM), with a focus on KV-cache...  ...accelerators and memory pools. Mentor senior and junior engineers, set... 
    Local area
    Remote work

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $160k - $240k

     ...Senior Software Engineer - Edge Connectivity Platform Location New York Business Area Engineering and CTO Ref # 10050104 Description & Requirements Bloomberg Terminals operate across thousands of enterprise networks and the global Internet. The Network... 
    Senior
    Temporary work
    For contractors
    Work experience placement
    Worldwide

    Bloomberg

    New York, NY
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Software Engineer - TensorRT Edge-LLM. Be the first to apply!