AI Inference Systems Engineer - TensorRT Special Platforms

NVIDIA

NVIDIA Corporation is looking for a passionate Software Engineer to join the TensorRT team in Santa Clara, California. You will drive innovation in deep learning and work with cutting-edge AI technology, contributing to high-performance AI inference solutions. Your role involves designing and developing deep learning software, collaborating with hardware and software teams, and producing robust C++ code. A Master's or PhD in relevant fields with strong C++ skills is essential. Join us to help build the future of AI inference technology! #J-18808-Ljbffr NVIDIA Corporation

Apply

Vacancy posted 1 day ago

Similar jobs that could be interesting for youBased on the AI Inference Systems Engineer - TensorRT Special Platforms in Santa Clara, CA vacancy

Principal AI Inference Systems Engineer
...experiences-from AI and data centers,... ...gaming and embedded systems. Grounded in a culture... ...Staff AI Infra Engineer who is passionate... ...benchmarks, with a special focus on AI/ML... ...LLM training and inference on AMD GPUs, improving... ...training or inference platforms using Kubernetes,...
Platform
Advanced Micro Devices , Inc.
Santa Clara, CA
5 days ago
Software Engineer, TensorRT Specialized Platforms - New College Grad 2025
$124k - $195.5k
...to work on cutting‑edge AI technology? Join NVIDIA's TensorRT team as a Software Engineer, and be at the forefront... ...contributing to high‑performance AI inference solutions for specialized platforms and applications. Your... ...functions, classes, and systems to improve robustness...
Platform
Internship
NVIDIA Gruppe
Santa Clara, CA
9 hours ago
Senior Software Engineer, Deep Learning Inference - TensorRT
$152k - $241.5k
...for a Senior Software Engineer for Deep Learning Inference! Would you like to... ...Learning Inference TensorRT software team.What you... ...scaled to multiple platforms for functionality and... ...developing System Software.Proficiency... ...vacancy.NVIDIA uses AI tools in its recruiting...
Platform
NVIDIA
Santa Clara, CA
9 hours ago
Senior ML Compiler & Inference Systems Engineer
$152k - $287.5k
...Machine Learning Applications and Compiler Engineer in Santa Clara, California. This role... ...involves developing algorithms for their LPX inference and compiler stack, optimizing the... ...performance of neural network workloads on NVIDIA platforms. Ideal candidates will possess an MS or...
Platform
NVIDIA Gruppe
Santa Clara, CA
9 hours ago
Senior Software Engineer, AI Inference Systems
$184k - $287.5k
...We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme... ...such as containerd/CRI‑O/CRIU. Experience with cloud platforms (AWS/GCP/Azure), infrastructure as code, CI/CD, and...
Platform
NVIDIA Gruppe
Santa Clara, CA
9 hours ago
Senior Systems Research Engineer - Embodied AI & Robotics
...Description As a Senior Systems Research Engineer , you will join a... ...and build embodied AI applications at the... ...specifically for embedded platforms to achieve low-... ...diverse frameworks before specializing in high-impact... ...to real-time onboard inference—while serving as a...
Platform
Intuitive
Sunnyvale, CA
4 days ago
Senior Software Engineer, Deep Learning Inference - Automotive Safety
$152k - $241.5k
...work on cutting-edge AI technology for... ...applications? Join NVIDIA's TensorRT team as a Senior Software Engineer, and be at the... ...-performance AI inference solutions for... ...safety and other specialized platforms. Your expertise will... ...functions, classes, and systems to support...
Platform
NVIDIA
Santa Clara, CA
1 day ago
Staff Software Engineer, Inference
$188k - $275k
...Staff Software Engineer, Inference CoreWeave is The... ...Essential Cloud for AI™. Built for... ...CoreWeave delivers a platform of technology,... ...management, and system-wide optimizations... ...as vLLM, Triton, TensorRT-LLM, Ray Serve, or... ...requirements for specialized skill sets. New hires...
Platform
Permanent employment
Temporary work
Casual work
Work at office
Remote work
Flexible hours
CoreWeave
Sunnyvale, CA
4 days ago
Deep Learning Software Engineer, TensorRT Performance - New College Grad 2026
$124k - $195.5k
...Learning Software Engineer, TensorRT Performance!... ...performance of NVIDIA’s inference ecosystem! NVIDIA... ...like Generative AI, Recommenders and... ...solutions. We specialize in developing GPU... ...resource-constrained systems or embedded AI... ...effective computing platform driving our...
Platform
Remote work
NVIDIA
Santa Clara, CA
5 days ago
Senior Software Engineer II, Inference
$165k - $242k
...Senior Software Engineer II, Inference Sunnyvale, CA... ...Essential Cloud for AI™. Built for... ...CoreWeave delivers a platform of technology, tools... ...distributed systems or cloud services... ...frameworks (vLLM, Triton, TensorRT-LLM, Ray Serve,... ...requirements for specialized skill sets. New...
Platform
Permanent employment
Temporary work
Casual work
Work at office
Remote work
Flexible hours
Shift work
CoreWeave
Sunnyvale, CA
3 days ago
Principal Engineer - Distributed AI Systems Architecture (Heterogeneous Compute)
$255.85k - $361.2k
...are seeking a Principal Engineer to define and architect... ...of distributed AI systems across heterogeneous compute platforms, including CPUs, GPUs,... ...across CPUs, GPUs, and specialized accelerators. Stateful... ...Experience with AI/ML systems, inference infrastructure, or large...
Platform
Local area
Shift work
Intel Corporation
Santa Clara, CA
4 days ago
Principal Engineer - AI Agents and Systems
$272k - $431.25k
...looking for a Principal Engineer to serve as a key... ...advanced AI agent frameworks and... ...combining powerful local inference (Nemotron models)... ...AI operating system. What You Will... ...GeForce GPUs using TensorRT and CUDA. Partner... ...-source AI agent platforms or orchestration tools...
Platform
Local area
Worldwide
NVIDIA
Santa Clara, CA
5 days ago
Senior Backend Engineer: Distributed Systems for AI Inference
...Time · Department: Backend Engineer · Work type: On-Site About A rchetype AI Archetype AI is developing the world's first AI platform to bring AI into the... ...and resilient distributed systems. You’ll work closely... ...throughput, low-latency AI model inference and data services....
Platform
Full time
Neara
Palo Alto, CA
8 hours ago
Senior DL Inference Engineer (TensorRT) - Equity
...Gruppe in Santa Clara, California is seeking a Senior Software Engineer specializing in Deep Learning Inference. In this role, you will craft and develop high-performance software tailored for scalable platforms while collaborating with experts in the field. The ideal...
Platform
NVIDIA Gruppe
Santa Clara, CA
9 hours ago
Senior ML Systems Engineer
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑... ...industry‑leading training and inference speeds and empowers machine... ...a versatile and experienced engineer to join our SOTA Training Platform team. This team is responsible...
Platform
Internship
Cerebras
Sunnyvale, CA
2 days ago
Senior Backend Engineer, ML Inference Systems
$135.8k - $237.05k
...Mountain View, CA, USA Senior Backend Engineer, ML Inference Systems Location Mountain View, CA, USA Department AI & Machine Learning Requisition ID JOBREQ... ...Drive technical direction of our inference platform, with a focus on low-latency, high-...
Platform
Work at office
Worldwide
Relocation package
Unity Technologies
Mountain View, CA
1 day ago
Board and Systems Design Engineer, Custom Silicon
$132k - $189k
...s degree in Electrical Engineering, Computer Engineering,... ...Experience with test system development in support... ...to shape the future of AI/ML hardware acceleration... ...providing the essential platforms that enable developers... ...you have a disability or special need that requires accommodation...
Platform
Full time
Worldwide
Google
Sunnyvale, CA
1 day ago
Machine Learning Systems Engineer, Networking
$152k - $241.5k
...Join our team of innovative engineers who are building an AI Data Center AIOps platform that turns raw, high-volume telemetry... ...they impact AI training and inference. The core challenge of this role... ...implementing ML algorithms directly in systems languages for latency-sensitive...
Platform
NVIDIA
Santa Clara, CA
1 day ago
Senior AI Systems Performance Engineer
...Senior AI Systems Performance Engineer Palo Alto, California, United States The... ...full-stack, generative AI platform, from chip to model,... ...performance for large-scale AI inference. Responsibilities... ...DeepSpeed, Megatron, vLLM, or TensorRT. Strong GPU programming...
Platform
SambaNova Systems
Palo Alto, CA
3 days ago
Senior Business Systems Engineer- Data Center Systems II
$165k - $242k
...Senior Business Systems Engineer- Data Center Systems II Livingston, NJ... ...CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and... ...on role requirements for specialized skill sets. New hires will be...
Platform
Temporary work
Casual work
Work at office
Immediate start
Remote work
Flexible hours
CoreWeave
Sunnyvale, CA
3 days ago
Senior AI Systems Performance Engineer: Drive SOTA Inference
A leader in AI technology in Palo Alto is seeking a Senior AI Systems Performance Engineer to optimize the latest foundation models on their innovative platform. This role involves collaborating with cross-functional teams to push the performance limits of AI systems....
Platform
SambaNova
Palo Alto, CA
1 day ago
Staff Software Engineer - Secondary Driving System
$218.8k - $335.3k
...Motors, our Embodied AI teams are... ...learning to build systems that are both intelligent... ...a Staff Software Engineer to provide... ...broader autonomy, platform, and hardware roadmaps... ...accelerator-based ML inference , model... ...optimization (e.g.,TensorRT, ONNX Runtime, custom...
Platform
Local area
Work from home
Flexible hours
General Motors
Sunnyvale, CA
1 day ago
Senior AI Systems Engineer - Special Projects
...seeks a skilled Machine Learning Software Engineer to develop and integrate intelligent... ...Objective-C, and a deep understanding of Apple platforms. Candidates will collaborate with ML... ...contributing to the creation of seamless AI solutions that enhance user experiences....
Platform
Apple Inc.
Cupertino, CA
2 days ago
Senior Inference Technical Product Marketing Manager - Accelerated Computing
...will be pivotal in our inference marketing. You will work closely with engineering to understand the technical... ...leadership position in AI inference. What You’ll... ...NVIDIA’s inference platform technical go-to-market... ...Triton Inference Server, TensorRT‑LLM, vLLM, SGLang). Market...
Platform
NVIDIA Gruppe
Santa Clara, CA
9 hours ago
Senior Software Engineer - TensorRT Edge-LLM
...large language model inference? Join NVIDIA’s TensorRT Edge‑LLM team and... ...generation of edge AI for automotive and robotics... ...embedded and edge platforms — delivering cutting... ...Electrical/Computer Engineering, or a closely... ...autoregressive LLM serving systems, including...
Platform
NVIDIA Gruppe
Santa Clara, CA
9 hours ago
Staff Inference ML Runtime Engineer
...About The Role The Inference ML Engineering team at Cerebras Systems is dedicated to enabling our... ...the full potential of our platform, leveraging its... ...state‑of‑the‑art generative AI models on our custom hardware... ...as vLLM, SGLang, and TensorRT‑LLM. Solid understanding...
Platform
Dormont Manufacturing Company
Sunnyvale, CA
1 day ago
Senior AI Runtime & Systems Engineer (Embedded Linux)
d-Matrix, based in Santa Clara, CA, is seeking a Staff Runtime Systems Engineer to lead the development of runtime software for AI inference platforms. You'll be responsible for architecting and developing firmware for multiprocessor systems-on-chip, collaborating with...
Platform
3 days per week
d-Matrix
Santa Clara, CA
2 days ago
Senior Software Engineer, Machine Learning Inference
$152k - $241.5k
...driving advancements in AI and machine learning... ...and motivated engineers to join our TensorRT team in developing the... ...leading deep learning inference software for NVIDIA AI... ...Frameworks, Compilers, or System Software. ~... ...-effective computing platform driving our success in...
Platform
NVIDIA
Santa Clara, CA
3 days ago
Senior Deep Learning Software Engineer, Inference
$184k - $287.5k
...NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor... ...powers today’s most sophisticated AI applications. Our team is... ...a central role in improving these platforms, facilitating smooth deployment and...
Platform
NVIDIA Gruppe
Santa Clara, CA
8 hours ago
Senior Inference Technical Product Marketing Manager - Accelerated Computing
$148k - $235.75k
...business and pivotal in our inference marketing. You will be... ...on working with engineering to understand the technical... ...position in AI inference. Want to... ...drive NVIDIA’s inference platform technical go-to-market... ...Triton Inference Server, TensorRT-LLM, vLLM, SGLang) ~...
Platform
NVIDIA
Santa Clara, CA
5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Inference Systems Engineer - TensorRT Special Platforms. Be the first to apply!