AI Inference Systems Engineer - TensorRT Special Platforms
NVIDIA
NVIDIA Corporation is looking for a passionate Software Engineer to join the TensorRT team in Santa Clara, California. You will drive innovation in deep learning and work with cutting-edge AI technology, contributing to high-performance AI inference solutions. Your role involves designing and developing deep learning software, collaborating with hardware and software teams, and producing robust C++ code. A Master's or PhD in relevant fields with strong C++ skills is essential. Join us to help build the future of AI inference technology! #J-18808-Ljbffr NVIDIA Corporation
- ...experiences-from AI and data centers,... ...gaming and embedded systems. Grounded in a culture... ...Staff AI Infra Engineer who is passionate... ...benchmarks, with a special focus on AI/ML... ...LLM training and inference on AMD GPUs, improving... ...training or inference platforms using Kubernetes,...Platform
$124k - $195.5k
...to work on cutting‑edge AI technology? Join NVIDIA's TensorRT team as a Software Engineer, and be at the forefront... ...contributing to high‑performance AI inference solutions for specialized platforms and applications. Your... ...functions, classes, and systems to improve robustness...PlatformInternship$152k - $241.5k
...for a Senior Software Engineer for Deep Learning Inference! Would you like to... ...Learning Inference TensorRT software team.What you... ...scaled to multiple platforms for functionality and... ...developing System Software.Proficiency... ...vacancy.NVIDIA uses AI tools in its recruiting...Platform$152k - $287.5k
...Machine Learning Applications and Compiler Engineer in Santa Clara, California. This role... ...involves developing algorithms for their LPX inference and compiler stack, optimizing the... ...performance of neural network workloads on NVIDIA platforms. Ideal candidates will possess an MS or...Platform$184k - $287.5k
...We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme... ...such as containerd/CRI‑O/CRIU. Experience with cloud platforms (AWS/GCP/Azure), infrastructure as code, CI/CD, and...Platform- ...Description As a Senior Systems Research Engineer , you will join a... ...and build embodied AI applications at the... ...specifically for embedded platforms to achieve low-... ...diverse frameworks before specializing in high-impact... ...to real-time onboard inference—while serving as a...Platform
$152k - $241.5k
...work on cutting-edge AI technology for... ...applications? Join NVIDIA's TensorRT team as a Senior Software Engineer, and be at the... ...-performance AI inference solutions for... ...safety and other specialized platforms. Your expertise will... ...functions, classes, and systems to support...Platform$188k - $275k
...Staff Software Engineer, Inference CoreWeave is The... ...Essential Cloud for AI™. Built for... ...CoreWeave delivers a platform of technology,... ...management, and system-wide optimizations... ...as vLLM, Triton, TensorRT-LLM, Ray Serve, or... ...requirements for specialized skill sets. New hires...PlatformPermanent employmentTemporary workCasual workWork at officeRemote workFlexible hours$124k - $195.5k
...Learning Software Engineer, TensorRT Performance!... ...performance of NVIDIA’s inference ecosystem! NVIDIA... ...like Generative AI, Recommenders and... ...solutions. We specialize in developing GPU... ...resource-constrained systems or embedded AI... ...effective computing platform driving our...PlatformRemote work$165k - $242k
...Senior Software Engineer II, Inference Sunnyvale, CA... ...Essential Cloud for AI™. Built for... ...CoreWeave delivers a platform of technology, tools... ...distributed systems or cloud services... ...frameworks (vLLM, Triton, TensorRT-LLM, Ray Serve,... ...requirements for specialized skill sets. New...PlatformPermanent employmentTemporary workCasual workWork at officeRemote workFlexible hoursShift work$255.85k - $361.2k
...are seeking a Principal Engineer to define and architect... ...of distributed AI systems across heterogeneous compute platforms, including CPUs, GPUs,... ...across CPUs, GPUs, and specialized accelerators. Stateful... ...Experience with AI/ML systems, inference infrastructure, or large...PlatformLocal areaShift work$272k - $431.25k
...looking for a Principal Engineer to serve as a key... ...advanced AI agent frameworks and... ...combining powerful local inference (Nemotron models)... ...AI operating system. What You Will... ...GeForce GPUs using TensorRT and CUDA. Partner... ...-source AI agent platforms or orchestration tools...PlatformLocal areaWorldwide- ...Time · Department: Backend Engineer · Work type: On-Site About A rchetype AI Archetype AI is developing the world's first AI platform to bring AI into the... ...and resilient distributed systems. You’ll work closely... ...throughput, low-latency AI model inference and data services....PlatformFull time
- ...Gruppe in Santa Clara, California is seeking a Senior Software Engineer specializing in Deep Learning Inference. In this role, you will craft and develop high-performance software tailored for scalable platforms while collaborating with experts in the field. The ideal...Platform
- Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑... ...industry‑leading training and inference speeds and empowers machine... ...a versatile and experienced engineer to join our SOTA Training Platform team. This team is responsible...PlatformInternship
$135.8k - $237.05k
...Mountain View, CA, USA Senior Backend Engineer, ML Inference Systems Location Mountain View, CA, USA Department AI & Machine Learning Requisition ID JOBREQ... ...Drive technical direction of our inference platform, with a focus on low-latency, high-...PlatformWork at officeWorldwideRelocation package$132k - $189k
...s degree in Electrical Engineering, Computer Engineering,... ...Experience with test system development in support... ...to shape the future of AI/ML hardware acceleration... ...providing the essential platforms that enable developers... ...you have a disability or special need that requires accommodation...PlatformFull timeWorldwide$152k - $241.5k
...Join our team of innovative engineers who are building an AI Data Center AIOps platform that turns raw, high-volume telemetry... ...they impact AI training and inference. The core challenge of this role... ...implementing ML algorithms directly in systems languages for latency-sensitive...Platform- ...Senior AI Systems Performance Engineer Palo Alto, California, United States The... ...full-stack, generative AI platform, from chip to model,... ...performance for large-scale AI inference. Responsibilities... ...DeepSpeed, Megatron, vLLM, or TensorRT. Strong GPU programming...Platform
$165k - $242k
...Senior Business Systems Engineer- Data Center Systems II Livingston, NJ... ...CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and... ...on role requirements for specialized skill sets. New hires will be...PlatformTemporary workCasual workWork at officeImmediate startRemote workFlexible hours- A leader in AI technology in Palo Alto is seeking a Senior AI Systems Performance Engineer to optimize the latest foundation models on their innovative platform. This role involves collaborating with cross-functional teams to push the performance limits of AI systems....Platform
$218.8k - $335.3k
...Motors, our Embodied AI teams are... ...learning to build systems that are both intelligent... ...a Staff Software Engineer to provide... ...broader autonomy, platform, and hardware roadmaps... ...accelerator-based ML inference , model... ...optimization (e.g.,TensorRT, ONNX Runtime, custom...PlatformLocal areaWork from homeFlexible hours- ...seeks a skilled Machine Learning Software Engineer to develop and integrate intelligent... ...Objective-C, and a deep understanding of Apple platforms. Candidates will collaborate with ML... ...contributing to the creation of seamless AI solutions that enhance user experiences....Platform
- ...will be pivotal in our inference marketing. You will work closely with engineering to understand the technical... ...leadership position in AI inference. What You’ll... ...NVIDIA’s inference platform technical go-to-market... ...Triton Inference Server, TensorRT‑LLM, vLLM, SGLang). Market...Platform
- ...large language model inference? Join NVIDIA’s TensorRT Edge‑LLM team and... ...generation of edge AI for automotive and robotics... ...embedded and edge platforms — delivering cutting... ...Electrical/Computer Engineering, or a closely... ...autoregressive LLM serving systems, including...Platform
- ...About The Role The Inference ML Engineering team at Cerebras Systems is dedicated to enabling our... ...the full potential of our platform, leveraging its... ...state‑of‑the‑art generative AI models on our custom hardware... ...as vLLM, SGLang, and TensorRT‑LLM. Solid understanding...Platform
- d-Matrix, based in Santa Clara, CA, is seeking a Staff Runtime Systems Engineer to lead the development of runtime software for AI inference platforms. You'll be responsible for architecting and developing firmware for multiprocessor systems-on-chip, collaborating with...Platform3 days per week
$152k - $241.5k
...driving advancements in AI and machine learning... ...and motivated engineers to join our TensorRT team in developing the... ...leading deep learning inference software for NVIDIA AI... ...Frameworks, Compilers, or System Software. ~... ...-effective computing platform driving our success in...Platform$184k - $287.5k
...NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor... ...powers today’s most sophisticated AI applications. Our team is... ...a central role in improving these platforms, facilitating smooth deployment and...Platform$148k - $235.75k
...business and pivotal in our inference marketing. You will be... ...on working with engineering to understand the technical... ...position in AI inference. Want to... ...drive NVIDIA’s inference platform technical go-to-market... ...Triton Inference Server, TensorRT-LLM, vLLM, SGLang) ~...Platform
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Inference Systems Engineer - TensorRT Special Platforms. Be the first to apply!
- senior ai engineer Santa Clara, CA
- ai ml engineer Santa Clara, CA
- ai engineer remote Santa Clara, CA
- ai engineer Santa Clara, CA
- ai prompt engineer Santa Clara, CA
- ai developer Santa Clara, CA
- machine learning ai engineer Santa Clara, CA
- healthcare systems engineer Santa Clara, CA
- application system engineer Santa Clara, CA
- operating system engineer Santa Clara, CA

