Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Inference Systems Engineer - TensorRT Special Platforms

NVIDIA

NVIDIA Corporation is looking for a passionate Software Engineer to join the TensorRT team in Santa Clara, California. You will drive innovation in deep learning and work with cutting-edge AI technology, contributing to high-performance AI inference solutions. Your role involves designing and developing deep learning software, collaborating with hardware and software teams, and producing robust C++ code. A Master's or PhD in relevant fields with strong C++ skills is essential. Join us to help build the future of AI inference technology! #J-18808-Ljbffr NVIDIA Corporation

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the AI Inference Systems Engineer - TensorRT Special Platforms in Santa Clara, CA vacancy
  •  ...experiences-from AI and data centers,...  ...gaming and embedded systems. Grounded in a culture...  ...Staff AI Infra Engineer who is passionate...  ...benchmarks, with a special focus on AI/ML...  ...LLM training and inference on AMD GPUs, improving...  ...training or inference platforms using Kubernetes,... 
    Platform

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    5 days ago
  • $124k - $195.5k

     ...to work on cutting‑edge AI technology? Join NVIDIA's TensorRT team as a Software Engineer, and be at the forefront...  ...contributing to high‑performance AI inference solutions for specialized platforms and applications. Your...  ...functions, classes, and systems to improve robustness... 
    Platform
    Internship

    NVIDIA Gruppe

    Santa Clara, CA
    9 hours ago
  • $152k - $241.5k

     ...for a Senior Software Engineer for Deep Learning Inference! Would you like to...  ...Learning Inference TensorRT software team.What you...  ...scaled to multiple platforms for functionality and...  ...developing System Software.Proficiency...  ...vacancy.NVIDIA uses AI tools in its recruiting... 
    Platform

    NVIDIA

    Santa Clara, CA
    9 hours ago
  • $152k - $287.5k

     ...Machine Learning Applications and Compiler Engineer in Santa Clara, California. This role...  ...involves developing algorithms for their LPX inference and compiler stack, optimizing the...  ...performance of neural network workloads on NVIDIA platforms. Ideal candidates will possess an MS or... 
    Platform

    NVIDIA Gruppe

    Santa Clara, CA
    9 hours ago
  • $184k - $287.5k

     ...We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme...  ...such as containerd/CRI‑O/CRIU. Experience with cloud platforms (AWS/GCP/Azure), infrastructure as code, CI/CD, and... 
    Platform

    NVIDIA Gruppe

    Santa Clara, CA
    9 hours ago
  •  ...Description As a Senior Systems Research Engineer , you will join a...  ...and build embodied AI applications at the...  ...specifically for embedded platforms to achieve low-...  ...diverse frameworks before specializing in high-impact...  ...to real-time onboard inference—while serving as a... 
    Platform

    Intuitive

    Sunnyvale, CA
    4 days ago
  • $152k - $241.5k

     ...work on cutting-edge AI technology for...  ...applications? Join NVIDIA's TensorRT team as a Senior Software Engineer, and be at the...  ...-performance AI inference solutions for...  ...safety and other specialized platforms. Your expertise will...  ...functions, classes, and systems to support... 
    Platform

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $188k - $275k

     ...Staff Software Engineer, Inference CoreWeave is The...  ...Essential Cloud for AI™. Built for...  ...CoreWeave delivers a platform of technology,...  ...management, and system-wide optimizations...  ...as vLLM, Triton, TensorRT-LLM, Ray Serve, or...  ...requirements for specialized skill sets. New hires... 
    Platform
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    4 days ago
  • $124k - $195.5k

     ...Learning Software Engineer, TensorRT Performance!...  ...performance of NVIDIA’s inference ecosystem! NVIDIA...  ...like Generative AI, Recommenders and...  ...solutions. We specialize in developing GPU...  ...resource-constrained systems or embedded AI...  ...effective computing platform driving our... 
    Platform
    Remote work

    NVIDIA

    Santa Clara, CA
    5 days ago
  • $165k - $242k

     ...Senior Software Engineer II, Inference Sunnyvale, CA...  ...Essential Cloud for AI™. Built for...  ...CoreWeave delivers a platform of technology, tools...  ...distributed systems or cloud services...  ...frameworks (vLLM, Triton, TensorRT-LLM, Ray Serve,...  ...requirements for specialized skill sets. New... 
    Platform
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours
    Shift work

    CoreWeave

    Sunnyvale, CA
    3 days ago
  • $255.85k - $361.2k

     ...are seeking a Principal Engineer to define and architect...  ...of distributed AI systems across heterogeneous compute platforms, including CPUs, GPUs,...  ...across CPUs, GPUs, and specialized accelerators. Stateful...  ...Experience with AI/ML systems, inference infrastructure, or large... 
    Platform
    Local area
    Shift work

    Intel Corporation

    Santa Clara, CA
    4 days ago
  • $272k - $431.25k

     ...looking for a Principal Engineer to serve as a key...  ...advanced AI agent frameworks and...  ...combining powerful local inference (Nemotron models)...  ...AI operating system. What You Will...  ...GeForce GPUs using TensorRT and CUDA. Partner...  ...-source AI agent platforms or orchestration tools... 
    Platform
    Local area
    Worldwide

    NVIDIA

    Santa Clara, CA
    5 days ago
  •  ...Time · Department: Backend Engineer · Work type: On-Site About A rchetype AI Archetype AI is developing the world's first AI platform to bring AI into the...  ...and resilient distributed systems. You’ll work closely...  ...throughput, low-latency AI model inference and data services.... 
    Platform
    Full time

    Neara

    Palo Alto, CA
    8 hours ago
  •  ...Gruppe in Santa Clara, California is seeking a Senior Software Engineer specializing in Deep Learning Inference. In this role, you will craft and develop high-performance software tailored for scalable platforms while collaborating with experts in the field. The ideal... 
    Platform

    NVIDIA Gruppe

    Santa Clara, CA
    9 hours ago
  • Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer‑...  ...industry‑leading training and inference speeds and empowers machine...  ...a versatile and experienced engineer to join our SOTA Training Platform team. This team is responsible... 
    Platform
    Internship

    Cerebras

    Sunnyvale, CA
    2 days ago
  • $135.8k - $237.05k

     ...Mountain View, CA, USA Senior Backend Engineer, ML Inference Systems Location Mountain View, CA, USA Department AI & Machine Learning Requisition ID JOBREQ...  ...Drive technical direction of our inference platform, with a focus on low-latency, high-... 
    Platform
    Work at office
    Worldwide
    Relocation package

    Unity Technologies

    Mountain View, CA
    1 day ago
  • $132k - $189k

     ...s degree in Electrical Engineering, Computer Engineering,...  ...Experience with test system development in support...  ...to shape the future of AI/ML hardware acceleration...  ...providing the essential platforms that enable developers...  ...you have a disability or special need that requires accommodation... 
    Platform
    Full time
    Worldwide

    Google

    Sunnyvale, CA
    1 day ago
  • $152k - $241.5k

     ...Join our team of innovative engineers who are building an AI Data Center AIOps platform that turns raw, high-volume telemetry...  ...they impact AI training and inference. The core challenge of this role...  ...implementing ML algorithms directly in systems languages for latency-sensitive... 
    Platform

    NVIDIA

    Santa Clara, CA
    1 day ago
  •  ...Senior AI Systems Performance Engineer Palo Alto, California, United States The...  ...full-stack, generative AI platform, from chip to model,...  ...performance for large-scale AI inference. Responsibilities...  ...DeepSpeed, Megatron, vLLM, or TensorRT. Strong GPU programming... 
    Platform

    SambaNova Systems

    Palo Alto, CA
    3 days ago
  • $165k - $242k

     ...Senior Business Systems Engineer- Data Center Systems II Livingston, NJ...  ...CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and...  ...on role requirements for specialized skill sets. New hires will be... 
    Platform
    Temporary work
    Casual work
    Work at office
    Immediate start
    Remote work
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    3 days ago
  • A leader in AI technology in Palo Alto is seeking a Senior AI Systems Performance Engineer to optimize the latest foundation models on their innovative platform. This role involves collaborating with cross-functional teams to push the performance limits of AI systems.... 
    Platform

    SambaNova

    Palo Alto, CA
    1 day ago
  • $218.8k - $335.3k

     ...Motors, our Embodied AI teams are...  ...learning to build systems that are both intelligent...  ...a Staff Software Engineer to provide...  ...broader autonomy, platform, and hardware roadmaps...  ...accelerator-based ML inference , model...  ...optimization (e.g.,TensorRT, ONNX Runtime, custom... 
    Platform
    Local area
    Work from home
    Flexible hours

    General Motors

    Sunnyvale, CA
    1 day ago
  •  ...seeks a skilled Machine Learning Software Engineer to develop and integrate intelligent...  ...Objective-C, and a deep understanding of Apple platforms. Candidates will collaborate with ML...  ...contributing to the creation of seamless AI solutions that enhance user experiences.... 
    Platform

    Apple Inc.

    Cupertino, CA
    2 days ago
  •  ...will be pivotal in our inference marketing. You will work closely with engineering to understand the technical...  ...leadership position in AI inference. What You’ll...  ...NVIDIA’s inference platform technical go-to-market...  ...Triton Inference Server, TensorRT‑LLM, vLLM, SGLang). Market... 
    Platform

    NVIDIA Gruppe

    Santa Clara, CA
    9 hours ago
  •  ...large language model inference? Join NVIDIA’s TensorRT Edge‑LLM team and...  ...generation of edge AI for automotive and robotics...  ...embedded and edge platforms — delivering cutting...  ...Electrical/Computer Engineering, or a closely...  ...autoregressive LLM serving systems, including... 
    Platform

    NVIDIA Gruppe

    Santa Clara, CA
    9 hours ago
  •  ...About The Role The Inference ML Engineering team at Cerebras Systems is dedicated to enabling our...  ...the full potential of our platform, leveraging its...  ...state‑of‑the‑art generative AI models on our custom hardware...  ...as vLLM, SGLang, and TensorRT‑LLM. Solid understanding... 
    Platform

    Dormont Manufacturing Company

    Sunnyvale, CA
    1 day ago
  • d-Matrix, based in Santa Clara, CA, is seeking a Staff Runtime Systems Engineer to lead the development of runtime software for AI inference platforms. You'll be responsible for architecting and developing firmware for multiprocessor systems-on-chip, collaborating with... 
    Platform
    3 days per week

    d-Matrix

    Santa Clara, CA
    2 days ago
  • $152k - $241.5k

     ...driving advancements in AI and machine learning...  ...and motivated engineers to join our TensorRT team in developing the...  ...leading deep learning inference software for NVIDIA AI...  ...Frameworks, Compilers, or System Software. ~...  ...-effective computing platform driving our success in... 
    Platform

    NVIDIA

    Santa Clara, CA
    3 days ago
  • $184k - $287.5k

     ...NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor...  ...powers today’s most sophisticated AI applications. Our team is...  ...a central role in improving these platforms, facilitating smooth deployment and... 
    Platform

    NVIDIA Gruppe

    Santa Clara, CA
    8 hours ago
  • $148k - $235.75k

     ...business and pivotal in our inference marketing. You will be...  ...on working with engineering to understand the technical...  ...position in AI inference. Want to...  ...drive NVIDIA’s inference platform technical go-to-market...  ...Triton Inference Server, TensorRT-LLM, vLLM, SGLang) ~... 
    Platform

    NVIDIA

    Santa Clara, CA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Inference Systems Engineer - TensorRT Special Platforms. Be the first to apply!