Senior DL Engineer: Edge Model Optimization & Inference
NVIDIA Gruppe
NVIDIA Gruppe is looking for a skilled professional to enhance the performance of large-scale models through advanced optimization techniques in Santa Clara, California. Candidates should have a strong background in DL model training and deployment, ideally with a PhD or equivalent experience in Computer Science. The role offers an opportunity to work closely with cutting-edge technologies and a collaborative team. Benefits include a competitive salary range and eligibility for equity.
$184k - $287.5k
...Develop state‑of‑the‑art model optimization techniques—speculative... ...strategies for inference, such as automated model... ...TensorRT conversion. Scale DL model performance across diverse NVIDIA edge architectures,... ...Computer Science, Computer Engineering, or a related technical...Senior- NVIDIA is seeking a Senior DL Algorithms Engineer to optimize LLM/Omni models and enhance performance across its software stack. The ideal candidate will have... ...years of experience in deep learning, specifically in inference. This role involves profiling, analyzing...Senior
$244.8k
...research groups dedicated to generative models for content creation, image... ...experienced Multimodal Model Training and Inference Optimization Engineer with expertise in optimizing AI model... ...ideal candidate will work at the cutting edge of AI efficiency, enhancing the...SeniorTemporary workLocal area- ...NVIDIA Gruppe is looking for a skilled engineer to join their TensorRT Edge-LLM team in Santa Clara, California. The role involves developing a state-of-the-art inference framework for large language models and optimizing it for real-time performance on embedded platforms...Senior
$184k - $356.5k
...technology company in California is seeking a Senior DL Algorithms Engineer to drive inference performance for Deep Learning... ...The role involves implementing advanced model inference and collaborating with co-design teams to optimize performance across hardware and...Senior$184k - $287.5k
Senior DL Algorithms Engineer - Inference Performance... ...of performance analysis and optimization to help us squeeze every last clock... ...Implement language and multimodal model inference as part of NVIDIA Inference...Senior$184k - $356.5k
...NVIDIA Gruppe is looking for a Senior Software Engineer specializing in Deep Learning Inference in Santa Clara, California. You will design and optimize GPU-accelerated software critical for advanced AI applications, contributing to libraries like vLLM and SGLang. Ideal...Senior$184k - $356.5k
NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming...Senior$152k - $241.5k
...NVIDIA Gruppe is looking for skilled software engineers to join the CUDA Tile team, focusing on a new tile-based programming model for NVIDIA GPUs. The ideal candidates will... ...at least 3 years of experience in compiler optimization and proficient skills in C/C++ programming....Senior- ...NVIDIA Gruppe in Santa Clara, California is seeking a Senior Software Engineer specializing in Deep Learning Inference. In this role, you will craft and develop high-performance software tailored for scalable platforms while collaborating with experts in the field. The...Senior
$224k - $356.5k
...high-performance computing. As a Senior / Principal Deep Learning Engineer — Model Evaluation & AI Systems, you... ...Work alongside model training, inference, and product divisions to provide... ...signals that inform release and optimization decisions. What we need to see:...Senior$184k - $356.5k
...company in California is seeking a Senior Deep Learning Software Engineer focused on performance optimization of LLM models. You will analyze and enhance LLM inference performance, working in cross-... ...collaborative teams to implement cutting-edge algorithms. The ideal candidate...SeniorFull time$184k - $287.5k
...Gruppe is seeking talented AI systems engineers to advance innovative technologies in AI inference systems software. This role involves developing cutting-edge libraries, code generators, and... ...will design, implement, and optimize kernels while collaborating with cross...Senior$152k - $241.5k
...eager to work on cutting-edge AI technology for... ...s TensorRT team as a Senior Software Engineer, and be at the forefront... ...high-performance AI inference solutions for... ...Contribute to performance optimization and benchmarking efforts... ...-art deep learning models (such as Large Language...Senior- ...A leading technology firm is seeking a Senior Software Engineer for Quantized Inference to implement quantized recipes for advanced model optimization. This role demands strong skills in Python and C++, alongside experience in ML accelerators and software engineering fundamentals...Senior
- ...Advanced Micro Devices in Santa Clara seeks a Senior ML Engineer focused on optimizing large language model inference runtimes. The role involves architecting distributed systems and enhancing performance across GPUs. Ideal candidates will have expertise in Python and...Senior
$152k - $241.5k
Responsibilities Build performance modeling and prediction tools for AI workloads at Data... ...TensorFlow, distributed training and inference Knowledge of GPU cluster job scheduling... ...makes a candidate stand out Proven SW engineering experience experience in deploying SW at...Senior$152k - $241.5k
...moving multifaceted software team! This software engineering role involves developing datacenter scale performance modeling and predictions tools for AI researchers... ...PyTorch and TensorFlow, distributed training and inference. Knowledge of GPU cluster job scheduling (Slurm...Senior$152k - $241.5k
...AI & Deep Learning Compiler Engineer. NVIDIA is hiring software engineers... ...areas, e.g. large language models, generative AI,... ...been the backbone of NVIDIA’s inference engine, spanning across data... ...networks and developing compiler optimization algorithms. Collaborating...Senior- ...individual to focus on AI and multi-modal large models in Santa Clara, California. This position involves designing and optimizing computer vision algorithms, implementing C++ solutions, and contributing to cutting-edge technologies shaping the future of transportation...Senior
$184k - $287.5k
...every new AI-powered application is built. We are seeking a senior vision language model engineer to design and build agentic data and training workflows... ...at NVIDIA and contribute to a team that is pushing the edges of what can be done in AI and computer vision. We’re...Senior$152k - $287.5k
...NVIDIA Gruppe is seeking a Senior Machine Learning Applications and Compiler Engineer in Santa Clara, California. This role involves developing algorithms for their LPX inference and compiler stack, optimizing the performance of neural network workloads on NVIDIA platforms...Senior- ...Advanced Micro Devices is seeking a strategic software engineering lead in Santa Clara, California. This role involves improving... ...software. Key responsibilities include developing techniques for inference optimization and supporting the ROCm ecosystem expansion. A Bachelor’s...Senior
$212.8k
...- Convert and compile ML models for execution on edge NPUs, and apply quantization... ...- Apply hardware-aware optimization strategies, such as... ...Computer Science, Electrical Engineering, Computer Engineering, or... ...- Understanding of model inference constraints on edge devices...Temporary workLocal area- ...· Department: Backend Engineer · Work type: On-Site About... ...building a foundation model for the physical world,... ...to work on the cutting edge of physical AI and don’... ...own critical services, optimize for latency and... ..., low-latency AI model inference and data services. Partner...SeniorFull time
$184k - $287.5k
...seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect... ...implement high-performance inference stacks, optimize GPU kernels and compilers, drive industry benchmarks...Senior$184k - $287.5k
...accelerating it. The TensorRT inference platform is the backbone... ...deployment of cutting-edge deep learning models on every NVIDIA GPU. With... ...highly skilled and driven Engineering Manager to take the lead in... ...kernel development, runtime optimizations, and frameworks for LLM inference...- NVIDIA is looking for a Deep Learning Software Engineer to analyze and optimize the performance of our inference ecosystem. This role involves developing benchmarking methodologies and features for frameworks like TensorRT, as well as working cross-functionally across various...
$128.7k - $261.3k
...repeatable, high-velocity model deployments through... ...deployment and infra engineers to ship numerically robust... ...focused on model optimization and deployment, with significant... .../ efficient inference or relevant experience... ...Give You A Competitive Edge (Preferred Qualifications...SeniorLocal areaRemote workWork from homeRelocation packageFlexible hours- ...week. The role: Analog Design Engineer, Senior / Staff /Sr. Staff What You... ...for Artificial Intelligence Inference Accelerator and High-Speed... ...from 4nm and below, and to optimize design and layout to achieve... ...of design/layout. • Provide model of circuit for backend verification...Senior3 days per week

