Senior Quantized Inference Engineer for Efficient AI
$152k - $287.5kNVIDIA
NVIDIA is seeking a Senior Software Engineer for Quantized Inference in Santa Clara, CA. This role involves implementing quantized and sparse recipes in inference engines, ensuring efficient model export pipelines, and optimizing throughput for large language models. The ideal candidate will have extensive experience in software engineering, particularly in Python and C++, along with a strong academic background in Computer Science. The salary ranges from $152,000 to $287,500, including equity and benefits. #J-18808-Ljbffr
- ...A leading technology firm is seeking a Senior Software Engineer for Quantized Inference to implement quantized recipes for advanced model optimization. This role demands strong skills in Python and C++, alongside experience in ML accelerators and software engineering...Senior
$152k - $241.5k
...Senior Software Engineer, Quantized Inference page is loaded## Senior Software Engineer, Quantized Inferencelocations... ...the discovery and deployment of efficient inference recipes for LLMs. A recipe... ...concise, well-tested code; fluent with AI-assisted tooling* Experience with...Senior$184k - $287.5k
...Position Overview We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-performance inference stacks, optimize GPU kernels and...Senior$184k - $356.5k
NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming...Senior- ...computing experiences—from AI and data centers, to PCs,... ...your career. THE ROLE As a senior member of the LLM inference framework team, you will... ...intersection of inference engines, distributed systems, and... ...throughput, latency, and memory efficiency across single‑GPU and...Senior
$184k - $356.5k
...NVIDIA Gruppe is looking for a Senior Software Engineer specializing in Deep Learning Inference in Santa Clara, California. You will design and optimize GPU-accelerated software critical for advanced AI applications, contributing to libraries like vLLM and SGLang. Ideal...Senior- ...Advanced Micro Devices is seeking a strategic software engineering lead in Santa Clara, California. This role involves improving application... .... Key responsibilities include developing techniques for inference optimization and supporting the ROCm ecosystem expansion. A Bachelor...Senior
$152k - $241.5k
...recently, GPU deep learning ignited modern AI — the next era of computing — with the... ...looking for an AI & Deep Learning Compiler Engineer. NVIDIA is hiring software engineers for... ...our DLC has been the backbone of NVIDIA’s inference engine, spanning across data centers,...Senior$184k - $287.5k
NVIDIA Gruppe is seeking talented AI systems engineers to advance innovative technologies in AI inference systems software. This role involves developing cutting-edge libraries, code generators, and kernel technologies for NVIDIA's architecture, emphasizing high-impact...Senior$152k - $241.5k
...learning and eager to work on cutting-edge AI technology for safety-critical applications? Join NVIDIA's TensorRT team as a Senior Software Engineer, and be at the forefront of technology, enabling high-performance AI inference solutions for automotive safety and other...Senior- ...Full Time · Department: Backend Engineer · Work type: On-Site About A rchetype AI Archetype AI is developing the... ..., low-latency AI model inference and data services. Partner with... ...push the limits of scalability, efficiency, and reliability. Own problems...SeniorFull time
$135.8k - $237.05k
...Mountain View, CA, USA Senior Backend Engineer, ML Inference Systems Location Mountain View, CA, USA Department AI & Machine Learning Requisition ID JOBREQ-26... ...Ensure the reliability, scalability, and efficiency of our systems in production using...SeniorWork at officeWorldwideRelocation package- ...computing experiences-from AI and data centers, to PCs,... ...AMD is looking for a Senior Staff AI Infra Engineer who is passionate about improving... ...LLM training and inference on AMD GPUs, improving kernel... ...communication, and end-to-end system efficiency. • Develop and enhance...
$207k - $300k
Senior Research Engineer, On-Device Inference, Robotics, DeepMind corporate_fare DeepMind place Mountain View, CA, USA... ...to align model architectures with AI accelerators (e.g., distillation).... ...company and, in order to facilitate efficient collaboration and communication...SeniorFull time- ...leading technology company is seeking a Senior AI Software Engineer to join their team in Santa Clara,... ...AI systems software for inference applications including deep learning... ...s leading technology and ensure the efficiency of AI workloads. Ideal candidates will...Senior
$184k - $287.5k
...NVIDIA Gruppe in Santa Clara is seeking an AI Systems Engineer to innovate and develop cutting-edge technologies in the AI inference software stack. Candidates should hold a... ...systems development. The role involves building efficient kernels and compilers for AI workloads...Senior$184k - $287.5k
...NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key contributor, you... ...powers today’s most sophisticated AI applications. Our team is responsible... ..., which are at the forefront of efficient large-scale model serving and...Senior$152k - $241.5k
...driving advancements in AI and machine learning to... ...talented and motivated engineers to join our TensorRT... ...-leading deep learning inference software for NVIDIA AI accelerators. As a Senior Software Engineer in the... ...CUDA for seamless and efficient deployment of state-of-...Senior$152k - $241.5k
...NVIDIA is the platform upon which every new AI‑powered application is built. We are seeking a Senior Software Engineer – AI Inference to advance open‑source LLM serving by... ...request lifecycle management, and KV‑cache efficiency (paging/sharding) to improve throughput and...Senior- ...generation computing experiences—from AI and data centers, to PCs,... ...and SOTA LLM and Multimodal inference at scale across multi-GPU and... .... THE PERSON: Skilled engineer with strong technical and analyticalexpertisein... ...other DSLs for AI operator efficiency. Collaborate with GPU Library...Senior
$126k - $248k
...MongoDB is seeking a Senior Engineer in Palo Alto to help build a next-generation inference platform integrated with MongoDB Atlas. This role involves designing and building... ...of the inference platform, collaborating with AI engineers, and improving performance in a cloud-...Senior$152k - $241.5k
...NVIDIA is hiring an AI & Deep Learning Compiler Engineer for its Deep Learning & AI Compiler (DLC) team. What you’ll be doing Analyzing deep learning networks and developing compiler optimization algorithms. Collaborating with members of the deep learning software framework...Senior$152k - $241.5k
NVIDIA in Santa Clara is seeking a Compiler Engineer to drive technical innovation in AI workloads and optimize NVIDIA GPUs. The role involves participating in hands-on development, collaborating across divisions, and solving complex compilation problems. Applicants should...Senior$165k - $242k
...Senior Software Engineer II, Inference Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology... ...custom accelerators for high-efficiency workloads ~ Hands-on experience with...SeniorPermanent employmentTemporary workCasual workWork at officeRemote workFlexible hoursShift work$155k - $250k
...MixMode is seeking a Senior Analog Design Engineer to contribute to analog-mixed signal integrated circuit design, including state-of-the-art AI engine development and circuit layout optimization. The ideal candidate has over 14 years of experience in high-speed circuit...Senior$168k - $270.25k
...Senior Software Engineer, Distributed Systems - NIM Factory page is loaded## Senior... ...upon which every new AI-powered application is built... ...and automation for NVIDIA Inference Microservices (NIMs). The right... ...expertise to design an efficient, scalable and reliable automation...SeniorRemote work- A leading technology company is seeking a skilled engineer to optimize deep learning frameworks and enhance GPU kernel performance. The ideal... ...dynamic work environment with a focus on innovative solutions and advancing AI technologies. #J-18808-Ljbffr Advanced Micro DevicesSenior
$168k - $270.25k
NVIDIA is seeking a senior software engineer to automate and optimize performance analysis workflows for AI training and inference. You will design and build tools that enhance efficiency across engineering teams. Ideal candidates will possess an M.S. or PhD in a relevant...Senior$152k - $287.5k
...NVIDIA Gruppe is seeking a Senior Machine Learning Applications and Compiler Engineer in Santa Clara, California. This role involves developing algorithms for their LPX inference and compiler stack, optimizing the performance of neural network workloads on NVIDIA platforms...Senior- ...builds the world's largest AI chip, 56 times larger than... ...industry‑leading training and inference speeds and empowers machine... ...The Role We are hiring a Senior Performance Engineer to join our Product team. You... ...) with a systems or efficiency focus. Contributions to open...SeniorContract workShift work
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Quantized Inference Engineer for Efficient AI. Be the first to apply!
- senior cost analyst Santa Clara, CA
- senior computer engineer Santa Clara, CA
- senior development engineer Santa Clara, CA
- senior manager quality engineering Santa Clara, CA
- senior software test automation engineer Santa Clara, CA
- senior design technologist Santa Clara, CA
- senior design verification engineer Santa Clara, CA
- senior director quality Santa Clara, CA
- senior director of development Santa Clara, CA
- sr project engineer Santa Clara, CA

