Senior AI Systems Engineer — SGLang & Inference on GPUs
Advanced Micro Devices
A leading technology company is seeking a skilled engineer to optimize deep learning frameworks and enhance GPU kernel performance. The ideal candidate excels in collaborative and independent work, with a strong background in C++ and Python. Responsibilities include optimizing frameworks like TensorFlow and PyTorch and working closely with GPU software teams. This role promises a dynamic work environment with a focus on innovative solutions and advancing AI technologies. #J-18808-Ljbffr Advanced Micro Devices
$184k - $287.5k
NVIDIA Gruppe is seeking talented AI systems engineers to advance innovative technologies in AI inference systems software. This role involves developing cutting-edge libraries, code generators, and kernel technologies for NVIDIA's architecture, emphasizing high-impact...Senior- ...computing experiences-from AI and data centers,... ...and embedded systems. Grounded in a culture... ...frameworks for AMD GPUs. Your work will be... ...LLM and Multimodal inference at scale across... ...Skilled engineer with strong technical... ...TensorFlow, PyTorch, and SGLang on AMD GPUs via upstream...Senior
$184k - $356.5k
NVIDIA Gruppe is looking for skilled software engineers to develop AI inference systems that operate with high efficiency. The role involves architecting high-performance inference frameworks and optimizing GPU processes. Ideal candidates should have extensive programming...Senior- ...computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of... ...AMD is looking for a Senior Staff AI Infra Engineer who is passionate about improving... ...LLM training and inference on AMD GPUs, improving kernel, communication...Suggested
$184k - $287.5k
A leading technology company is seeking a Senior Software Engineer for AI and DL Kernel Libraries in Santa Clara, CA. The role involves designing... ...with 6+ years of experience preferably in deep learning systems. The position offers a salary range of $184,000 - $287,50...SeniorRemote job$212.8k
...are dedicated to building the inference infrastructure for ultra-... ...models, and frontier multimodal AI systems. Our mission is to provide a... ...Computer Science, Software Engineering, Artificial Intelligence,... ...frameworks such as vLLM and SGLang, with hands-on experience in...SeniorTemporary workLocal area- A leading AI infrastructure company in California is seeking a Member of Technical Staff — Inference to design and optimize large-scale AI inference systems. The role demands 5+ years in systems engineering and expertise in large-scale inference systems. Successful candidates...SeniorFlexible hours
- ...experiences—from AI and data centers,... ...gaming and embedded systems. Grounded in a... .... THE ROLE As a senior member of the LLM inference framework team, you... ...models on AMD GPUs. You will work at... ...such as vLLM and SGLang to make AMD a first... ...of inference engines, distributed systems...Senior
$184k - $287.5k
...NVIDIA Gruppe in Santa Clara is seeking an AI Systems Engineer to innovate and develop cutting-edge technologies in the AI inference software stack. Candidates should hold a Master's degree and possess over 6 years of experience in ML/DL systems development. The role involves...Senior$152k - $241.5k
...NVIDIA Gruppe is seeking an AI & Deep Learning Compiler Engineer for its Deep Learning & AI Compiler team in Santa Clara, California. This role involves... ...compiler algorithms to enhance performance on NVIDIA GPUs. Ideal candidates should have a Bachelor’s, Master’s, or...Senior$152k - $241.5k
...and benchmark GenAI inference on NVIDIA's latest... ...TensorRT-LLM, SGLang, and vLLM, building... ...of GPU performance engineering and public accountability... ...other emerging AI use cases.... ...across clusters of GPUs. Establish performance... ...high-performance systems. Deep understanding...$120.1k - $225.7k
...Role Entails End-to-End Inference Optimization: Lead the... ...team members to build a robust AI inference technical ecosystem... ...Computer Science, Electronic Engineering, AI, or related fields; significant... ...Intelligent Routing . Systems Proficiency: Expert in...SeniorRelocation package- A leader in AI technology in Palo Alto is seeking a Senior AI Systems Performance Engineer to optimize the latest foundation models on their innovative platform. This role involves collaborating with cross-functional teams to push the performance limits of AI systems....Senior
$152k - $241.5k
...NVIDIA Gruppe is seeking a Senior Software Engineer – AI Inference in Santa Clara, California. This role involves enhancing open-source LLM serving optimizations and implementing high-performance runtime capabilities. Candidates should have 5+ years of experience in building...Senior$160k - $180k
...team members. What You’ll Do As a Senior AI Systems Engineer, you will architect, deploy, and manage... ...large-scale AI model training and inference. You will ensure our machine learning... ...scale using frameworks like vLLM and SGLang. Data Engineering for AI: Experience...SeniorLocal area- A leading technology company is seeking a Senior System Software Engineer to develop GPU-accelerated AI inference serving software. The ideal candidate will have over 5 years of experience with deep learning software, strong skills in Rust and C++, and a collaborative approach...Senior
$152k - $241.5k
...seeking a talented individual to optimize and benchmark GenAI inference using the latest acceleration technologies. The role involves driving... ...benchmark results and architecting distributed inference systems. Required qualifications include a relevant degree and significant...Senior- A leading technology company is seeking a Senior AI Software Engineer to join their team in Santa Clara, California. In this role, you will innovate and develop groundbreaking AI systems software for inference applications including deep learning framework optimizations...Senior
- NVIDIA Corporation is looking for a Senior Inference Engineer to advance AIConfigurator, enhancing model serving and performance for large-scale LLM inference. This role entails developing production-quality APIs and integrating complex deployment configurations on NVIDIA...Senior
$152k - $241.5k
...platform upon which every new AI‑powered application is built. We are seeking a Senior Software Engineer - AI Inference to advance open‑source LLM... ...inference engines like vLLM and SGLang-ensuring they run best‑in‑class on NVIDIA GPUs and systems-and by improving the...Senior$184k - $287.5k
...We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme... ....g., PyTorch) and inference engines (e.g., vLLM and SGLang). Familiarity with GPU programming and performance:...Senior$152k - $287.5k
...NVIDIA Gruppe is seeking a Senior Machine Learning Applications and Compiler Engineer in Santa Clara, California. This role involves developing algorithms for their LPX inference and compiler stack, optimizing the performance of neural network workloads on NVIDIA platforms...Senior- NVIDIA Gruppe is seeking an experienced engineer to join the new Agentic Engineering team within the Deep Learning Framework Group. This... ..., showcasing strong Python skills and knowledge of GPU systems. A passion for the evolving landscape of ML hardware is essential...Senior
$152k - $241.5k
NVIDIA Corporation is seeking a Senior Systems Software Engineer in Santa Clara to develop innovative AI products focused on semiconductor inspection. This role requires expertise in deep learning and computer vision, alongside strong programming skills in Python. The...Senior$152k - $241.5k
NVIDIA Gruppe in Santa Clara is seeking a Sr. Software Engineer specializing in AI-driven semiconductor inspection. This role involves defining and prototyping AI system architectures, developing next-gen products, and collaborating with various teams to enhance inspection...Senior- NVIDIA Corporation in Santa Clara is seeking a talented individual to enhance semiconductor inspection through AI. The role involves developing inspection workflows and collaborating with various teams to create innovative AI products. Candidates should possess an MS or...Senior
$152k - $241.5k
...NVIDIA's GPUs are at the core of modern AI infrastructure, from training large-scale models to running inference in production. That position depends on... ...hardware, and compiler engineering is a big part of what makes... ...compilers and AI systems. We build innovative AI...Senior$152k - $287.5k
NVIDIA Gruppe is seeking a highly motivated Software Engineer to contribute to the design and development of large-scale AI systems. The successful candidate will work on scalable infrastructure for ML training and cloud-native platforms, leveraging cutting-edge technologies...Senior- ...Senior AI Systems Performance Engineer Palo Alto, California, United States The era of pervasive AI has arrived. In this era, organizations will... ...to deliver world-record performance for large-scale AI inference. Responsibilities Bring up and optimize...Senior
- NVIDIA Gruppe in Santa Clara seeks a Software Engineer to join the Managed AI Research Superclusters team. You'll design and operate cutting-edge... ...candidate has over 5 years of experience in distributed systems, excellent programming skills in C++, Python or Go, and a...Senior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior AI Systems Engineer — SGLang & Inference on GPUs. Be the first to apply!
- ai engineer remote Santa Clara, CA
- ai prompt engineer Santa Clara, CA
- senior ai engineer Santa Clara, CA
- machine learning ai engineer Santa Clara, CA
- ai engineer Santa Clara, CA
- ai developer Santa Clara, CA
- ai ml engineer Santa Clara, CA
- electronic systems engineer Santa Clara, CA
- space systems engineer Santa Clara, CA
- systems engineer Santa Clara, CA

