Senior AI Systems Engineer — SGLang & Inference on GPUs
Advanced Micro Devices
A leading technology company is seeking a skilled engineer to optimize deep learning frameworks and enhance GPU kernel performance. The ideal candidate excels in collaborative and independent work, with a strong background in C++ and Python. Responsibilities include optimizing frameworks like TensorFlow and PyTorch and working closely with GPU software teams. This role promises a dynamic work environment with a focus on innovative solutions and advancing AI technologies. #J-18808-Ljbffr Advanced Micro Devices
- ...computing experiences-from AI and data centers,... ...and embedded systems. Grounded in a culture... ...frameworks for AMD GPUs. Your work will be... ...LLM and Multimodal inference at scale across... ...Skilled engineer with strong technical... ...TensorFlow, PyTorch, and SGLang on AMD GPUs via upstream...Senior
$184k - $356.5k
NVIDIA Corporation is seeking a Senior Deep Learning Software Engineer specializing in Inference to join their growing team in Santa... ...software for advanced AI applications, including developing... ...performance deep learning frameworks like SGLang and vLLM. Candidates should have...Senior- A leading AI infrastructure company in California is seeking a Member of Technical Staff — Inference to design and optimize large-scale AI inference systems. The role demands 5+ years in systems engineering and expertise in large-scale inference systems. Successful candidates...SeniorFlexible hours
- A leader in AI technology in Palo Alto is seeking a Senior AI Systems Performance Engineer to optimize the latest foundation models on their innovative platform. This role involves collaborating with cross-functional teams to push the performance limits of AI systems....Senior
$152k - $241.5k
...advancements in AI and machine... ...and motivated engineers to join our TensorRT... ...deep learning inference software for NVIDIA... .... As a Senior Software Engineer... ...on NVIDIA GPUs. If you're ready... ...Compilers, or System Software. ~ Excellent... ...-LLM, vLLM, SGLang. Experience...Senior- A leading technology company is seeking a Senior AI Software Engineer to join their team in Santa Clara, California. In this role, you will innovate and develop groundbreaking AI systems software for inference applications including deep learning framework optimizations...Senior
- A leading technology company is seeking a Senior System Software Engineer to develop GPU-accelerated AI inference serving software. The ideal candidate will have over 5 years of experience with deep learning software, strong skills in Rust and C++, and a collaborative approach...Senior
$152k - $241.5k
...platform upon which every new AI‑powered application is built. We are seeking a Senior Software Engineer – AI Inference to advance open‑source LLM... ...inference engines like vLLM and SGLang-ensuring they run best‑in‑class on NVIDIA GPUs and systems-and by improving the...Senior$184k - $287.5k
...NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key... ...today's most sophisticated AI applications. Our team... ...frameworks, including SGLang and vLLM, which are at... ...accelerators, from datacenter GPUs to edge SoCs. You'll...SeniorRemote work- d-Matrix, based in Santa Clara, CA, is seeking a Staff Runtime Systems Engineer to lead the development of runtime software for AI inference platforms. You'll be responsible for architecting and developing firmware for multiprocessor systems-on-chip, collaborating with...Senior3 days per week
$136.5k - $253.5k
...highly skilled and experienced AI Systems Engineer to join our team. This is a hands‑on, senior individual contributor role that... ...experience supporting compute, GPUs, and AI services on both GCP... ...TGI , TensorRT‑LLM ) to maximize inference throughput and minimize latency...Senior$184k - $287.5k
...We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme... ....g., PyTorch) and inference engines (e.g., vLLM and SGLang). Familiarity with GPU programming and performance...Senior- ...Senior AI/ML DevOps Engineer Join Cisco's CX AI Incubation Team as a Senior AI/ML DevOps... ...build and operate scalable AI systems that move from prototype to production... ...AI services, optimizing inference performance from CPU and small GPUs to large multi-GPU servers, including...Senior
$170.5k - $240.71k
Intel Corporation is seeking an experienced AI Software Development Engineer to drive optimization of AI inference workloads. Responsibilities include optimizing Large Language Models on GPUs and developing efficient graph-based compilation flows. Candidates should have...$184k - $287.5k
...We're looking for outstanding AI systems engineers to develop groundbreaking technologies in the inference systems software stack! We build innovative AI systems software... ...source communities like FlashInfer, vLLM, and SGLang What we need to see: ~ Masters degree...SeniorRemote work- A leading technology company in Santa Clara is seeking a Senior AI-Native Systems Software Engineer to design an AI-native framework, optimizing performance for critical use cases. This role requires strong modern C++ skills, familiarity with deep learning frameworks,...Senior
$152k - $241.5k
...unlimited potential of AI to define the next era... ...Deep Learning Compiler Engineer. NVIDIA is hiring software... ...the world are using GPUs to power a revolution in... ...generative AI, recommendation systems, image classification,... ...backbone of NVIDIA’s inference engine, spanning across...Senior$272k - $431.25k
...platform for every new AI-powered application.... ...a Principal Software Engineer - AI Inference to advance open-... ...engines like vLLM and SGLang. You will ensure they... ...outstandingly on NVIDIA GPUs and systems. You will also strengthen... ...community. Mentor senior engineers, raise the...Remote work- A leading technology company in Sunnyvale, CA is seeking a Software Engineer III to develop next-generation technologies related to AI and ML. This role requires a Bachelor's degree and experience in programming, particularly with Python or C++, along with expertise in...Senior
$152k - $241.5k
...We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a... ...Language Models, on NVIDIA GPUs? We are now welcoming... ...Experience developing System Software. Proficiency... ...vacancy. NVIDIA uses AI tools in its recruiting processes...Senior- Available Positions SambaNova Systems employs some of the greatest minds and talent in AI and machine learning. If... ...we want to hear from you. Senior AI Systems Performance Engineer Palo Alto, California,... ...performance for large-scale AI inference. Responsibilities Bring up...SeniorFull timeTemporary workLocal areaFlexible hours
- ...Machine Learning Systems Engineer We are looking for Machine Learning Systems... ...leader in 3D generative AI, recognized as the No.1 in popularity... ...in both training and inference. Your next challenge at Meshy... ...across hundreds to thousands of GPUs. Implementing and...Part timeRemote work
$163.5k - $212.4k
NIO is seeking a Senior AI Inference Infrastructure Software Engineer in San Jose, CA, specializing in building scalable inference systems for large language and vision-language models. This role requires over 5 years of software development experience and strong skills...Senior$195.2k - $275.58k
**Welcome!**.Senior AI Algorithm Engineer in oneDNN page is loaded## Senior AI Algorithm... ...graphics, and discrete GPUs.**Responsibilities***... ...best‐in‐class deep‐learning inference and training throughput on... ...learning, HPC, compilers, and systems optimization.* Contribute to...SeniorLocal areaImmediate startRemote workWorldwideFlexible hoursShift work$152k - $241.5k
Senior Software Engineer, Quantized Inference page is loaded## Senior Software Engineer, Quantized... ...engines (vLLM, TRT-LLM, SGLang). The candidate will... ...across the team: CI, build systems, training infrastructure,... ...tested code; fluent with AI-assisted tooling* Experience...Senior$128.7k - $261.3k
**Role**As a **Senior System Performance Engineer** on GM’s AV System Performance Team, you will design... ...compute environments (e.g., GPUs, DSPs, or accelerators).Familiarity... ...level software stacksExperience with AI/ML applications or inference software. · **The salary range...SeniorFlexible hours- ...Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI... ...industry-leading training and inference speeds and empowers machine learning... ...a versatile and experienced engineer to join our SOTA Training...SeniorInternship
- ...computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture... ...a strategic software engineering lead who is passionate... ...scale-up and scale-out inference. Develop methods and... ...experience with at least one of sglang, or vllm and with...
$176k - $420k
...What to Expect Tesla's AI team is pushing the... ...deploy large-scale ML systems powering products from... ...model architecture and engineer algorithmic optimizations... ...make large-scale model inference fast, reliable, and hardware... ...deploying ML models on GPUs, TPUs, or NPUs Hands...Hourly payFull timeTemporary workFlexible hours$151.8k
...AI Inference Engineer We are looking for an AI Inference Engineer with a solid background in speech... ...-the-art automatic speech recognition system and ship it to various Zoom products.... ...hardware-specific optimizations for Nvidia GPUs. Proposing new model structures by...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior AI Systems Engineer — SGLang & Inference on GPUs. Be the first to apply!
- machine learning ai engineer Santa Clara, CA
- senior ai engineer Santa Clara, CA
- ai engineer remote Santa Clara, CA
- ai ml engineer Santa Clara, CA
- ai engineer Santa Clara, CA
- ai developer Santa Clara, CA
- ai prompt engineer Santa Clara, CA
- operations support system engineer Santa Clara, CA
- mission system engineer Santa Clara, CA
- unix linux systems engineer Santa Clara, CA

