Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior AI Systems Engineer — SGLang & Inference on GPUs

Advanced Micro Devices

A leading technology company is seeking a skilled engineer to optimize deep learning frameworks and enhance GPU kernel performance. The ideal candidate excels in collaborative and independent work, with a strong background in C++ and Python. Responsibilities include optimizing frameworks like TensorFlow and PyTorch and working closely with GPU software teams. This role promises a dynamic work environment with a focus on innovative solutions and advancing AI technologies. #J-18808-Ljbffr Advanced Micro Devices

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Senior AI Systems Engineer — SGLang & Inference on GPUs in Santa Clara, CA vacancy
  •  ...computing experiences-from AI and data centers,...  ...and embedded systems. Grounded in a culture...  ...frameworks for AMD GPUs. Your work will be...  ...LLM and Multimodal inference at scale across...  ...Skilled engineer with strong technical...  ...TensorFlow, PyTorch, and SGLang on AMD GPUs via upstream... 
    Senior

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    1 day ago
  • $184k - $356.5k

    NVIDIA Corporation is seeking a Senior Deep Learning Software Engineer specializing in Inference to join their growing team in Santa...  ...software for advanced AI applications, including developing...  ...performance deep learning frameworks like SGLang and vLLM. Candidates should have... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    12 hours ago
  • A leading AI infrastructure company in California is seeking a Member of Technical Staff — Inference to design and optimize large-scale AI inference systems. The role demands 5+ years in systems engineering and expertise in large-scale inference systems. Successful candidates... 
    Senior
    Flexible hours

    RadixArk

    Palo Alto, CA
    1 day ago
  • A leader in AI technology in Palo Alto is seeking a Senior AI Systems Performance Engineer to optimize the latest foundation models on their innovative platform. This role involves collaborating with cross-functional teams to push the performance limits of AI systems.... 
    Senior

    SambaNova

    Palo Alto, CA
    12 hours ago
  • $152k - $241.5k

     ...advancements in AI and machine...  ...and motivated engineers to join our TensorRT...  ...deep learning inference software for NVIDIA...  .... As a Senior Software Engineer...  ...on NVIDIA GPUs. If you're ready...  ...Compilers, or System Software. ~ Excellent...  ...-LLM, vLLM, SGLang. Experience... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  • A leading technology company is seeking a Senior AI Software Engineer to join their team in Santa Clara, California. In this role, you will innovate and develop groundbreaking AI systems software for inference applications including deep learning framework optimizations... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    12 hours ago
  • A leading technology company is seeking a Senior System Software Engineer to develop GPU-accelerated AI inference serving software. The ideal candidate will have over 5 years of experience with deep learning software, strong skills in Rust and C++, and a collaborative approach... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $152k - $241.5k

     ...platform upon which every new AI‑powered application is built. We are seeking a Senior Software Engineer – AI Inference to advance open‑source LLM...  ...inference engines like vLLM and SGLang-ensuring they run best‑in‑class on NVIDIA GPUs and systems-and by improving the... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $184k - $287.5k

     ...NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference for our growing team. As a key...  ...today's most sophisticated AI applications. Our team...  ...frameworks, including SGLang and vLLM, which are at...  ...accelerators, from datacenter GPUs to edge SoCs. You'll... 
    Senior
    Remote work

    NVIDIA

    Santa Clara, CA
    5 days ago
  • d-Matrix, based in Santa Clara, CA, is seeking a Staff Runtime Systems Engineer to lead the development of runtime software for AI inference platforms. You'll be responsible for architecting and developing firmware for multiprocessor systems-on-chip, collaborating with... 
    Senior
    3 days per week

    d-Matrix

    Santa Clara, CA
    1 day ago
  • $136.5k - $253.5k

     ...highly skilled and experienced AI Systems Engineer to join our team. This is a hands‑on, senior individual contributor role that...  ...experience supporting compute, GPUs, and AI services on both GCP...  ...TGI , TensorRT‑LLM ) to maximize inference throughput and minimize latency... 
    Senior

    Cadence

    San Jose, CA
    12 hours ago
  • $184k - $287.5k

     ...We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme...  ....g., PyTorch) and inference engines (e.g., vLLM and SGLang). Familiarity with GPU programming and performance... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  •  ...Senior AI/ML DevOps Engineer Join Cisco's CX AI Incubation Team as a Senior AI/ML DevOps...  ...build and operate scalable AI systems that move from prototype to production...  ...AI services, optimizing inference performance from CPU and small GPUs to large multi-GPU servers, including... 
    Senior

    Webex Events (formerly Socio)

    San Jose, CA
    2 days ago
  • $170.5k - $240.71k

    Intel Corporation is seeking an experienced AI Software Development Engineer to drive optimization of AI inference workloads. Responsibilities include optimizing Large Language Models on GPUs and developing efficient graph-based compilation flows. Candidates should have... 

    Intel Corporation

    Santa Clara, CA
    12 hours ago
  • $184k - $287.5k

     ...We're looking for outstanding AI systems engineers to develop groundbreaking technologies in the inference systems software stack! We build innovative AI systems software...  ...source communities like FlashInfer, vLLM, and SGLang What we need to see: ~ Masters degree... 
    Senior
    Remote work

    NVIDIA

    Santa Clara, CA
    1 day ago
  • A leading technology company in Santa Clara is seeking a Senior AI-Native Systems Software Engineer to design an AI-native framework, optimizing performance for critical use cases. This role requires strong modern C++ skills, familiarity with deep learning frameworks,... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $152k - $241.5k

     ...unlimited potential of AI to define the next era...  ...Deep Learning Compiler Engineer. NVIDIA is hiring software...  ...the world are using GPUs to power a revolution in...  ...generative AI, recommendation systems, image classification,...  ...backbone of NVIDIA’s inference engine, spanning across... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $272k - $431.25k

     ...platform for every new AI-powered application....  ...a Principal Software Engineer - AI Inference to advance open-...  ...engines like vLLM and SGLang. You will ensure they...  ...outstandingly on NVIDIA GPUs and systems. You will also strengthen...  ...community. Mentor senior engineers, raise the... 
    Remote work

    NVIDIA

    Santa Clara, CA
    1 day ago
  • A leading technology company in Sunnyvale, CA is seeking a Software Engineer III to develop next-generation technologies related to AI and ML. This role requires a Bachelor's degree and experience in programming, particularly with Python or C++, along with expertise in... 
    Senior

    Google Inc.

    Sunnyvale, CA
    4 days ago
  • $152k - $241.5k

     ...We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a...  ...Language Models, on NVIDIA GPUs? We are now welcoming...  ...Experience developing System Software. Proficiency...  ...vacancy. NVIDIA uses AI tools in its recruiting processes... 
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  • Available Positions SambaNova Systems employs some of the greatest minds and talent in AI and machine learning. If...  ...we want to hear from you. Senior AI Systems Performance Engineer Palo Alto, California,...  ...performance for large-scale AI inference. Responsibilities Bring up... 
    Senior
    Full time
    Temporary work
    Local area
    Flexible hours

    SambaNova

    Palo Alto, CA
    12 hours ago
  •  ...Machine Learning Systems Engineer We are looking for Machine Learning Systems...  ...leader in 3D generative AI, recognized as the No.1 in popularity...  ...in both training and inference. Your next challenge at Meshy...  ...across hundreds to thousands of GPUs. Implementing and... 
    Part time
    Remote work

    Meshy

    Sunnyvale, CA
    2 days ago
  • $163.5k - $212.4k

    NIO is seeking a Senior AI Inference Infrastructure Software Engineer in San Jose, CA, specializing in building scalable inference systems for large language and vision-language models. This role requires over 5 years of software development experience and strong skills... 
    Senior

    nio.com

    San Jose, CA
    1 day ago
  • $195.2k - $275.58k

    **Welcome!**.Senior AI Algorithm Engineer in oneDNN page is loaded## Senior AI Algorithm...  ...graphics, and discrete GPUs.**Responsibilities***...  ...best‐in‐class deep‐learning inference and training throughput on...  ...learning, HPC, compilers, and systems optimization.* Contribute to... 
    Senior
    Local area
    Immediate start
    Remote work
    Worldwide
    Flexible hours
    Shift work

    Intel Corporation

    Santa Clara, CA
    1 day ago
  • $152k - $241.5k

    Senior Software Engineer, Quantized Inference page is loaded## Senior Software Engineer, Quantized...  ...engines (vLLM, TRT-LLM, SGLang). The candidate will...  ...across the team: CI, build systems, training infrastructure,...  ...tested code; fluent with AI-assisted tooling* Experience... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $128.7k - $261.3k

    **Role**As a **Senior System Performance Engineer** on GM’s AV System Performance Team, you will design...  ...compute environments (e.g., GPUs, DSPs, or accelerators).Familiarity...  ...level software stacksExperience with AI/ML applications or inference software. · **The salary range... 
    Senior
    Flexible hours

    General Motors

    Sunnyvale, CA
    2 days ago
  •  ...Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI...  ...industry-leading training and inference speeds and empowers machine learning...  ...a versatile and experienced engineer to join our SOTA Training... 
    Senior
    Internship

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    1 day ago
  •  ...computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture...  ...a strategic software engineering lead who is passionate...  ...scale-up and scale-out inference. Develop methods and...  ...experience with at least one of sglang, or vllm and with... 

    Advanced Micro Devices , Inc.

    Santa Clara, CA
    12 hours ago
  • $176k - $420k

     ...What to Expect Tesla's AI team is pushing the...  ...deploy large-scale ML systems powering products from...  ...model architecture and engineer algorithmic optimizations...  ...make large-scale model inference fast, reliable, and hardware...  ...deploying ML models on GPUs, TPUs, or NPUs Hands... 
    Hourly pay
    Full time
    Temporary work
    Flexible hours

    Tesla

    Palo Alto, CA
    12 hours ago
  • $151.8k

     ...AI Inference Engineer We are looking for an AI Inference Engineer with a solid background in speech...  ...-the-art automatic speech recognition system and ship it to various Zoom products....  ...hardware-specific optimizations for Nvidia GPUs. Proposing new model structures by... 

    Zoom Video Communications

    San Jose, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior AI Systems Engineer — SGLang & Inference on GPUs. Be the first to apply!