Kubernetes AI Inference Tech Lead

Advanced Micro Devices , Inc.

Advanced Micro Devices in Santa Clara, California, seeks a strategic software engineering lead. This role entails developing techniques for optimizing key applications, particularly for large-scale inference within the K8s ecosystem. Successful candidates should possess leadership skills, effective communication abilities, and a strong background in software engineering. The position requires a Bachelor's or Master's degree in Computer Science or related fields. Benefits details are available under AMD benefits at a glance. #J-18808-Ljbffr

Apply

Vacancy posted 3 days ago

Similar jobs that could be interesting for youBased on the Kubernetes AI Inference Tech Lead in Santa Clara, CA vacancy

Staff Software Engineer: AI Inference Infra & Kubernetes
...infrastructure for high-performance, low-latency inference services. Applicants should have a... .... The position involves deploying Kubernetes services, optimizing resource allocation... ...The environment supports growth and diversity in tech. #J-18808-Ljbffr Cerebras Systems
Suggested
Cerebras Systems
Sunnyvale, CA
1 day ago
Tech Lead, Data & Inference Engineer
...Tech Lead, Data & Inference Engineer San Jose, California, United States About the Job Tech Lead,... ...with a specialized vertical in Applied AI, Machine Learning, and Data Science. We... ..., and observability. ~ Exposure to Kubernetes and cloud infrastructure (AWS, GCP, or...
Suggested
Full time
Catalyst Labs, LLC
San Jose, CA
21 hours ago
Senior Software Engineer, Inference
$152k - $204k
...Senior Software Engineer, Inference Sunnyvale, CA / Bellevue, WA... ...CoreWeave is The Essential Cloud for AI™. Built for pioneers by... ...with confidence. Trusted by leading AI labs, startups, and global... ...hardware teams to evolve our Kubernetes-native inference platform and...
Suggested
Permanent employment
Temporary work
Casual work
Work at office
Flexible hours
Shift work
CoreWeave
Sunnyvale, CA
1 day ago
Senior Software Engineer I, Inference
$139k - $204k
...Senior Software Engineer I, Inference Sunnyvale, CA / Bellevue,... ...CoreWeave is The Essential Cloud for AI™. Built for pioneers by... ...with confidence. Trusted by leading AI labs, startups, and global... ...hardware teams to evolve our Kubernetes-native inference platform and...
Suggested
Permanent employment
Temporary work
Casual work
Work at office
Remote work
Flexible hours
Shift work
CoreWeave
Sunnyvale, CA
1 day ago
Staff Software Engineer, Inference
$188k - $275k
...Staff Software Engineer, Inference CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave... ...AI with confidence. Trusted by leading AI labs, startups, and global enterprises... ...builds and operates CoreWeave's Kubernetes-native inference platform,...
Suggested
Permanent employment
Temporary work
Casual work
Work at office
Flexible hours
CoreWeave
Sunnyvale, CA
2 days ago
Tech Lead Manager, AI Inference Benchmarking
$224k - $356.5k
...NVIDIA Gruppe in Santa Clara is seeking a Technical Lead Manager to lead the AIPerf engineering team. In this role, you will be responsible... ...the AIPerf platform as the leading benchmarking tool in AI performance measurement. Candidates should have over 8 years of software...
NVIDIA Gruppe
Santa Clara, CA
3 days ago
Software Engineer, Inference AI/ML
$92k - $135k
...CoreWeave is The Essential Cloud for AI™. Built for pioneers by... ...with confidence. Trusted by leading AI labs, startups, and global... ...What You'll Do: Join the Inference team to ship production features... ...Exposure to containers and Kubernetes (coursework or projects welcome...
Permanent employment
Temporary work
Casual work
Internship
Work at office
Flexible hours
CoreWeave
Sunnyvale, CA
1 day ago
Senior Software Engineer, AI Inference Systems
$184k - $287.5k
...software engineers to join us and build AI inference systems that serve large-scale models... ...and NVIDIA’s submissions to the industry‑leading MLPerf Inference benchmarking suite.... ...containers and orchestration (Docker, Kubernetes, Slurm); familiarity with Linux namespaces...
NVIDIA Gruppe
Santa Clara, CA
2 days ago
Staff Software Engineer, Inference
$188k - $275k
...CoreWeave is The Essential Cloud for AI™. Built for pioneers by... ...with confidence. Trusted by leading AI labs, startups, and global... ...at What You'll Do: Inference Platform Team The Inference... ...builds and operates CoreWeave's Kubernetes-native inference platform,...
Permanent employment
Temporary work
Casual work
Work at office
Flexible hours
CoreWeave
Sunnyvale, CA
16 days ago
Staff Software Engineer, Inference Platform
...Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our... ...allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning... ...systems architecture ideally with kubernetes. ~ Strong track record of making...
CEREBRAS SYSTEMS INC.
Sunnyvale, CA
2 days ago
Senior Staff Engineer — AI Inference & Cloud Infra
$230k - $250k
...involves designing resilient software features for cloud-based AI inference, leveraging AWS tools and services. Candidates should have a... ...and experience with containerization tools like Docker and Kubernetes. A competitive salary between $230,000 and $250,000 is offered...
Cerebras Systems
Sunnyvale, CA
1 day ago
Principal AI Inference Systems Engineer
...generation computing experiences-from AI and data centers, to PCs,... ...Key Responsibilities: • Lead technical initiatives and provide... ...accelerate LLM training and inference on AMD GPUs, improving kernel,... ...or inference platforms using Kubernetes, Ray, or Kubeflow. • Familiarity...
Advanced Micro Devices , Inc.
Santa Clara, CA
3 days ago
Senior Software Development Engineer - SGLang and Inference Stack
...products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded... ...models, and enabling RL training and SOTA LLM and Multimodal inference at scale across multi-GPU and multi-node systems. You will collaborate...
Advanced Micro Devices , Inc.
Santa Clara, CA
21 hours ago
Senior Software Development Engineer - LLM Inference Framework
...products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded... ...your career. THE ROLE: As a senior member of the LLM inference framework team, you will be responsible for building and optimizing...
Advanced Micro Devices , Inc.
Santa Clara, CA
21 hours ago
Software Engineer, Kubernetes
$153k - $204k
...Software Engineer, Kubernetes Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers... ...scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises...
Permanent employment
Temporary work
Casual work
Work at office
Flexible hours
CoreWeave
Sunnyvale, CA
1 day ago
Software Engineer - AI & Edge Kubernetes Orchestration - San Jose, CA
$120k - $140k
...ZEDEDA ZEDEDA unlocks the value of AI where it matters most, enabling enterprises... ...AI model lifecycle management with Kubernetes-based edge orchestration. Build and extend... ...networks, deep learning, model training and inference, and attention mechanisms (self-attention...
Permanent employment
Temporary work
Work at office
3 days per week
ZEDEDA
San Jose, CA
2 days ago
Staff Software Engineer, Managed Orchestration (Managed Kubernetes)
$220k - $250k
...As the only vertically integrated AI infrastructure company built from... ...instrumental in advancing our managed Kubernetes and AI training clusters, ensuring they lead the industry in reliability and... ...roadmap Work collaboratively with tech leads and engineers to create a...
Temporary work
Crusoe
Sunnyvale, CA
21 hours ago
Senior Software Engineer - AI Inference
$152k - $241.5k
NVIDIA is the platform upon which every new AI‑powered application is built. We are seeking a Senior Software Engineer - AI Inference to advance open‑source LLM serving by contributing directly to upstream inference engines like vLLM and SGLang-ensuring they run best‑in...
NVIDIA Gruppe
Santa Clara, CA
2 days ago
Principal Software Engineer - Tech Lead
$157k - $271.4k
...a Principal Software Engineer – Technical Lead within the Polyphonic® Product Engineering... ...platform easy to adopt and accelerate surgical AI. Build shared ML infrastructure with the... ..., Pipeline Orchestrator, and Training/Inference control planes. Design great developer experiences...
Johnson & Johnson
Santa Clara, CA
2 days ago
Kubernetes DevOps Engineer at Aranya.tech
...This is a job that Jill, our AI Recruiter, is recruiting for on behalf... ...speak to Jack. Job Title: Kubernetes DevOps Engineer Company Description: Aranya.tech - Seed-stage MIT-founded AI... ...making Kubernetes accessible for the inference era. You will architect and...
Jack and Jill AI
San Jose, CA
3 days ago
Principal Software Engineer - AI Inference
$272k - $431.25k
...NVIDIA is the platform for every new AI-powered application. We seek a Principal Software Engineer - AI Inference to advance open-source LLM serving. This role involves contributing to upstream inference engines like vLLM and SGLang. You will ensure they run outstandingly...
NVIDIA Gruppe
Santa Clara, CA
3 days ago
Senior Staff Software Engineer, Performance (Tech Lead) - Veza
$190.9k - $334.1k
.... The combination brings together Veza's AI-native Access Graph with ServiceNow's AI... ...Senior Staff Software Engineer, Performance (Tech Lead), you will own performance, scalability... ...scale SaaS platforms Familiarity with Kubernetes and observability tools Exposure to...
Work at office
Remote work
Flexible hours
ServiceNow
Santa Clara, CA
1 day ago
Tech Lead, AI Compute Infrastructure
...Tech Lead, AI Compute Infrastructure Los Angeles, Palo Alto, San Francisco, Toronto, Singapore... ...across thousands of devices for inference, training, data processing and large-scale... ...cloud and container technologies (Kubernetes, Ray) for elastic, cost-efficient scaling...
Full time
HeyGen
Palo Alto, CA
4 days ago
Tech Lead Software Engineer - AI Compute Infrastructure
$244.8k
...About the Team The Inference Infrastructure team is... ...maintainer of AIBrix, a Kubernetes-native control plane for... ...external developers to bring AI workloads from research... ...with great people. We lead with curiosity,... ...impact in a rapidly growing tech company. By constantly...
Temporary work
Local area
ByteDance
San Jose, CA
2 days ago
Software Engineer, Kubernetes Core Interfaces
$109k - $160k
...Software Engineer, Kubernetes Core Interfaces Livingston, NJ / New York, NY CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform... ...scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises...
Permanent employment
Temporary work
Casual work
Work at office
Flexible hours
CoreWeave
Sunnyvale, CA
1 day ago
Senior AI Inference Kernel Engineer
$184k - $287.5k
...NVIDIA Gruppe in Santa Clara is seeking an AI Systems Engineer to innovate and develop cutting-edge technologies in the AI inference software stack. Candidates should hold a Master's degree and possess over 6 years of experience in ML/DL systems development. The role involves...
NVIDIA Gruppe
Santa Clara, CA
4 days ago
Senior Staff Software Engineer - High Performance GPU Inference Systems
$248.71k - $292.6k
About Groq Groq delivers fast, efficient AI inference. Our LPU-based system powers GroqCloud™, giving businesses and developers the speed... .... Deploying and optimizing ML/HPC workloads on GPU clusters (Kubernetes, Slurm, Ray, etc.). Hands‑on experience with multi‑GPU...
I did my part and supported the Regular Toilet
Palo Alto, CA
2 days ago
Senior AI Inference Engineer - High-Performance LLM Serving
$152k - $241.5k
...NVIDIA Gruppe is seeking a Senior Software Engineer – AI Inference in Santa Clara, California. This role involves enhancing open-source LLM serving optimizations and implementing high-performance runtime capabilities. Candidates should have 5+ years of experience in building...
NVIDIA Gruppe
Santa Clara, CA
3 days ago
AI Inference Engineer
...Role: AI Inference Engineer Location: San Jose, CA Duration: 6 to 12 Months Overview: We are seeking a highly skilled AI... ...fault-tolerant, high-concurrency serving systems deployed on Kubernetes, OpenShift, Helm, or similar orchestration platforms Implement...
Triune Infomatics Inc
San Jose, CA
8 days ago
Senior Software Engineer - Cloud and Kubernetes
...perspective and a passion for scalable cloud infrastructure powered by Kubernetes, ConnectX, BlueField NICs, and GPUs. You’ll join a dynamic team... ...and develop scalable cloud solutions to accelerate HPC and AI workloads using NVIDIA’s advanced technologies (GPUs, DPUs,...
NVIDIA Gruppe
Santa Clara, CA
3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Kubernetes AI Inference Tech Lead. Be the first to apply!