Kubernetes AI Inference Tech Lead
Advanced Micro Devices, Inc.
Advanced Micro Devices in Santa Clara, California, seeks a strategic software engineering lead. The role entails developing techniques for optimizing key applications, particularly large-scale inference within the Kubernetes (K8s) ecosystem. Successful candidates should have leadership skills, effective communication abilities, and a strong background in software engineering. The position requires a Bachelor's or Master's degree in Computer Science or a related field. Benefits details are available under AMD benefits at a glance.
- ...infrastructure for high-performance, low-latency inference services. Applicants should have a... .... The position involves deploying Kubernetes services, optimizing resource allocation... ...The environment supports growth and diversity in tech. Cerebras Systems
- ...Tech Lead, Data & Inference Engineer San Jose, California, United States About the Job Tech Lead,... ...with a specialized vertical in Applied AI, Machine Learning, and Data Science. We... ..., and observability. ~ Exposure to Kubernetes and cloud infrastructure (AWS, GCP, or... Full time
$152k - $204k
...Senior Software Engineer, Inference Sunnyvale, CA / Bellevue, WA... ...CoreWeave is The Essential Cloud for AI™. Built for pioneers by... ...with confidence. Trusted by leading AI labs, startups, and global... ...hardware teams to evolve our Kubernetes-native inference platform and...SuggestedPermanent employmentTemporary workCasual workWork at officeFlexible hoursShift work$139k - $204k
...Senior Software Engineer I, Inference Sunnyvale, CA / Bellevue,... ...CoreWeave is The Essential Cloud for AI™. Built for pioneers by... ...with confidence. Trusted by leading AI labs, startups, and global... ...hardware teams to evolve our Kubernetes-native inference platform and...SuggestedPermanent employmentTemporary workCasual workWork at officeRemote workFlexible hoursShift work$188k - $275k
...Staff Software Engineer, Inference CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave... ...AI with confidence. Trusted by leading AI labs, startups, and global enterprises... ...builds and operates CoreWeave's Kubernetes-native inference platform,...SuggestedPermanent employmentTemporary workCasual workWork at officeFlexible hours$224k - $356.5k
...NVIDIA Gruppe in Santa Clara is seeking a Technical Lead Manager to lead the AIPerf engineering team. In this role, you will be responsible... ...the AIPerf platform as the leading benchmarking tool in AI performance measurement. Candidates should have over 8 years of software...$92k - $135k
...CoreWeave is The Essential Cloud for AI™. Built for pioneers by... ...with confidence. Trusted by leading AI labs, startups, and global... ...What You'll Do: Join the Inference team to ship production features... ...Exposure to containers and Kubernetes (coursework or projects welcome...Permanent employmentTemporary workCasual workInternshipWork at officeFlexible hours$184k - $287.5k
...software engineers to join us and build AI inference systems that serve large-scale models... ...and NVIDIA’s submissions to the industry‑leading MLPerf Inference benchmarking suite.... ...containers and orchestration (Docker, Kubernetes, Slurm); familiarity with Linux namespaces...$188k - $275k
...CoreWeave is The Essential Cloud for AI™. Built for pioneers by... ...with confidence. Trusted by leading AI labs, startups, and global... ...at What You'll Do: Inference Platform Team The Inference... ...builds and operates CoreWeave's Kubernetes-native inference platform,...Permanent employmentTemporary workCasual workWork at officeFlexible hours- ...Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our... ...allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning... ...systems architecture ideally with kubernetes. ~ Strong track record of making...
$230k - $250k
...involves designing resilient software features for cloud-based AI inference, leveraging AWS tools and services. Candidates should have a... ...and experience with containerization tools like Docker and Kubernetes. A competitive salary between $230,000 and $250,000 is offered...- ...generation computing experiences-from AI and data centers, to PCs,... ...Key Responsibilities: • Lead technical initiatives and provide... ...accelerate LLM training and inference on AMD GPUs, improving kernel,... ...or inference platforms using Kubernetes, Ray, or Kubeflow. • Familiarity...
- ...products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded... ...models, and enabling RL training and SOTA LLM and Multimodal inference at scale across multi-GPU and multi-node systems. You will collaborate...
- ...products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded... ...your career. THE ROLE: As a senior member of the LLM inference framework team, you will be responsible for building and optimizing...
$153k - $204k
...Software Engineer, Kubernetes Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers... ...scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises...Permanent employmentTemporary workCasual workWork at officeFlexible hours$120k - $140k
...ZEDEDA ZEDEDA unlocks the value of AI where it matters most, enabling enterprises... ...AI model lifecycle management with Kubernetes-based edge orchestration. Build and extend... ...networks, deep learning, model training and inference, and attention mechanisms (self-attention...Permanent employmentTemporary workWork at office3 days per week$220k - $250k
...As the only vertically integrated AI infrastructure company built from... ...instrumental in advancing our managed Kubernetes and AI training clusters, ensuring they lead the industry in reliability and... ...roadmap Work collaboratively with tech leads and engineers to create a...Temporary work$152k - $241.5k
NVIDIA is the platform upon which every new AI‑powered application is built. We are seeking a Senior Software Engineer - AI Inference to advance open‑source LLM serving by contributing directly to upstream inference engines like vLLM and SGLang, ensuring they run best‑in... $157k - $271.4k
...a Principal Software Engineer – Technical Lead within the Polyphonic® Product Engineering... ...platform easy to adopt and accelerate surgical AI. Build shared ML infrastructure with the... ..., Pipeline Orchestrator, and Training/Inference control planes. Design great developer experiences...- ...This is a job that Jill, our AI Recruiter, is recruiting for on behalf... ...speak to Jack. Job Title: Kubernetes DevOps Engineer Company Description: Aranya.tech - Seed-stage MIT-founded AI... ...making Kubernetes accessible for the inference era. You will architect and...
$272k - $431.25k
...NVIDIA is the platform for every new AI-powered application. We seek a Principal Software Engineer - AI Inference to advance open-source LLM serving. This role involves contributing to upstream inference engines like vLLM and SGLang. You will ensure they run outstandingly...$190.9k - $334.1k
.... The combination brings together Veza's AI-native Access Graph with ServiceNow's AI... ...Senior Staff Software Engineer, Performance (Tech Lead), you will own performance, scalability... ...scale SaaS platforms Familiarity with Kubernetes and observability tools Exposure to...Work at officeRemote workFlexible hours- ...Tech Lead, AI Compute Infrastructure Los Angeles, Palo Alto, San Francisco, Toronto, Singapore... ...across thousands of devices for inference, training, data processing and large-scale... ...cloud and container technologies (Kubernetes, Ray) for elastic, cost-efficient scaling...Full time
$244.8k
...About the Team The Inference Infrastructure team is... ...maintainer of AIBrix, a Kubernetes-native control plane for... ...external developers to bring AI workloads from research... ...with great people. We lead with curiosity,... ...impact in a rapidly growing tech company. By constantly...Temporary workLocal area$109k - $160k
...Software Engineer, Kubernetes Core Interfaces Livingston, NJ / New York, NY CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform... ...scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises...Permanent employmentTemporary workCasual workWork at officeFlexible hours$184k - $287.5k
...NVIDIA Gruppe in Santa Clara is seeking an AI Systems Engineer to innovate and develop cutting-edge technologies in the AI inference software stack. Candidates should hold a Master's degree and possess over 6 years of experience in ML/DL systems development. The role involves...$248.71k - $292.6k
About Groq Groq delivers fast, efficient AI inference. Our LPU-based system powers GroqCloud™, giving businesses and developers the speed... .... Deploying and optimizing ML/HPC workloads on GPU clusters (Kubernetes, Slurm, Ray, etc.). Hands‑on experience with multi‑GPU...$152k - $241.5k
...NVIDIA Gruppe is seeking a Senior Software Engineer – AI Inference in Santa Clara, California. This role involves enhancing open-source LLM serving optimizations and implementing high-performance runtime capabilities. Candidates should have 5+ years of experience in building...- ...Role: AI Inference Engineer Location: San Jose, CA Duration: 6 to 12 Months Overview: We are seeking a highly skilled AI... ...fault-tolerant, high-concurrency serving systems deployed on Kubernetes, OpenShift, Helm, or similar orchestration platforms Implement...
- ...perspective and a passion for scalable cloud infrastructure powered by Kubernetes, ConnectX, BlueField NICs, and GPUs. You’ll join a dynamic team... ...and develop scalable cloud solutions to accelerate HPC and AI workloads using NVIDIA’s advanced technologies (GPUs, DPUs,...


