
Kubernetes AI Inference Tech Lead

Advanced Micro Devices, Inc.

Advanced Micro Devices in Santa Clara, California, seeks a strategic software engineering lead. This role entails developing techniques for optimizing key applications, particularly for large-scale inference within the K8s ecosystem. Successful candidates should possess leadership skills, effective communication abilities, and a strong background in software engineering. The position requires a Bachelor's or Master's degree in Computer Science or related fields. Benefits details are available under AMD benefits at a glance.

Vacancy posted 3 days ago
Similar jobs that could be interesting for you
Based on the Kubernetes AI Inference Tech Lead in Santa Clara, CA vacancy
  •  ...infrastructure for high-performance, low-latency inference services. Applicants should have a...  .... The position involves deploying Kubernetes services, optimizing resource allocation...  ...The environment supports growth and diversity in tech.
    Suggested

    Cerebras Systems

    Sunnyvale, CA
    1 day ago
  •  ...Tech Lead, Data & Inference Engineer San Jose, California, United States About the Job Tech Lead,...  ...with a specialized vertical in Applied AI, Machine Learning, and Data Science. We...  ..., and observability. ~ Exposure to Kubernetes and cloud infrastructure (AWS, GCP, or... 
    Suggested
    Full time

    Catalyst Labs, LLC

    San Jose, CA
    21 hours ago
  • $152k - $204k

     ...Senior Software Engineer, Inference Sunnyvale, CA / Bellevue, WA...  ...CoreWeave is The Essential Cloud for AI™. Built for pioneers by...  ...with confidence. Trusted by leading AI labs, startups, and global...  ...hardware teams to evolve our Kubernetes-native inference platform and... 
    Suggested
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours
    Shift work

    CoreWeave

    Sunnyvale, CA
    1 day ago
  • $139k - $204k

     ...Senior Software Engineer I, Inference Sunnyvale, CA / Bellevue,...  ...CoreWeave is The Essential Cloud for AI™. Built for pioneers by...  ...with confidence. Trusted by leading AI labs, startups, and global...  ...hardware teams to evolve our Kubernetes-native inference platform and... 
    Suggested
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Remote work
    Flexible hours
    Shift work

    CoreWeave

    Sunnyvale, CA
    1 day ago
  • $188k - $275k

     ...Staff Software Engineer, Inference CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave...  ...AI with confidence. Trusted by leading AI labs, startups, and global enterprises...  ...builds and operates CoreWeave's Kubernetes-native inference platform,... 
    Suggested
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    2 days ago
  • $224k - $356.5k

     ...NVIDIA Gruppe in Santa Clara is seeking a Technical Lead Manager to lead the AIPerf engineering team. In this role, you will be responsible...  ...the AIPerf platform as the leading benchmarking tool in AI performance measurement. Candidates should have over 8 years of software... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $92k - $135k

     ...CoreWeave is The Essential Cloud for AI™. Built for pioneers by...  ...with confidence. Trusted by leading AI labs, startups, and global...  ...What You'll Do: Join the Inference team to ship production features...  ...Exposure to containers and Kubernetes (coursework or projects welcome... 
    Permanent employment
    Temporary work
    Casual work
    Internship
    Work at office
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    1 day ago
  • $184k - $287.5k

     ...software engineers to join us and build AI inference systems that serve large-scale models...  ...and NVIDIA’s submissions to the industry‑leading MLPerf Inference benchmarking suite....  ...containers and orchestration (Docker, Kubernetes, Slurm); familiarity with Linux namespaces... 

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $188k - $275k

     ...CoreWeave is The Essential Cloud for AI™. Built for pioneers by...  ...with confidence. Trusted by leading AI labs, startups, and global...  ...at What You'll Do: Inference Platform Team The Inference...  ...builds and operates CoreWeave's Kubernetes-native inference platform,... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    16 days ago
  •  ...Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our...  ...allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning...  ...systems architecture ideally with kubernetes. ~ Strong track record of making... 

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    2 days ago
  • $230k - $250k

     ...involves designing resilient software features for cloud-based AI inference, leveraging AWS tools and services. Candidates should have a...  ...and experience with containerization tools like Docker and Kubernetes. A competitive salary between $230,000 and $250,000 is offered... 

    Cerebras Systems

    Sunnyvale, CA
    1 day ago
  •  ...generation computing experiences-from AI and data centers, to PCs,...  ...Key Responsibilities: • Lead technical initiatives and provide...  ...accelerate LLM training and inference on AMD GPUs, improving kernel,...  ...or inference platforms using Kubernetes, Ray, or Kubeflow. • Familiarity... 

    Advanced Micro Devices, Inc.

    Santa Clara, CA
    3 days ago
  •  ...products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded...  ...models, and enabling RL training and SOTA LLM and Multimodal inference at scale across multi-GPU and multi-node systems. You will collaborate... 

    Advanced Micro Devices, Inc.

    Santa Clara, CA
    21 hours ago
  •  ...products that accelerate next-generation computing experiences-from AI and data centers, to PCs, gaming and embedded systems. Grounded...  ...your career. THE ROLE: As a senior member of the LLM inference framework team, you will be responsible for building and optimizing... 

    Advanced Micro Devices, Inc.

    Santa Clara, CA
    21 hours ago
  • $153k - $204k

     ...Software Engineer, Kubernetes Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers...  ...scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    1 day ago
  • $120k - $140k

     ...ZEDEDA ZEDEDA unlocks the value of AI where it matters most, enabling enterprises...  ...AI model lifecycle management with Kubernetes-based edge orchestration. Build and extend...  ...networks, deep learning, model training and inference, and attention mechanisms (self-attention... 
    Permanent employment
    Temporary work
    Work at office
    3 days per week

    ZEDEDA

    San Jose, CA
    2 days ago
  • $220k - $250k

     ...As the only vertically integrated AI infrastructure company built from...  ...instrumental in advancing our managed Kubernetes and AI training clusters, ensuring they lead the industry in reliability and...  ...roadmap Work collaboratively with tech leads and engineers to create a... 
    Temporary work

    Crusoe

    Sunnyvale, CA
    21 hours ago
  • $152k - $241.5k

    NVIDIA is the platform upon which every new AI‑powered application is built. We are seeking a Senior Software Engineer - AI Inference to advance open‑source LLM serving by contributing directly to upstream inference engines like vLLM and SGLang-ensuring they run best‑in... 

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $157k - $271.4k

     ...a Principal Software Engineer – Technical Lead within the Polyphonic® Product Engineering...  ...platform easy to adopt and accelerate surgical AI. Build shared ML infrastructure with the...  ..., Pipeline Orchestrator, and Training/Inference control planes. Design great developer experiences... 

    Johnson & Johnson

    Santa Clara, CA
    2 days ago
  •  ...This is a job that Jill, our AI Recruiter, is recruiting for on behalf...  ...speak to Jack. Job Title: Kubernetes DevOps Engineer Company Description: Aranya.tech - Seed-stage MIT-founded AI...  ...making Kubernetes accessible for the inference era. You will architect and... 

    Jack and Jill AI

    San Jose, CA
    3 days ago
  • $272k - $431.25k

     ...NVIDIA is the platform for every new AI-powered application. We seek a Principal Software Engineer - AI Inference to advance open-source LLM serving. This role involves contributing to upstream inference engines like vLLM and SGLang. You will ensure they run outstandingly... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $190.9k - $334.1k

     .... The combination brings together Veza's AI-native Access Graph with ServiceNow's AI...  ...Senior Staff Software Engineer, Performance (Tech Lead), you will own performance, scalability...  ...scale SaaS platforms Familiarity with Kubernetes and observability tools Exposure to... 
    Work at office
    Remote work
    Flexible hours

    ServiceNow

    Santa Clara, CA
    1 day ago
  •  ...Tech Lead, AI Compute Infrastructure Los Angeles, Palo Alto, San Francisco, Toronto, Singapore...  ...across thousands of devices for inference, training, data processing and large-scale...  ...cloud and container technologies (Kubernetes, Ray) for elastic, cost-efficient scaling... 
    Full time

    HeyGen

    Palo Alto, CA
    4 days ago
  • $244.8k

     ...About the Team The Inference Infrastructure team is...  ...maintainer of AIBrix, a Kubernetes-native control plane for...  ...external developers to bring AI workloads from research...  ...with great people. We lead with curiosity,...  ...impact in a rapidly growing tech company. By constantly... 
    Temporary work
    Local area

    ByteDance

    San Jose, CA
    2 days ago
  • $109k - $160k

     ...Software Engineer, Kubernetes Core Interfaces Livingston, NJ / New York, NY CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform...  ...scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises... 
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    1 day ago
  • $184k - $287.5k

     ...NVIDIA Gruppe in Santa Clara is seeking an AI Systems Engineer to innovate and develop cutting-edge technologies in the AI inference software stack. Candidates should hold a Master's degree and possess over 6 years of experience in ML/DL systems development. The role involves... 

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $248.71k - $292.6k

    About Groq Groq delivers fast, efficient AI inference. Our LPU-based system powers GroqCloud™, giving businesses and developers the speed...  .... Deploying and optimizing ML/HPC workloads on GPU clusters (Kubernetes, Slurm, Ray, etc.). Hands‑on experience with multi‑GPU... 

    Groq

    Palo Alto, CA
    2 days ago
  • $152k - $241.5k

     ...NVIDIA Gruppe is seeking a Senior Software Engineer – AI Inference in Santa Clara, California. This role involves enhancing open-source LLM serving optimizations and implementing high-performance runtime capabilities. Candidates should have 5+ years of experience in building... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  •  ...Role: AI Inference Engineer Location: San Jose, CA Duration: 6 to 12 Months Overview: We are seeking a highly skilled AI...  ...fault-tolerant, high-concurrency serving systems deployed on Kubernetes, OpenShift, Helm, or similar orchestration platforms Implement... 

    Triune Infomatics Inc

    San Jose, CA
    8 days ago
  •  ...perspective and a passion for scalable cloud infrastructure powered by Kubernetes, ConnectX, BlueField NICs, and GPUs. You’ll join a dynamic team...  ...and develop scalable cloud solutions to accelerate HPC and AI workloads using NVIDIA’s advanced technologies (GPUs, DPUs,... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
