Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI Inference Infrastructure Software Engineer (Kubernetes / Cloud)

ElastixAI INC.

Location: Seattle, WA (Hybrid - 3 days/week in office) About ElastixAI ElastixAI is an early-stage Software startup on a mission to reinvent AI inference infrastructure from the ground up. We're building a next-generation inference platform that delivers unprecedented efficiency by tightly integrating machine learning, software stack, and custom hardware. Our philosophy is simple: the best performance comes from holistic co-design, where every layer, from model architecture to kernels to silicon, works in harmony. If you're excited about pushing AI performance to physical limits and shaping the future of large-scale inference, we'd love to meet you. Role Summary We're looking for an Inference Infrastructure Software Engineer to own and evolve the cloud and Kubernetes backbone behind our Token-as-a-Service platform. You'll be the connective tissue between our inference engine and the production environments where customers actually consume tokens - making sure our accelerated workloads run reliably, scale predictably, and deploy seamlessly across managed and self-hosted clusters. This is a hands-on role with broad surface area. You'll touch everything from cluster bring-up, automating the software releases, and AI Accelerator scheduling to service reliability and cost optimization, working closely with our ML, runtime, and hardware teams to expose the full performance of our co-designed stack to end users. Key Responsibilities Build, operate, and evolve ElastixAI's Kubernetes infrastructure powering our Token-as-a-Service capability. Run accelerated inference workloads in production at scale, with strong SLAs around latency, throughput, and availability. Manage and harden our AWS, GCP, and on-prem infrastructure, including networking, storage, IAM, and observability layers tied to our services. Develop tooling and automation in Python, Bash, Rust, and Go to streamline deployments, rollouts, autoscaling, and incident response. Partner with the ML and runtime teams to productionize new inference capabilities, model deployments, and routing strategies. Contribute to capacity planning, cost optimization, and reliability engineering across multi-cloud and self-hosted environments. Help define the platform roadmap as we scale from early customers to broad production deployments. Be a member of the Elastix On-Call rotation Required Qualifications Minimum BS in Computer Science, Software Engineering, or a related field. 3-5 years of hands-on Kubernetes experience, including EKS, GKE, and/or self-hosted clusters. 2-3 years of production experience operating workloads on AWS or GCP. Proven track record running ML or inference services at scale on Kubernetes in production. Strong experience running accelerated workloads in Kubernetes, scheduling, drivers, device plugins, MIG, networking, and storage considerations. Solid coding skills in Python, Bash and proficiency in Go Proficient in configuring and leveraging Linux OS in production Experience with infrastructure-as-code (Terraform, Pulumi), OS configuration state (Ansible, Puppet, Salt) and GitOps workflows (Argo CD, Flux). Experience in OS configuration tooling. Familiarity with AI inference and/or training workflows and the operational patterns around them. Pragmatic, ownership-oriented mindset; comfortable operating in early-stage ambiguity and shipping iteratively. Preferred/Bonus Qualifications MS/PhD in Computer Science, Software Engineering, or a related field. Experience with inference servers and runtimes (e.g., Triton, vLLM, TGI) and model-serving patterns (batching, streaming, KV-cache aware routing). Exposure to heterogeneous accelerators beyond GPUs (FPGAs, custom ASICs). Background in observability, SRE, or performance engineering for latency-sensitive services. Experience building customer facing API platforms including onboarding, API keys/auth management, and usage metering. What We Offer A chance to be a foundational engineer in an innovative AI startup. A dynamic and collaborative work environment and the change to have a significant impact on new technology The opportunity to work on challenging problems at the intersection of ML, software, and systems. Competitive compensation and startup equity package Comprehensive medical, dental, and vision coverage (premiums 100% paid by employer) Flexible Time Off (FTO) Paid parental leave Gym or fitness benefit Commuter benefit Investment in employee learning & development #J-18808-Ljbffr

Vacancy posted 11 hours ago
Similar jobs that could be interesting for youBased on the AI Inference Infrastructure Software Engineer (Kubernetes / Cloud) in Seattle, WA vacancy
  •  ...ElastixAI INC. in Seattle seeks an Inference Infrastructure Software Engineer to manage the cloud and Kubernetes backbone behind their Token-as-a-Service platform. The ideal...  ...the opportunity to work at the forefront of AI technology in a collaborative environment. #J-1... 
    Software

    ElastixAI INC.

    Seattle, WA
    11 hours ago
  • A leading AI-driven technology company in Seattle is...  ...seeking a Senior or Staff Software Engineer for the ML Infrastructure team. The role involves designing...  ...scale model training and inference, focusing on reliability...  ...distributed systems, Kubernetes, and machine learning... 
    Software

    Salesforce, Inc.

    Seattle, WA
    2 days ago
  • $148.2k - $300.96k

     ...About the Team The Inference Infrastructure team is the...  ...maintainer of AIBrix, a Kubernetes-native control...  ...computing across multi-cloud and global...  ...developers to bring AI workloads from...  ...are looking for engineers passionate about...  ...a PhD degree in Software Development, Computer... 
    Software
    Temporary work
    Local area

    ByteDance

    Seattle, WA
    2 days ago
  • $202.16k - $368.22k

     ...Team The Compute Infrastructure - Orchestration &...  ...Scheduling team uses Kubernetes and Serverless...  ...daily, including AI and LLM workloads...  ...on K8s, and LLM inference control plan project...  ...seeking talented software engineers excited to...  ...in large-scale, cloud-native environments... 
    Software
    Temporary work
    Internship
    Local area
    Overseas

    ByteDance

    Seattle, WA
    2 days ago
  • $184.94k - $342.49k

     ...vLLM and LLM-D Engineering team at Red...  ...not just build software; you will be the...  ...cutting-edge inference platform (LLM-...  ...solve "last mile" infrastructure challenges...  ...throughput on complex Kubernetes clusters. This...  ...CNI failures. AI Inference...  ...deployments. Cloud & GPU Hardware... 
    Software
    Permanent employment
    Full time
    Work experience placement
    Work at office
    Remote work
    Flexible hours

    Red Hat

    Seattle, WA
    1 day ago
  •  ...Salesforce is the #1 AI CRM, where...  ...The AI and ML Infrastructure team is part...  ..., deployment, inference, and...  ...deployment using Kubernetes based platforms...  ...across multiple cloud providers acting...  ...infrastructure and product engineering teams.## About...  ...or Staff Software Engineer to... 
    Software
    Temporary work

    Slack Enterprise

    Seattle, WA
    10 hours ago
  •  ...Software Engineer Role at Salesforce Salesforce is the #1 AI CRM, where humans with agents drive...  ...The AI and ML Infrastructure team is part of...  ...training, deployment, inference, and monitoring....  ...and maintain Kubernetes based platforms...  ...and operating cloud native systems on... 
    Software
    Temporary work

    Slack

    Seattle, WA
    3 days ago
  • $207k - $300k

     ...learning infra (e.g., GPU, Cloud TPU, etc.). About the job...  ...own ambitions, the work of a Software Engineer (SWE) goes way beyond just...  ...next-generation of on‑prem AI infrastructure to bring the best of Google...  ...scalable TPU orchestration with Kubernetes. Our goal is to enable... 
    Software
    Full time
    Temporary work

    Google Inc.

    Seattle, WA
    2 days ago
  •  ...and steerable AI systems. We want...  ...researchers, engineers, policy...  ...Role Anthropic's Infrastructure organization is...  ...across all major cloud providers and...  ...internal research, inference and product...  ...(e.g., Kubernetes, IaC, AWS/GCP/...  ...Qualifications 8+ years of software engineering... 
    Software
    Visa sponsorship

    Menlo Ventures

    Seattle, WA
    11 hours ago
  • $148.5k - $313.7k

    100 Salesforce, Inc. is seeking a Software Engineer for ML Infrastructure to design and operate core systems that power AI at Slack. Candidates should have significant experience in software engineering, particularly in infrastructure and distributed systems, as well as... 
    Software

    100 Salesforce, Inc.

    Seattle, WA
    11 hours ago
  • $148.2k - $300.96k

     ...Introduction The Compute Infrastructure team uses Kubernetes and Serverless technologies...  ...jobs daily, including AI and LLM workloads. The team...  .... We're seeking talented software engineers excited to optimize our...  ...of machines across multi-clouds globally hosting hundreds... 
    Software
    Temporary work
    Internship
    Local area
    Overseas

    ByteDance

    Seattle, WA
    2 days ago
  • $405k

     ...interpretable, and steerable AI systems. We want...  ...researchers, engineers, policy experts,...  ...seeking a Staff Software Engineer to build...  ...Claude on third‑party cloud service provider (...  ...and the Cloud Inference team: taking classifiers...  ...a CSP partner’s infrastructure at serving‑path... 
    Software
    Work at office
    Visa sponsorship
    Flexible hours

    Menlo Ventures

    Seattle, WA
    10 hours ago
  • Paradigm is a software company transforming...  ..., ML Ops Infrastructure to be part of...  ...team of ML Ops engineers focused on...  ...frameworks for AI/ML systems including...  ...and operate Kubernetes-based...  ...training, real‑time inference, LLM serving,...  ...working with Azure cloud services such... 
    Software

    Paradigm

    Seattle, WA
    10 hours ago
  • $197.3k - $313.7k

     ...Job Category Software Engineering Job Details About Salesforce Salesforce is the #1 AI CRM, where humans with agents drive customer...  ...a Software Engineer on the Cloud Infrastructure Automation team at...  ...containerization frameworks such as Kubernetes and Docker. Build and ship... 
    Software

    Centaur Labs

    Bellevue, WA
    1 day ago
  • $148.5k - $313.7k

     ...Job Category Software Engineering Job Details About...  ...Salesforce is the #1 AI CRM, where humans with...  ...building foundational infrastructure that enables great customer...  ...- highly scalable cloud infrastructure, robust...  ...technologies such as Kubernetes, AWS ECS, or Docker.... 
    Software
    Work experience placement

    Salesforce.Com Inc

    Seattle, WA
    5 days ago
  •  ...Senior Platform/DevOps Engineer (Kubernetes-Linux) Bellevue Office, Sunset...  ...edge, delivering modular AI infrastructure from first deployment to AI...  ...mobile data centers and Atlas cloud integration. This is a...  ...Work in collaboration with software engineering, DevOps, security... 
    Software
    Work at office
    Local area
    Flexible hours

    Armada

    Bellevue, WA
    3 days ago
  •  ...Introduction At IBM Software, we transform client...  ...the world’s leading AI-powered, cloud-native products that...  ...the foudantional infrastructure layer that powers Confluent...  ...As a Staff Software Engineer on the Secure...  ...platform is built on Kubernetes and runs across a... 
    Software
    Work experience placement

    IBM

    Bellevue, WA
    5 days ago
  • $210k - $250k

     ...Gable.ai is looking for a Staff Software Engineer, Infrastructure, to take ownership of DevOps and infrastructure while also contributing as a software engineer....  ...of $210K - $250K, and involves working with CI/CD, cloud infrastructure, and backend systems. You will be part... 
    Software

    Gable

    Seattle, WA
    11 hours ago
  • $197k - $291k

    Software Engineer, Cloud HPC and Accelerator Networking Google Seattle...  ...large-scale infrastructure, distributed systems...  ...compute platforms (Kubernetes, Cloud Functions)....  ...frontier models, run inference, and otherwise advance...  ...in next generation AI computing. Google Cloud... 
    Software
    Full time
    Temporary work

    Google Inc.

    Seattle, WA
    3 days ago
  • $157k - $213.8k

     ...and running the world's best data and AI infrastructure platform so our customers can use...  ...products to build connectivity solutions to cloud resources and prevent data...  ...exfiltration. We are seeking experienced Senior Software Engineers with large-scale distributed system... 
    Software
    Local area
    Worldwide

    Menlo Ventures

    Bellevue, WA
    11 hours ago
  • Ll Oefentherapie in Seattle seeks a Senior Software Engineer to lead the design and development of innovative solutions for cloud infrastructure. This role emphasizes AI and ML technologies, aiming to enhance automation and efficiency. As part of the engineering division... 
    Software

    Ll Oefentherapie

    Seattle, WA
    4 days ago
  • $157.6k - $197k

     ...Senior Security Engineer - Infrastructure Bellevue Office, Sunset...  ..., delivering modular AI infrastructure from first...  ...for securing our cloud and edge computing environments...  ...(AWS, Azure, GCP), Kubernetes environments, and...  ...throughout the software development lifecycle... 
    Software
    Work at office
    Flexible hours

    Armada

    Bellevue, WA
    3 days ago
  • $151.8k

     ...seeking an experienced AI Infrastructure Engineer to join our AI...  ...environments using Docker and Kubernetes Optimizing CUDA...  ...Developing platform software to support AI/ML...  ...training and inference pipelines What we...  ...distributed systems and cloud computing ~ Demonstrate... 
    Software
    Work at office
    Remote work

    Zoom Corporation

    Seattle, WA
    3 days ago
  • $207k - $300k

    Staff Software Engineer, AI/ML, Cloud Identity and Access Management Infrastructure Kirkland, WA, USA; Seattle, WA, USA. Benefits for this role include: Health, dental, vision, life, disability insurance Retirement Benefits: 401(k) with company match Paid Time Off:... 
    Software
    Full time
    Temporary work
    Shift work

    Google Inc.

    Seattle, WA
    4 days ago
  • $216k - $270k

     ...As a Software Engineer on the Machine Learning Infrastructure team, you will build the "Operating System...  ...compute into breakthrough AI. You will:...  ...Expert-level knowledge of Kubernetes internals (Custom Resources...  ...hardware. ~ Familiarity with cloud infrastructure (AWS, GCP... 
    Software
    Full time

    Scale AI

    Seattle, WA
    7 days ago
  •  ...B Capital is seeking Software Engineers to join the ML Infrastructure team. In this role, you will design and operate...  ...learning model training and inference. Candidates need significant experience...  ...distributed technologies like Kubernetes. You will work on evolving GPU infrastructure... 
    Software

    B Capital

    Seattle, WA
    11 hours ago
  •  ...combination of proprietary infrastructure and software, we empower over 200,...  ...end-to-end. You use AI to work smarter and...  ...the company’s global engineering organization. We build and operate the cloud infrastructure,...  ...AWS, GCP, or Aliyun), Kubernetes, Terraform, CI/CD pipelines... 
    Software
    Worldwide

    Airwallex

    Seattle, WA
    10 hours ago
  • $207k - $300k

    A leading technology company is seeking a Staff Software Engineer in AI/ML for its Cloud Identity and Access Management team. This role requires extensive...  ..., and cloud computing, focusing on building scalable infrastructure systems. The position offers a competitive salary... 
    Software

    Google Inc.

    Seattle, WA
    4 days ago
  •  ...Responsibilities Kubernetes container platform ....  ...systems like VMware , Infrastructure and Network knowledge. optional(Cloud ( AWS / Azure / GCP)...  ...overall solution # Helps Engineers troubleshoot issues that...  ...troubleshoot CI/CD pipelines and software # Able to automate... 
    Software

    InterSources

    Bellevue, WA
    3 days ago
  • brobstongroup.com - Jobboard is seeking a Software Engineer 2 for the Data Platform Enablement team based in Seattle. This hybrid...  .... The successful candidate will focus on automation, cloud infrastructure, and Kubernetes to improve platform reliability. Key responsibilities... 
    Software

    brobstongroup.com - Jobboard

    Seattle, WA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI Inference Infrastructure Software Engineer (Kubernetes / Cloud). Be the first to apply!