AI Inference Infrastructure Software Engineer (Kubernetes / Cloud)
ElastixAI INC.
Location: Seattle, WA (Hybrid - 3 days/week in office) About ElastixAI ElastixAI is an early-stage Software startup on a mission to reinvent AI inference infrastructure from the ground up. We're building a next-generation inference platform that delivers unprecedented efficiency by tightly integrating machine learning, software stack, and custom hardware. Our philosophy is simple: the best performance comes from holistic co-design, where every layer, from model architecture to kernels to silicon, works in harmony. If you're excited about pushing AI performance to physical limits and shaping the future of large-scale inference, we'd love to meet you. Role Summary We're looking for an Inference Infrastructure Software Engineer to own and evolve the cloud and Kubernetes backbone behind our Token-as-a-Service platform. You'll be the connective tissue between our inference engine and the production environments where customers actually consume tokens - making sure our accelerated workloads run reliably, scale predictably, and deploy seamlessly across managed and self-hosted clusters. This is a hands-on role with broad surface area. You'll touch everything from cluster bring-up, automating the software releases, and AI Accelerator scheduling to service reliability and cost optimization, working closely with our ML, runtime, and hardware teams to expose the full performance of our co-designed stack to end users. Key Responsibilities Build, operate, and evolve ElastixAI's Kubernetes infrastructure powering our Token-as-a-Service capability. Run accelerated inference workloads in production at scale, with strong SLAs around latency, throughput, and availability. Manage and harden our AWS, GCP, and on-prem infrastructure, including networking, storage, IAM, and observability layers tied to our services. Develop tooling and automation in Python, Bash, Rust, and Go to streamline deployments, rollouts, autoscaling, and incident response. Partner with the ML and runtime teams to productionize new inference capabilities, model deployments, and routing strategies. Contribute to capacity planning, cost optimization, and reliability engineering across multi-cloud and self-hosted environments. Help define the platform roadmap as we scale from early customers to broad production deployments. Be a member of the Elastix On-Call rotation Required Qualifications Minimum BS in Computer Science, Software Engineering, or a related field. 3-5 years of hands-on Kubernetes experience, including EKS, GKE, and/or self-hosted clusters. 2-3 years of production experience operating workloads on AWS or GCP. Proven track record running ML or inference services at scale on Kubernetes in production. Strong experience running accelerated workloads in Kubernetes, scheduling, drivers, device plugins, MIG, networking, and storage considerations. Solid coding skills in Python, Bash and proficiency in Go Proficient in configuring and leveraging Linux OS in production Experience with infrastructure-as-code (Terraform, Pulumi), OS configuration state (Ansible, Puppet, Salt) and GitOps workflows (Argo CD, Flux). Experience in OS configuration tooling. Familiarity with AI inference and/or training workflows and the operational patterns around them. Pragmatic, ownership-oriented mindset; comfortable operating in early-stage ambiguity and shipping iteratively. Preferred/Bonus Qualifications MS/PhD in Computer Science, Software Engineering, or a related field. Experience with inference servers and runtimes (e.g., Triton, vLLM, TGI) and model-serving patterns (batching, streaming, KV-cache aware routing). Exposure to heterogeneous accelerators beyond GPUs (FPGAs, custom ASICs). Background in observability, SRE, or performance engineering for latency-sensitive services. Experience building customer facing API platforms including onboarding, API keys/auth management, and usage metering. What We Offer A chance to be a foundational engineer in an innovative AI startup. A dynamic and collaborative work environment and the change to have a significant impact on new technology The opportunity to work on challenging problems at the intersection of ML, software, and systems. Competitive compensation and startup equity package Comprehensive medical, dental, and vision coverage (premiums 100% paid by employer) Flexible Time Off (FTO) Paid parental leave Gym or fitness benefit Commuter benefit Investment in employee learning & development #J-18808-Ljbffr
- ...ElastixAI INC. in Seattle seeks an Inference Infrastructure Software Engineer to manage the cloud and Kubernetes backbone behind their Token-as-a-Service platform. The ideal... ...the opportunity to work at the forefront of AI technology in a collaborative environment. #J-1...Software
- A leading AI-driven technology company in Seattle is... ...seeking a Senior or Staff Software Engineer for the ML Infrastructure team. The role involves designing... ...scale model training and inference, focusing on reliability... ...distributed systems, Kubernetes, and machine learning...Software
$148.2k - $300.96k
...About the Team The Inference Infrastructure team is the... ...maintainer of AIBrix, a Kubernetes-native control... ...computing across multi-cloud and global... ...developers to bring AI workloads from... ...are looking for engineers passionate about... ...a PhD degree in Software Development, Computer...SoftwareTemporary workLocal area$202.16k - $368.22k
...Team The Compute Infrastructure - Orchestration &... ...Scheduling team uses Kubernetes and Serverless... ...daily, including AI and LLM workloads... ...on K8s, and LLM inference control plan project... ...seeking talented software engineers excited to... ...in large-scale, cloud-native environments...SoftwareTemporary workInternshipLocal areaOverseas$184.94k - $342.49k
...vLLM and LLM-D Engineering team at Red... ...not just build software; you will be the... ...cutting-edge inference platform (LLM-... ...solve "last mile" infrastructure challenges... ...throughput on complex Kubernetes clusters. This... ...CNI failures. AI Inference... ...deployments. Cloud & GPU Hardware...SoftwarePermanent employmentFull timeWork experience placementWork at officeRemote workFlexible hours- ...Salesforce is the #1 AI CRM, where... ...The AI and ML Infrastructure team is part... ..., deployment, inference, and... ...deployment using Kubernetes based platforms... ...across multiple cloud providers acting... ...infrastructure and product engineering teams.## About... ...or Staff Software Engineer to...SoftwareTemporary work
- ...Software Engineer Role at Salesforce Salesforce is the #1 AI CRM, where humans with agents drive... ...The AI and ML Infrastructure team is part of... ...training, deployment, inference, and monitoring.... ...and maintain Kubernetes based platforms... ...and operating cloud native systems on...SoftwareTemporary work
$207k - $300k
...learning infra (e.g., GPU, Cloud TPU, etc.). About the job... ...own ambitions, the work of a Software Engineer (SWE) goes way beyond just... ...next-generation of on‑prem AI infrastructure to bring the best of Google... ...scalable TPU orchestration with Kubernetes. Our goal is to enable...SoftwareFull timeTemporary work- ...and steerable AI systems. We want... ...researchers, engineers, policy... ...Role Anthropic's Infrastructure organization is... ...across all major cloud providers and... ...internal research, inference and product... ...(e.g., Kubernetes, IaC, AWS/GCP/... ...Qualifications 8+ years of software engineering...SoftwareVisa sponsorship
$148.5k - $313.7k
100 Salesforce, Inc. is seeking a Software Engineer for ML Infrastructure to design and operate core systems that power AI at Slack. Candidates should have significant experience in software engineering, particularly in infrastructure and distributed systems, as well as...Software$148.2k - $300.96k
...Introduction The Compute Infrastructure team uses Kubernetes and Serverless technologies... ...jobs daily, including AI and LLM workloads. The team... .... We're seeking talented software engineers excited to optimize our... ...of machines across multi-clouds globally hosting hundreds...SoftwareTemporary workInternshipLocal areaOverseas$405k
...interpretable, and steerable AI systems. We want... ...researchers, engineers, policy experts,... ...seeking a Staff Software Engineer to build... ...Claude on third‑party cloud service provider (... ...and the Cloud Inference team: taking classifiers... ...a CSP partner’s infrastructure at serving‑path...SoftwareWork at officeVisa sponsorshipFlexible hours- Paradigm is a software company transforming... ..., ML Ops Infrastructure to be part of... ...team of ML Ops engineers focused on... ...frameworks for AI/ML systems including... ...and operate Kubernetes-based... ...training, real‑time inference, LLM serving,... ...working with Azure cloud services such...Software
$197.3k - $313.7k
...Job Category Software Engineering Job Details About Salesforce Salesforce is the #1 AI CRM, where humans with agents drive customer... ...a Software Engineer on the Cloud Infrastructure Automation team at... ...containerization frameworks such as Kubernetes and Docker. Build and ship...Software$148.5k - $313.7k
...Job Category Software Engineering Job Details About... ...Salesforce is the #1 AI CRM, where humans with... ...building foundational infrastructure that enables great customer... ...- highly scalable cloud infrastructure, robust... ...technologies such as Kubernetes, AWS ECS, or Docker....SoftwareWork experience placement- ...Senior Platform/DevOps Engineer (Kubernetes-Linux) Bellevue Office, Sunset... ...edge, delivering modular AI infrastructure from first deployment to AI... ...mobile data centers and Atlas cloud integration. This is a... ...Work in collaboration with software engineering, DevOps, security...SoftwareWork at officeLocal areaFlexible hours
- ...Introduction At IBM Software, we transform client... ...the world’s leading AI-powered, cloud-native products that... ...the foudantional infrastructure layer that powers Confluent... ...As a Staff Software Engineer on the Secure... ...platform is built on Kubernetes and runs across a...SoftwareWork experience placement
$210k - $250k
...Gable.ai is looking for a Staff Software Engineer, Infrastructure, to take ownership of DevOps and infrastructure while also contributing as a software engineer.... ...of $210K - $250K, and involves working with CI/CD, cloud infrastructure, and backend systems. You will be part...Software$197k - $291k
Software Engineer, Cloud HPC and Accelerator Networking Google Seattle... ...large-scale infrastructure, distributed systems... ...compute platforms (Kubernetes, Cloud Functions).... ...frontier models, run inference, and otherwise advance... ...in next generation AI computing. Google Cloud...SoftwareFull timeTemporary work$157k - $213.8k
...and running the world's best data and AI infrastructure platform so our customers can use... ...products to build connectivity solutions to cloud resources and prevent data... ...exfiltration. We are seeking experienced Senior Software Engineers with large-scale distributed system...SoftwareLocal areaWorldwide- Ll Oefentherapie in Seattle seeks a Senior Software Engineer to lead the design and development of innovative solutions for cloud infrastructure. This role emphasizes AI and ML technologies, aiming to enhance automation and efficiency. As part of the engineering division...Software
$157.6k - $197k
...Senior Security Engineer - Infrastructure Bellevue Office, Sunset... ..., delivering modular AI infrastructure from first... ...for securing our cloud and edge computing environments... ...(AWS, Azure, GCP), Kubernetes environments, and... ...throughout the software development lifecycle...SoftwareWork at officeFlexible hours$151.8k
...seeking an experienced AI Infrastructure Engineer to join our AI... ...environments using Docker and Kubernetes Optimizing CUDA... ...Developing platform software to support AI/ML... ...training and inference pipelines What we... ...distributed systems and cloud computing ~ Demonstrate...SoftwareWork at officeRemote work$207k - $300k
Staff Software Engineer, AI/ML, Cloud Identity and Access Management Infrastructure Kirkland, WA, USA; Seattle, WA, USA. Benefits for this role include: Health, dental, vision, life, disability insurance Retirement Benefits: 401(k) with company match Paid Time Off:...SoftwareFull timeTemporary workShift work$216k - $270k
...As a Software Engineer on the Machine Learning Infrastructure team, you will build the "Operating System... ...compute into breakthrough AI. You will:... ...Expert-level knowledge of Kubernetes internals (Custom Resources... ...hardware. ~ Familiarity with cloud infrastructure (AWS, GCP...SoftwareFull time- ...B Capital is seeking Software Engineers to join the ML Infrastructure team. In this role, you will design and operate... ...learning model training and inference. Candidates need significant experience... ...distributed technologies like Kubernetes. You will work on evolving GPU infrastructure...Software
- ...combination of proprietary infrastructure and software, we empower over 200,... ...end-to-end. You use AI to work smarter and... ...the company’s global engineering organization. We build and operate the cloud infrastructure,... ...AWS, GCP, or Aliyun), Kubernetes, Terraform, CI/CD pipelines...SoftwareWorldwide
$207k - $300k
A leading technology company is seeking a Staff Software Engineer in AI/ML for its Cloud Identity and Access Management team. This role requires extensive... ..., and cloud computing, focusing on building scalable infrastructure systems. The position offers a competitive salary...Software- ...Responsibilities Kubernetes container platform .... ...systems like VMware , Infrastructure and Network knowledge. optional(Cloud ( AWS / Azure / GCP)... ...overall solution # Helps Engineers troubleshoot issues that... ...troubleshoot CI/CD pipelines and software # Able to automate...Software
- brobstongroup.com - Jobboard is seeking a Software Engineer 2 for the Data Platform Enablement team based in Seattle. This hybrid... .... The successful candidate will focus on automation, cloud infrastructure, and Kubernetes to improve platform reliability. Key responsibilities...Software
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Inference Infrastructure Software Engineer (Kubernetes / Cloud). Be the first to apply!
- senior ai engineer Seattle, WA
- ai ml engineer Seattle, WA
- ai engineer remote Seattle, WA
- ai engineer Seattle, WA
- ai prompt engineer Seattle, WA
- ai developer Seattle, WA
- machine learning ai engineer Seattle, WA
- security infrastructure engineer Seattle, WA
- principal infrastructure engineer Seattle, WA
- lead infrastructure engineer Seattle, WA

