Global Remote SRE for AI Infrastructure & Kubernetes
Andromeda Cluster
- Remote job
A cutting-edge AI infrastructure company is seeking a Site Reliability Engineer to manage Kubernetes clusters and improve the reliability of critical systems. The ideal candidate will have 5+ years of experience in SRE or DevOps, strong Linux and Kubernetes expertise, and skills in automation and Infrastructure-as-Code. This role offers the opportunity to shape the future of scalable AI infrastructure, working closely with both customers and technical teams in a dynamic environment. #J-18808-Ljbffr Andromeda Cluster
- ...Infrastructure Engineer/SRE Taiwan (Remote) Cresta unlocks the true potential of the customer experience, turning... ...advantage. Cresta's unified AI platform combines conversational AI... ...Ensure reliability of multi-cloud Kubernetes clusters and pipelines. Metrics,...Remote workLocal areaWork from homeHome office
- ...trust in the age of AI At Oscilar, we're... ...for a experienced SRE to take ownership... ...dependency failures, and global deployments. You’... ..., and how we run infrastructure that supports... ...infrastructure (AWS, Pulumi, Kubernetes). Lead... ...plan Flexibility: Remote-first culture —...Remote work
- A leading AI infrastructure company seeks a Head of AI Infrastructure to define... ...the technical roadmap for a global GPU cloud platform. This... ...significant expertise in Kubernetes and GPU clusters. The ideal... ...distributed team, working in a remote-first environment, and will...Remote jobImmediate start
$140k - $215k
...As a global leader in cybersecurity, CrowdStrike protects... ...world's most advanced AI-native platform. Our... ...is hiring for a Sr. Infrastructure Engineer to help with... ...depth experience with Kubernetes, its many add-ons,... ...effectively with both local and remote teams Rock solid...Remote workWork experience placementWork at officeLocal areaShift work$200k - $250k
...Responsibilities Kubernetes Ownership :... ...and security. Infrastructure Automation : Implement... ...~5+ years in a SRE or Software Engineer... ...for your ideal remote set-up ~ Flexible... ...meal benefit ~ Global off-sites The... .... Phantom may use AI-powered tools and...Remote workLive inFlexible hours- ...Overview: Job Title: AI/ML Ops & Infrastructure Engineer Company:... ..., GA (Hybrid / Remote Options Available)... ...of experience across global markets, we have built... ...Serve, NVIDIA Triton) on Kubernetes across multi-cloud... ...Reliability Engineering (SRE), or Cloud...Remote workFull timeShift work
- ...Infrastructure Ops Engineer at Baseten Baseten powers mission... ...world's most dynamic AI companies, like Cursor,... ...engine of our global infrastructure. You will... ...partnering closely with our SRE and FDE teams to execute... ...will be hands-on with Kubernetes and cloud-native tools,...Remote workWork experience placementWork at officeFlexible hours
- ...the world's most dynamic AI companies, like Cursor,... ...AI research, flexible infrastructure, and seamless developer... ...Design and architect a global training scheduler... ...Partner closely with SRE and Capacity teams to unlock... ...Deep expertise with Kubernetes in production environments...Remote workFlexible hours
- A leading AI company is looking for an Infrastructure Engineer to design and build its core infrastructure. You'll work in a remote, autonomous environment supporting developer workflows, ensuring reliability of Kubernetes clusters, and automating operations. Requires...Remote job
$180k - $200k
...Infrastructure Engineer (Observability) Lightning AI is the company behind PyTorch Lightning... ...AI operates globally with offices in New... ..., or SF) or fully remote within the U.S., with... ...engineering, SRE, or observability-... ...environments and Kubernetes observability ~...Remote workWork from homeFlexible hours$93.5k - $182.85k
...resilient. The company's unique AI-powered platform combines... ...possesses the vision to architect a global network and the grit to build... ...the CISO and Director of Infrastructure to align networking goals... ...Commvault? Apply now!? #LI-JS1 #LI-Remote Thank you for your...Remote work$230k - $240k
...Job Description: Title: VP, Global IT Infrastructure & Operations Location: Remote - USA or Canada Reports to: Chief... ...will establish the foundation for AI-enabled operations, drive top-line... ...frameworks, including ITIL, SRE practices, and DevOps methodologies...Remote workContract work- ...Senior Infrastructure Engineer LiveKit is building the infrastructure... ..., scale, and observe AI agents in production.... ...this team: (1) Product SRE — jumping directly into... ...influence over how a global real-time platform... ...CockroachDB, NATS, Nebula, Kubernetes — and map where...Remote workFlexible hours
- ...Reinforcement Learning, AI, Control and... ...Machine Learning infrastructure. The ideal candidate... ...in DevOps/SRE practices, cloud infrastructure... ...GCP and AWS using Kubernetes, Apache Airflow,... ...are flexible for remote work except for... ...benefits include global access to mental health...Remote workWork at officeLocal areaMonday to ThursdayFlexible hours
- ...Enterprise Architect, Infrastructure and Operations... ...platform engineering, SRE, networking, and... ...enable safe, scalable AI adoption, and lower... ...Design and govern global, cloud-native platforms (e.g., Kubernetes/GKE, service mesh,... ...you can choose to be remote or in the office....Remote workInterim roleWork at officeFlexible hours
- Senior Site Reliability Engineer - AI Infrastructure Location: Global Remote / San Francisco • Full-Time About Andromeda... ...The Role This is not a generalist SRE role. You will design, operate, and... ...at the syscall and hardware level. Kubernetes & Orchestration: Strong experience...Remote workFull time
- ...design, and build our SRE foundation from... .... Location: Remote - US based What... ...comprehensive application and infrastructure monitoring... ...Leverage AI and machine learning... ...containerization (Docker, Kubernetes) and orchestration... ...a fragmented, global marketplace of...Remote work
- ...Reinforcement Learning, AI, Control and... ...Machine Learning infrastructure. The ideal candidate... ...in DevOps/SRE practices, cloud infrastructure... ...GCP and AWS using Kubernetes, Apache Airflow,... ...are flexible for remote work except for... ...benefits include global access to mental health...Remote workWork at officeLocal areaMonday to ThursdayFlexible hours
$140k - $208k
..., observability, and AI workloads. The company... ...of AI innovators and global brands such as Meta,... ...'re hiring a Senior SRE / Senior Infrastructure Engineer to own... ...with Terraform , Kubernetes , and container-based... ...infrastructure. #LI-remote The typical starting...Remote workLocal areaHome officeFlexible hours- ...collaboration and AI-powered workflow software... ...as an all-remote company, though many... ...all forces and global organizations, and... ...Role Onebrief's infrastructure team owns the... ...Our charter spans Kubernetes clusters in commercial... ...DevSecOps, Platform, SRE, Cloud, or...Remote work
- ...Salesforce is the #1 AI CRM, where humans with... ...is the backbone of our infrastructure - a dynamic group of Cloud... ...the systems that power global real-time communication... ...Experience working with Kubernetes (K8s) Qualifications... ..., OpenTelemetry) and SRE practices Unleash...Remote workWork experience placementWorldwide
$133.5k - $212k
...Senior Infrastructure Engineer REMOTE - US Iterable is the leading AI-powered customer engagement platform... ...Want to Work. With a global presence—including... ..., Infrastructure, and SRE to bring next-generation... ...will: Use your Kubernetes and AWS expertise to evolve...Remote workContract workLocal areaImmediate startWorldwideHome officeFlexible hours- ...company that's on a journey to transform the world's infrastructure. We are seeking a Director of Global IT DevOps & AI Infrastructure to take full ownership of how... ...Full-Time or Part-Time : Full-Time Reports to : Founder & CEO Location : Remote - US...Remote workFull timePart timeFor contractors
$189.6k - $312.73k
...solve " last mile " infrastructure challenges that... ...throughput on complex Kubernetes clusters. This is... ...Backend Systems, SRE, or Infrastructure... ...CNI failures. AI Inference Proficiency... ...positions with Remote‑US locations, the... ...that compose our global village. Equal Opportunity...Remote workPermanent employmentFull timeContract workWork experience placementWork at officeFlexible hours- ...contract Location: Remote (overlap with PST)... ...Sphere , we partner with global logistics company leveraging AI, Machine Learning, and Data... ...maintain scalable AI infrastructure, enabling teams to run ML... ...Docker) , orchestration (Kubernetes) , and CI/CD for ML ....Remote workLong term contract
$180k - $200k
...bureaucracy — backed by the global footprint and... ...turn cutting‑edge AI research into... ...Serve as the founding infrastructure engineer, building... ...extending Kubernetes via CRDs. Deep Kubernetes... ...). Proven SRE practice—SLIs/SLOs... ...has cracked before. Remote & Flexible - Work...Remote workPermanent employmentFull timeTemporary workWork experience placementFlexible hours- ...About Nebius: Nebius is leading a new era in cloud infrastructure for the global AI economy. We are building a full-stack AI cloud platform... ...Familiarity with containerized environments (e.g., Docker, Kubernetes). Strong communication and ability to work...Remote work
$153.2k - $234.1k
...future of transportation on a global scale. Role: Are you... ...driving? Join the Embodied AI team at General Motors. Our... ..., applications, or ML infrastructure. ~ Experience designing robust... ...(e.g., Docker, Kubernetes). Remote/Hybrid: This role is categorized...Remote workLocal areaWork from homeRelocation packageFlexible hours$200k - $250k
...Presence is building the AI growth platform for... ...AI-native, self-service infrastructure platform that powers how... ...Engineering, or SRE - you've built and operated... ...workflow - Terraform changes, Kubernetes debugging, automation,... ...has grown to a global team ranked on the Inc....Remote workImmediate startShift work$189.3k - $290.7k
...future of transportation on a global scale. Role: Are you... ...driving? Join the Embodied AI team at General Motors. Our... ...efficient systems on modern cloud infrastructure-performance ~ End-to-end... ...technologies (e.g., Docker, Kubernetes) Remote/Hybrid: This role is...Remote workLocal areaWork from homeRelocationRelocation packageFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Global Remote SRE for AI Infrastructure & Kubernetes. Be the first to apply!
- remote education consultant San Francisco, CA
- remote nonprofit San Francisco, CA
- remote financial analyst San Francisco, CA
- remote virtual assistant San Francisco, CA
- junior ux designer remote San Francisco, CA
- remote real estate San Francisco, CA
- remote design intern San Francisco, CA
- remote legal internship San Francisco, CA
- software engineer internship remote San Francisco, CA
- remote data entry agent San Francisco, CA


