Global Remote SRE for AI Infrastructure & Kubernetes

Andromeda Cluster

A cutting-edge AI infrastructure company is seeking a Site Reliability Engineer to manage Kubernetes clusters and improve the reliability of critical systems. The ideal candidate will have 5+ years of experience in SRE or DevOps, strong Linux and Kubernetes expertise, and skills in automation and Infrastructure-as-Code. This role offers the opportunity to shape the future of scalable AI infrastructure, working closely with both customers and technical teams in a dynamic environment.

Vacancy posted 3 days ago
Similar jobs that may interest you
Based on the Global Remote SRE for AI Infrastructure & Kubernetes vacancy in San Francisco, CA
  •  ...Infrastructure Engineer/SRE Taiwan (Remote) Cresta unlocks the true potential of the customer experience, turning...  ...advantage. Cresta's unified AI platform combines conversational AI...  ...Ensure reliability of multi-cloud Kubernetes clusters and pipelines. Metrics,... 
    Remote work
    Local area
    Work from home
    Home office

    Cresta

    United States
    16 hours ago
  •  ...trust in the age of AI At Oscilar, we're...  ...for an experienced SRE to take ownership...  ...dependency failures, and global deployments. You’...  ..., and how we run infrastructure that supports...  ...infrastructure (AWS, Pulumi, Kubernetes). Lead...  ...plan Flexibility: Remote-first culture —... 
    Remote work

    Oscilar

    New York, NY
    1 day ago
  • A leading AI infrastructure company seeks a Head of AI Infrastructure to define...  ...the technical roadmap for a global GPU cloud platform. This...  ...significant expertise in Kubernetes and GPU clusters. The ideal...  ...distributed team, working in a remote-first environment, and will... 
    Remote job
    Immediate start

    Blue Signal Search

    San Francisco, CA
    1 day ago
  • $140k - $215k

     ...As a global leader in cybersecurity, CrowdStrike protects...  ...world's most advanced AI-native platform. Our...  ...is hiring for a Sr. Infrastructure Engineer to help with...  ...depth experience with Kubernetes, its many add-ons,...  ...effectively with both local and remote teams Rock solid... 
    Remote work
    Work experience placement
    Work at office
    Local area
    Shift work

    CrowdStrike Holdings, Inc.

    United States
    5 hours ago
  • $200k - $250k

     ...Responsibilities Kubernetes Ownership :...  ...and security. Infrastructure Automation : Implement...  ...~5+ years in a SRE or Software Engineer...  ...for your ideal remote set-up ~ Flexible...  ...meal benefit ~ Global off-sites The...  .... Phantom may use AI-powered tools and... 
    Remote work
    Live in
    Flexible hours

    Phantom Technologies

    United States
    16 hours ago
  •  ...Overview: Job Title: AI/ML Ops & Infrastructure Engineer Company:...  ..., GA (Hybrid / Remote Options Available)...  ...of experience across global markets, we have built...  ...Serve, NVIDIA Triton) on Kubernetes across multi-cloud...  ...Reliability Engineering (SRE), or Cloud... 
    Remote work
    Full time
    Shift work

    R2 Technologies

    Alpharetta, GA
    1 day ago
  •  ...Infrastructure Ops Engineer at Baseten Baseten powers mission...  ...world's most dynamic AI companies, like Cursor,...  ...engine of our global infrastructure. You will...  ...partnering closely with our SRE and FDE teams to execute...  ...will be hands-on with Kubernetes and cloud-native tools,... 
    Remote work
    Work experience placement
    Work at office
    Flexible hours

    Baseten

    United States
    4 days ago
  •  ...the world's most dynamic AI companies, like Cursor,...  ...AI research, flexible infrastructure, and seamless developer...  ...Design and architect a global training scheduler...  ...Partner closely with SRE and Capacity teams to unlock...  ...Deep expertise with Kubernetes in production environments... 
    Remote work
    Flexible hours

    Baseten

    United States
    16 hours ago
  • A leading AI company is looking for an Infrastructure Engineer to design and build its core infrastructure. You'll work in a remote, autonomous environment supporting developer workflows, ensuring reliability of Kubernetes clusters, and automating operations. Requires... 
    Remote job

    Cresta

    New York, NY
    1 day ago
  • $180k - $200k

     ...Infrastructure Engineer (Observability) Lightning AI is the company behind PyTorch Lightning...  ...AI operates globally with offices in New...  ..., or SF) or fully remote within the U.S., with...  ...engineering, SRE, or observability-...  ...environments and Kubernetes observability ~... 
    Remote work
    Work from home
    Flexible hours

    Lightning AI

    Seattle, WA
    3 days ago
  • $93.5k - $182.85k

     ...resilient. The company's unique AI-powered platform combines...  ...possesses the vision to architect a global network and the grit to build...  ...the CISO and Director of Infrastructure to align networking goals...  ...Commvault? Apply now!? #LI-JS1 #LI-Remote Thank you for your... 
    Remote work

    Commvault

    Eatontown, NJ
    13 days ago
  • $230k - $240k

     ...Job Description: Title: VP, Global IT Infrastructure & Operations Location: Remote - USA or Canada Reports to: Chief...  ...will establish the foundation for AI-enabled operations, drive top-line...  ...frameworks, including ITIL, SRE practices, and DevOps methodologies... 
    Remote work
    Contract work

    J.D. Power

    Grizzly Flats, CA
    3 days ago
  •  ...Senior Infrastructure Engineer LiveKit is building the infrastructure...  ..., scale, and observe AI agents in production....  ...this team: (1) Product SRE — jumping directly into...  ...influence over how a global real-time platform...  ...CockroachDB, NATS, Nebula, Kubernetes — and map where... 
    Remote work
    Flexible hours

    LiveKit

    United States
    16 hours ago
  •  ...Reinforcement Learning, AI, Control and...  ...Machine Learning infrastructure. The ideal candidate...  ...in DevOps/SRE practices, cloud infrastructure...  ...GCP and AWS using Kubernetes, Apache Airflow,...  ...are flexible for remote work except for...  ...benefits include global access to mental health... 
    Remote work
    Work at office
    Local area
    Monday to Thursday
    Flexible hours

    Roku

    Austin, TX
    1 day ago
  •  ...Enterprise Architect, Infrastructure and Operations...  ...platform engineering, SRE, networking, and...  ...enable safe, scalable AI adoption, and lower...  ...Design and govern global, cloud-native platforms (e.g., Kubernetes/GKE, service mesh,...  ...you can choose to be remote or in the office.... 
    Remote work
    Interim role
    Work at office
    Flexible hours

    Priceline.com LLC

    New York, NY
    3 days ago
  • Senior Site Reliability Engineer - AI Infrastructure Location: Global Remote / San Francisco • Full-Time About Andromeda...  ...The Role This is not a generalist SRE role. You will design, operate, and...  ...at the syscall and hardware level. Kubernetes & Orchestration: Strong experience... 
    Remote work
    Full time

    Cortes 23

    San Francisco, CA
    1 day ago
  •  ...design, and build our SRE foundation from...  .... Location: Remote - US based What...  ...comprehensive application and infrastructure monitoring...  ...Leverage AI and machine learning...  ...containerization (Docker, Kubernetes) and orchestration...  ...a fragmented, global marketplace of... 
    Remote work

    INFINITE CHOICE LLC

    Dallas, TX
    a month ago
  •  ...Reinforcement Learning, AI, Control and...  ...Machine Learning infrastructure. The ideal candidate...  ...in DevOps/SRE practices, cloud infrastructure...  ...GCP and AWS using Kubernetes, Apache Airflow,...  ...are flexible for remote work except for...  ...benefits include global access to mental health... 
    Remote work
    Work at office
    Local area
    Monday to Thursday
    Flexible hours

    Roku, Inc.

    Austin, TX
    4 days ago
  • $140k - $208k

     ..., observability, and AI workloads. The company...  ...of AI innovators and global brands such as Meta,...  ...'re hiring a Senior SRE / Senior Infrastructure Engineer to own...  ...with Terraform , Kubernetes , and container-based...  ...infrastructure. #LI-remote The typical starting... 
    Remote work
    Local area
    Home office
    Flexible hours

    ClickHouse

    United States
    2 days ago
  •  ...collaboration and AI-powered workflow software...  ...as an all-remote company, though many...  ...all forces and global organizations, and...  ...Role Onebrief's infrastructure team owns the...  ...Our charter spans Kubernetes clusters in commercial...  ...DevSecOps, Platform, SRE, Cloud, or... 
    Remote work

    Onebrief, Inc

    United States
    16 hours ago
  •  ...Salesforce is the #1 AI CRM, where humans with...  ...is the backbone of our infrastructure - a dynamic group of Cloud...  ...the systems that power global real-time communication...  ...Experience working with Kubernetes (K8s) Qualifications...  ..., OpenTelemetry) and SRE practices Unleash... 
    Remote work
    Work experience placement
    Worldwide

    Salesforce

    United States
    8 days ago
  • $133.5k - $212k

     ...Senior Infrastructure Engineer REMOTE - US Iterable is the leading AI-powered customer engagement platform...  ...Want to Work. With a global presence—including...  ..., Infrastructure, and SRE to bring next-generation...  ...will: Use your Kubernetes and AWS expertise to evolve... 
    Remote work
    Contract work
    Local area
    Immediate start
    Worldwide
    Home office
    Flexible hours

    Iterable

    United States
    2 days ago
  •  ...company that's on a journey to transform the world's infrastructure. We are seeking a Director of Global IT DevOps & AI Infrastructure to take full ownership of how...  ...Full-Time or Part-Time : Full-Time Reports to : Founder & CEO Location : Remote - US... 
    Remote work
    Full time
    Part time
    For contractors

    Endeavour. Inspired Infrastructure.

    Darien, CT
    25 days ago
  • $189.6k - $312.73k

     ...solve " last mile " infrastructure challenges that...  ...throughput on complex Kubernetes clusters. This is...  ...Backend Systems, SRE, or Infrastructure...  ...CNI failures. AI Inference Proficiency...  ...positions with Remote‑US locations, the...  ...that compose our global village. Equal Opportunity... 
    Remote work
    Permanent employment
    Full time
    Contract work
    Work experience placement
    Work at office
    Flexible hours

    Red Hat, Inc.

    Sacramento, CA
    4 days ago
  •  ...contract Location: Remote (overlap with PST)...  ...Sphere , we partner with global logistics company leveraging AI, Machine Learning, and Data...  ...maintain scalable AI infrastructure, enabling teams to run ML...  ...Docker) , orchestration (Kubernetes) , and CI/CD for ML .... 
    Remote work
    Long term contract

    Sphere Partners LLC

    United States
    4 days ago
  • $180k - $200k

     ...bureaucracy — backed by the global footprint and...  ...turn cutting‑edge AI research into...  ...Serve as the founding infrastructure engineer, building...  ...extending Kubernetes via CRDs. Deep Kubernetes...  ...). Proven SRE practice—SLIs/SLOs...  ...has cracked before. Remote & Flexible - Work... 
    Remote work
    Permanent employment
    Full time
    Temporary work
    Work experience placement
    Flexible hours

    Integrated Research Ltd.

    Denver, CO
    4 days ago
  •  ...About Nebius: Nebius is leading a new era in cloud infrastructure for the global AI economy. We are building a full-stack AI cloud platform...  ...Familiarity with containerized environments (e.g., Docker, Kubernetes). Strong communication and ability to work... 
    Remote work

    Nebius

    United States
    16 hours ago
  • $153.2k - $234.1k

     ...future of transportation on a global scale. Role: Are you...  ...driving? Join the Embodied AI team at General Motors. Our...  ..., applications, or ML infrastructure. ~ Experience designing robust...  ...(e.g., Docker, Kubernetes). Remote/Hybrid: This role is categorized... 
    Remote work
    Local area
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Sunnyvale, CA
    3 days ago
  • $200k - $250k

     ...Presence is building the AI growth platform for...  ...AI-native, self-service infrastructure platform that powers how...  ...Engineering, or SRE - you've built and operated...  ...workflow - Terraform changes, Kubernetes debugging, automation,...  ...has grown to a global team ranked on the Inc.... 
    Remote work
    Immediate start
    Shift work

    Luxury Presence

    United States
    3 days ago
  • $189.3k - $290.7k

     ...future of transportation on a global scale. Role: Are you...  ...driving? Join the Embodied AI team at General Motors. Our...  ...efficient systems on modern cloud infrastructure-performance ~ End-to-end...  ...technologies (e.g., Docker, Kubernetes) Remote/Hybrid: This role is... 
    Remote work
    Local area
    Work from home
    Relocation
    Relocation package
    Flexible hours

    General Motors

    Olympia, WA
    3 days ago
