Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Platform Reliability Engineer

Virtual Vocations Inc

A company is looking for a Platform Reliability Engineer. Key Responsibilities Design and maintain a Kubernetes-based platform for autonomous AI execution Automate infrastructure processes using Terraform and other tools to achieve zero manual intervention Establish reliability metrics and implement self-healing systems for AI workflows Required Qualifications 6+ years of experience in Platform Engineering, SRE, or Infrastructure roles with a focus on AI/ML systems Mastery of Terraform, ArgoCD, and GitOps workflows Expert-level knowledge of Kubernetes networking, scaling, and security Hands-on experience with MLOps pipelines and scaling AI inference services Proficiency in Python for automation and platform tool development

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the Platform Reliability Engineer in United States vacancy
  • $168.93k - $192.5k

     ...enable all people to have a secure digital identity. To learn more, visit Role Overview We are seeking a Site Reliability Engineer to join our Core Platform Engineering organization. The SRE team builds the automation, observability, and operational foundations that... 
    Suggested
    Full time
    Temporary work
    Work at office
    Remote work
    Flexible hours

    ID.me

    Mountain View, CA
    1 day ago
  •  ...harnessed the power of airspace technology, analytics platforms, and drone services to transform business operations...  ...About the role DroneUp is seeking an SRE - Platform Engineer who will focus on ensuring the reliability, scalability, and performance of our internal and... 
    Suggested
    Contract work
    Remote work

    DroneUp

    New York, NY
    2 days ago
  •  ...A leading tech company in the United States is seeking a Site Reliability Engineer to ensure the reliability and performance of their services. The ideal candidate will have a solid background in cloud computing and container orchestration, along with strong programming... 
    Suggested
    Remote work

    DevOpsChat

    United States
    2 days ago
  •  ...DroneUp, LLC is hiring an SRE - Platform Engineer in the United States, focusing on the reliability and performance of their IT infrastructure while mentoring teams. Responsibilities include managing SLOs and incident response while working with cloud technologies such... 
    Suggested

    DroneUp

    New York, NY
    2 days ago
  • $90k - $215k

     ...Senior Software Engineer- Observability and Reliability Platform Engineering (REMOTE) Senior Software Engineer- Observability and Reliability Platform Engineering (REMOTE) 1 week ago Be among the first 25 applicants At GEICO, we offer a rewarding career where your ambitions... 
    Suggested
    Hourly pay
    Full time
    Work experience placement
    Local area
    Remote work
    Flexible hours

    GEICO

    San Jose, CA
    20 hours ago
  •  ...operations, and customer experience. We combine agency speed with engineering discipline, so people who join us get broad ownership and...  ...guardrails. We need a pragmatic engineer who can improve reliability without freezing product delivery. You will redesign delivery... 
    Weekend work

    KeY2Moon Solutions

    New York, NY
    1 day ago
  •  ...Site Reliability Engineering (SRE) Platform Engineer (Lead) Job Number: 26-00672 Use your skills where innovative technology solutions begin. ECLARO is looking for a Site Reliability Engineering (SRE) Platform Engineer (Lead) for our client in Rochester, NY. ECLARO’s... 
    Local area

    Eclaro

    Rochester, NY
    2 days ago
  • $90k - $215k

     ...Senior Software Engineer - Observability and Reliability Platform Engineering (REMOTE) Join to apply for the Senior Software Engineer - Observability and Reliability Platform Engineering (REMOTE) role at GEICO . Position Summary GEICO is seeking an experienced... 
    Remote work

    GEICO

    Colorado Springs, CO
    4 days ago
  • $168.93k - $192.5k

     ...enable all people to have a secure digital identity. To learn more, visit Role Overview We are seeking a Site Reliability Engineer to join our Core Platform Engineering organization. The SRE team builds the automation, observability, and operational foundations that... 
    Full time
    Temporary work
    Work at office
    Remote work
    Flexible hours

    ID.me

    Mountain View, CA
    28 days ago
  • A bioscience and IT firm located in Rockville, Maryland is seeking a DevOps Engineer with extensive experience in Linux and cloud platforms. The successful candidate will be responsible for designing scalable infrastructure, leading DevOps practices, and optimizing AI/ML... 

    Axle

    Rockville, MD
    4 days ago
  • $130k - $170k

     ...A leading media and entertainment company is looking for a Staff Software Engineer (SRE Lead) to oversee operational support for SAP BTP CPI applications. The role requires strong hands-on experience with BTP Cloud Foundry, AEM, and Edge Integration Cell. Responsibilities... 
    Remote work

    NBCUniversal

    United States
    1 day ago
  • Supporting our newest platforms, on which: Cloud native applications are built, leveraging a microservices architecture, Java, Kotlin,...  ...Jenkins, Spinnaker, Prometheus, Grafana and Mimir. Helping other engineers to learn and adopt these technologies and techniques.... 

    Apex Systems

    Indianapolis, IN
    2 days ago
  • $160k - $220k

     ...developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars. SR. KUBERNETES PLATFORM SITE RELIABILITY ENGINEER (STARLINK) At SpaceX we’re leveraging our experience in building rockets and spacecraft to deploy Starlink, the world... 
    Permanent employment
    Temporary work
    Work at office
    Worldwide
    Monday to Friday
    Weekend work

    Latent AI

    Redmond, WA
    3 days ago
  • $145.4k - $212.2k

     ...Senior Associate – Platform Reliability Engineer – Cloud Technology Location: 110 Cokesbury Road, Lebanon, New Jersey 08833 Offered Wage: $145,400.06 - $212,200.00/year Duties Automate and deliver repeatable, reliable solutions using Terraform, GitHub, Jenkins... 

    New York Life

    Lebanon, NJ
    3 days ago
  • $145.4k - $212.2k

     ...Duties : Automate and deliver repeatable, reliable solutions using Terraform, GitHub,...  ...implementations on AWS. Automate remediation of platform defects and security vulnerabilities....  ...with Cloud Architects and Solution Engineers to deliver solutions and projects. Education... 
    Local area

    New York Life Insurance Company

    Lebanon, NJ
    4 days ago
  •  ...identity security, delivering an AI-powered platform that governs and secures access to...  ...cloud-native systems. As a Staff Platform Engineer, you will play a critical role in ensuring...  ...technical leadership role. You will own reliability for major platform domains, design... 

    Saviynt

    Atlanta, GA
    14 days ago
  • $163k - $203k

    GoTo Meeting is looking for a Senior Site Reliability Engineer in San Francisco. You will be responsible for the reliability, scalability, and security of Prosper’s Cloud Platform portfolio. This role requires expertise in Kubernetes, cloud platforms (preferably GCP), and... 

    GoTo Meeting

    San Francisco, CA
    2 days ago
  • An innovative R&D company in San Francisco is seeking a Site Reliability Engineer to join its Platform Engineering team. This position focuses on ensuring the reliability and performance of an AI-powered code review platform. The ideal candidate will have 6-8 years of experience... 

    CodeRabbit

    San Francisco, CA
    20 hours ago
  •  ...technology company is looking for a Java SRE Engineer to support large-scale cloud migrations...  ...lead migrations, design robust AWS EKS platforms, and implement deployment strategies....  ...with various teams to ensure reliability. This position is onsite in the San Francisco... 

    EITACIES Inc.

    Santa Clara, CA
    4 days ago
  • ManpowerGroup Global, Inc. is seeking a Salesforce SRE Lead - Reliability Engineering & Operations to support crucial manufacturing and patient...  .... The ideal candidate will have extensive Salesforce platform expertise and deep knowledge of reliability engineering. This... 
    Flexible hours

    ManpowerGroup Global, Inc.

    New Brunswick, NJ
    1 day ago
  • $141.6k - $212.4k

     ...EngineeringGeneral Summary:Looking for a SRE Platform Lead resource to implement and support...  ...contributor role combining platform engineering + SRE + integration/event streaming...  ...improvement to reduce MTTR/MTBF and improve reliability.DevSecOps EnablementBuild and... 
    Work experience placement
    Work from home
    Weekend work
    Weekday work

    Nutanix

    San Diego, CA
    1 day ago
  • $202.8k - $327.63k

     ...Docusign's Intelligent Agreement Management platform, companies can create, commit, and...  ...ll do The Senior Director, SRE Platform Engineering is a senior engineering leader...  ...IT Service Management (ITSM) and Site Reliability Engineering (SRE) capabilities, applying... 
    Permanent employment
    Contract work
    Work at office
    Local area
    Remote work
    2 days per week

    DocuSign, Inc.

    San Francisco, CA
    1 day ago
  • $119k - $206k

     ...About this role: Wells Fargo is seeking a Lead Systems Operations Engineer – Platform Reliability Engineering (PRE) within the CTO Platform organization. This role is aligned to modern Site Reliability Engineering (SRE) practices and is responsible for driving reliability... 
    Work experience placement

    Wells Fargo

    West Des Moines, IA
    20 hours ago
  • Overview Senior Platform & Reliability Engineer OpenArt is an AI Storytelling and Visual Creation Platform used by millions worldwide. We’re building the next generation of creative tools powered by cutting-edge AI, enabling anyone to create videos, visuals, characters,... 
    Remote work
    Worldwide
    Visa sponsorship

    OpenArt AI

    San Francisco, CA
    1 day ago
  • Toshiba Global Commerce Solutions is seeking a Senior Software Engineer in Durham, NC, focused on performance, resilience, and...  ...experience, strong skills in Node.js and Java, and a background in reliability engineering. The position offers robust benefits including health... 

    Toshiba Global Commerce Solutions

    Durham, NC
    4 days ago
  • $200k - $250k

    A leading visual creation platform in San Francisco is seeking a Senior Owner of Stability and Infrastructure. This hands-on technical leadership role demands expertise in service reliability to ensure the platform's performance as it scales. Responsibilities include setting... 

    Vizcom

    San Francisco, CA
    2 days ago
  • Founding Platform & Reliability Engineer About OpenArt OpenArt is an AI Storytelling and Visual Creation Platform used by millions worldwide. We’re building the next generation of creative tools powered by cutting-edge AI, enabling anyone to create videos, visuals, characters... 
    Remote work
    Worldwide
    Visa sponsorship

    Embedding VC

    San Francisco, CA
    4 days ago
  • WRITER is seeking an Infrastructure Engineer to ensure robust and reliable systems for their AI platform. This hybrid position, based in New York City, San Francisco, Seattle, or London, involves automating processes, leading incident responses, and collaborating with... 

    WRITER

    New York, NY
    20 hours ago
  • OpenArt AI in San Francisco is seeking a Senior Platform & Reliability Engineer to design and improve the reliability of its infrastructure. The role emphasizes building and operating production systems while collaborating with product engineers to ensure platform scalability... 

    OpenArt AI

    San Francisco, CA
    1 day ago
  • Skechers USA Ltd. in Manhattan Beach, CA, is looking for a Sr. Reliability Engineer, Digital Marketing. The role focuses on ensuring the...  ...reliability engineering or related fields and hands-on skills with platforms like Salesforce. This hybrid position requires a minimum of... 
    3 days per week

    Skechers USA Ltd.

    Manhattan Beach, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Platform Reliability Engineer. Be the first to apply!