Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior Site Reliability Engineer

Replit

Replit is the agentic software creation platform that enables anyone to build applications using natural language. With millions of users worldwide, Replit is democratizing software development by removing traditional barriers to application creation.

About the role:

Join our Site Reliability Engineering team and help ensure the reliability, scalability, and performance of Replit's infrastructure that serves millions of developers worldwide. As a Site Reliability Engineer, you will bridge the gap between development and operations, implementing automation and establishing best practices that enable our platform to scale efficiently while maintaining high availability.

We are seeking SREs who are passionate about building and maintaining resilient systems at scale. Your mission will be to design and implement robust monitoring solutions, automate operational tasks, and continuously improve our infrastructure's reliability and performance.

You will:
  • Design and Implement Observability Solutions : Develop comprehensive monitoring and alerting systems using modern observability tools. Create dashboards and metrics that provide real-time visibility into system health and performance. Implement logging strategies that enable quick problem identification and resolution.

  • Drive Automation and Infrastructure as Code : Architect and implement infrastructure automation solutions using tools like Terraform, Ansible, or Pulumi. Design and maintain CI/CD pipelines that enable reliable and consistent deployments. Create self-healing systems that can automatically respond to common failure scenarios.

  • Establish SLOs and SLIs : Work with product and engineering teams to define and implement Service Level Objectives (SLOs) and Service Level Indicators (SLIs). Build systems to track and report on these metrics, ensuring we maintain high reliability standards while balancing innovation speed.

  • Incident Management and Response : Lead incident response efforts, conducting thorough post-mortems, and implementing improvements to prevent future occurrences. Develop and maintain runbooks for critical services. Build tools and processes that reduce Mean Time To Recovery (MTTR).

  • Performance Optimization : Identify and resolve performance bottlenecks across our infrastructure. Implement capacity planning strategies and optimize resource utilization. Work on reducing latency and improving system efficiency across global regions.

Required skills and experience:
  • 4-8 years of experience in Site Reliability Engineering or similar roles (DevOps, Systems Engineering, Infrastructure Engineering)

  • Strong programming skills in languages commonly used for automation (Python, Go, or similar)

  • Deep understanding of distributed systems

  • Experience with container orchestration platforms (Kubernetes) and cloud-native technologies

  • Proven track record of implementing and maintaining monitoring/observability solutions

  • Strong incident management skills with experience leading incident response

  • Experience with infrastructure as code and configuration management tools

Bonus Points:
  • Experience with Google Cloud Platform (GCP) services and tools

  • Knowledge of modern observability platforms (Prometheus, Grafana, Datadog, etc.)

What we value:
  • Problem-solving mindset: Ability to approach complex operational challenges systematically and devise effective solutions

  • Self-directed and autonomous: Capable of working independently while collaborating effectively with cross-functional teams

  • Strong communication skills: Ability to explain complex technical concepts to both technical and non-technical audiences

  • Continuous learning: Passion for staying current with industry best practices and new technologies

  • Focus on automation: Strong belief in automating repetitive tasks and building self-healing systems

Full-Time Employee Benefits Include:

Competitive Salary & Equity

401(k) Program with a 4% match ( US Only )

⚕️ Health, Dental, Vision and Life Insurance

Short Term and Long Term Disability

Paid Parental, Medical, Caregiver Leave

Flexible Time Off (FTO) + Holidays

Commuter Benefits ( In-Office Only )

Monthly Wellness Stipend

‍ Autonomous Work Environment

In Office Set-Up Reimbursement ( In-Office Only )

Quarterly Team Gatherings

☕ In Office Amenities ( In-Office Only )

Want to learn more about what we are up to?

  • Meet the Replit Agent

  • Replit: Make an app for that

  • Replit Blog

  • Amjad TED Talk

Interviewing + Culture at Replit

  • Operating Principles

  • Reasons not to work at Replit

To achieve our mission of making programming more accessible around the world, we need our team to be representative of the world. We welcome your unique perspective and experiences in shaping this product. We encourage people from all kinds of backgrounds to apply, including and especially candidates from underrepresented and non-traditional backgrounds.

Vacancy posted 14 hours ago
Similar jobs that could be interesting for youBased on the Senior Site Reliability Engineer in United States vacancy
  • $96k - $163k

     ...products and services that help people, businesses and governments realize their greatest potential. Title and Summary Senior Site Reliability Engineer Who is Mastercard? At Mastercard technology, we work to connect and power an inclusive, digital economy that... 
    Senior
    Full time
    Part time
    Worldwide
    Flexible hours

    Mastercard

    O Fallon, MO
    1 day ago
  • $163k

     ...and services that help people, businesses and governments realize their greatest potential. Title and Summary Senior Site Reliability Engineer Overview-The ProCOM team is looking for a Site Reliability Engineering (SRE) who can help us solve problems, build... 
    Senior
    Full time
    Part time
    Immediate start
    Worldwide
    Flexible hours

    Mastercard

    O Fallon, MO
    1 day ago
  • $96k - $163k

     ...and services that help people, businesses and governments realize their greatest potential. Title and Summary Senior Site Reliability Engineer Overview The BizOps team is looking for a Senior Site Reliability Engineer who can help us solve problems and... 
    Senior
    Full time
    Part time
    Worldwide
    Flexible hours
    Shift work

    Mastercard

    O Fallon, MO
    1 day ago
  •  ...Joining a high-performing team remotely, the full-time Senior Site Reliability Engineer will own the reliability and automation of critical AI infrastructure, ensuring systems are resilient and secure while building automation tools to streamline operational workflows... 
    Senior
    Full time
    Remote work

    Virtual Vocations Inc

    United States
    1 day ago
  •  ...them resilient to any single person, and raise the bar on how reliably we run. This is not simply a ticket-queue or keep-the-lights...  ...that make them boring . We deliberately pair operational and engineering work so the role grows rather than narrows. What you'll own... 
    Senior
    Remote work

    Synthesia

    United States
    2 days ago
  •  ...actionable to everyone, everywhere. That everyone now includes AI agents. The Role: You'll be the infrastructure and reliability engineer on the Data Replication team - a full-stack product team running over 3 million sync jobs a week powering thousands of data... 
    Senior
    Work at office
    Local area
    Remote work
    Flexible hours

    Airbyte

    United States
    2 days ago
  • $198.03k - $239.96k

     ...s so great about working on Calendly’s Engineering team? We make things possible for our...  ...we need you? Well, we are looking for a Site Reliability Engineer who will bring creative...  ...keen eye for detail. You will report to a Senior Manager of Engineering and be responsible... 
    Senior
    Full time
    Work experience placement
    Remote work

    Calendly LLC

    United States
    2 days ago
  •  ...we imbue a lot of trust, autonomy, and accountability from Day 1. #LI-Remote Little more about the team: Honeycomb’s Site Reliability Engineering (SRE) team works at the intersection of infrastructure, developer experience, and organizational enablement. We lead... 
    Senior
    Work at office
    Local area
    Remote work
    Work from home
    Home office
    Visa sponsorship

    Honey Comb

    United States
    2 days ago
  •  ...We focus on craft, innovation, and collaboration, creating exceptional impact for e-commerce businesses worldwide. Senior Site Reliability Engineer (Machine Learning Infrastructure) We are looking for a Senior Site Reliability Engineer to help scale the AI... 
    Senior
    Work at office
    Local area
    Remote work
    Worldwide
    Home office
    Trial period
    Visa sponsorship
    Relocation package
    Flexible hours

    Photoroom

    United States
    2 days ago
  •  ...about this role, we encourage you to apply. The Role As a Senior Platform Engineer, you are a champion for DevOps and SRE culture and industry...  ...are met. What You Will Be Doing Improving production reliability and system resilience within an SRE scoped team Championing... 
    Senior
    Flexible hours

    Megaport

    Cambridge, ID
    4 days ago
  • $53.3k - $119.85k

     ...the future of work! This position As a Senior SRE at Remote, you'll work with a high degree of autonomy on complex reliability and platform problems, owning the plan and...  ...solutions and raise the technical bar of the engineers around you while collaborating closely with... 
    Senior
    Full time
    Work experience placement
    Local area
    Immediate start
    Remote work
    Home office
    Flexible hours

    Remote Services Inc.

    United States
    2 days ago
  • $113.3k - $205.52k

     ...important to maintain our strong culture, achieve our goals, and thrive as #OneJamf. What you'll do at Jamf: As a Senior Site Reliability Engineer, you'll help us balance development velocity with the reliability our customers depend on. You'll partner with... 
    Senior
    Work at office
    Remote work
    Worldwide
    Flexible hours

    Jamf

    United States
    2 days ago
  •  ...English About the role: In the Engineering team at Owkin, you will be at the heart...  ...response to ensure high availability and reliability Gather and analyze metrics from...  ...talent team on LinkedIn ~ Check our senior team on our website ~ Check the existence... 
    Senior
    Work at office
    Remote work
    Flexible hours

    Owkin

    United States
    14 hours ago
  •  ...Overview: Senior Site Reliability Engineer (SRE) Location: Chicago, IL (Onsite) Type: Contract Role Overview: We are seeking a Senior Site Reliability Engineer (SRE) with strong expertise in AWS infrastructure, automation, observability, and production... 
    Senior
    Contract work

    Purple Drive

    Chicago, IL
    3 days ago
  • $123k - $144k

     ...has its own compensation programs, benefits, and employment policies. To learn more, visit The opportunity The Senior Site Reliability Engineer establishes and maintains the infrastructure and operational systems that Thunderbird users and teams depend on every... 
    Senior
    Permanent employment
    Full time
    Local area
    Remote work

    Mozilla

    United States
    2 days ago
  •  ...Site Reliability Engineers are responsible for ensuring the availability, reliability, scalability, and performance of the firm’s most critical customer-facing microservices that power all eCommerce channels. This role applies Google-inspired SRE principles to balance... 
    Senior
    Local area
    Remote work
    Flexible hours
    Shift work

    O'Reilly Technology Services, Inc.

    Pierce, ID
    4 days ago
  •  ...Senior Site Reliability Engineer Location: West Lake, CA or Carrolton, TX (ONSITE) FTE ONLY Must Have Technical/Functional Skills ~5-7 years of professional experience in a Site Reliability, DevOps, or Systems Engineering role. ~3-5 years of hands-on experience... 
    Senior
    Permanent employment

    AceStack LLC

    Sacramento, CA
    3 days ago
  •  ...Senior Site Reliability Engineer (Sre) / Application Reliability Engineer We are looking for a highly experienced senior site reliability engineer (sre) / application reliability engineer with aws knowledge over and over 10+ years of expertise in incident management... 
    Senior

    Diverse Lynx

    Naperville, IL
    4 days ago
  •  ...converging world. Powered by nearly 90,000 talented and entrepreneurial professionals across more than 30 countries. Role---Senior Site Reliability Engineer (SRE) Location--New York - New York and Los Angles, CA We are looking for an experienced Site Reliability... 
    Senior
    Local area

    E-Solutions

    Los Angeles, CA
    5 days ago
  •  ...The Role We're looking for a Senior Site Reliability Engineer to own the reliability, scalability, and operational excellence of the production systems that power Nectar's platform. We run high-volume data ingestion pipelines and real-time AI agents on top of a fast... 
    Senior

    XRC Ventures

    Palo Alto, CA
    3 days ago
  •  ...About the job Senior Site Reliability Engineer About the Company Stellar is a decentralized, public blockchain that gives developers the tools to create experiences that are more like cash than crypto. The network is faster, cheaper, and far more energy-efficient... 
    Senior

    TechChain Talent

    New York, NY
    2 days ago
  •  ...Job Description This role focuses on site reliability engineering for cloud-hosted data science toolsets, ensuring that platforms are reliable...  ...in team skills-building and cross-training. This is a senior position. The successful candidate is a self-starter who assesses... 
    Senior

    Insight Global

    Alpharetta, GA
    2 days ago
  •  ...IXL Learning, developer of personalized learning products used by millions of people globally, is seeking a Senior Site Reliability Engineer to join our team, and help maintain the reliability and optimal performance of our products. We are seeking engineers with a passion... 
    Senior
    Work at office
    Immediate start

    IXL Learning

    Raleigh, NC
    2 days ago
  •  ...Senior Site Reliability Engineer United States About OfficeSpace: OfficeSpace Software provides the leading AI operating system for the built world, that helps teams plan, connect, and perform in the workplace. As a performance-based, PE-backed company, we hire... 
    Senior
    Shift work

    OfficeSpace Software

    Washington DC
    3 days ago
  • $181.69k - $213.75k

     ...Senior Site Reliability Engineer San Francisco, California; Santa Clara, California; Seattle, WA The Company You'll Join Carta connects founders, investors, and limited partners through world-class software, purpose-built for everyone in venture capital, private... 
    Senior
    Full time
    Work at office

    Carta

    San Francisco, CA
    1 day ago
  • $152k - $195k

     ...investors including Silver Lake Waterman, Moody’s, Sequoia Capital, GV and Riverwood Capital. About the Team: As a Senior Site Reliability Engineer, you will be a key technical leader driving the design and optimization of our Kubernetes-based infrastructure and CI/... 
    Senior

    SecurityScorecard

    Austin, TX
    2 days ago
  • $182.3k - $220k

     ...putting patients first - and that mission depends on reliable, secure, and scalable systems. As a Senior SRE on the infrastructure team, you'll sit at the...  ...infrastructure and building tools that empower our engineers to ship safely and confidently. You will work across... 
    Senior
    Local area
    Flexible hours

    Modern Fertility

    New York, NY
    4 days ago
  • $86.9k - $198k

     ...Site Reliability Engineer, Senior Opportunity: Engineering to make a system more resilient and efficient frees up time and money to build more capabilities. Whether you come from a background in network engineering, systems administration, or software development, if you... 
    Senior
    Full time
    Part time
    Local area

    Booz Allen Hamilton

    Aurora, CO
    4 days ago
  •  ...opportunity with preference to candidates located in San Diego, CA, Norfolk, VA or Charleston, SC Position Overview: The Senior Site Reliability Engineer is a technical leader responsible for architecting the reliability strategy for large-scale, distributed government... 
    Senior
    Contract work
    Remote work

    Arctiq, Inc.

    San Diego, CA
    5 days ago
  • $104.9k - $174.7k

     ...immediately hire a highly skilled and proactive Senior SRE to join our dynamic team. You will...  ...fault-tolerant systems within agreed reliability objectives, whilst enabling the fast...  ...skills. About team; This diverse team of Engineers in assisting multiple product teams as... 
    Senior
    Local area
    Immediate start
    Worldwide

    RELX

    Raleigh, NC
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior Site Reliability Engineer. Be the first to apply!