Senior Site Reliability Engineer
Replit
Replit is the agentic software creation platform that enables anyone to build applications using natural language. With millions of users worldwide, Replit is democratizing software development by removing traditional barriers to application creation.
About the role:
Join our Site Reliability Engineering team and help ensure the reliability, scalability, and performance of Replit's infrastructure that serves millions of developers worldwide. As a Site Reliability Engineer, you will bridge the gap between development and operations, implementing automation and establishing best practices that enable our platform to scale efficiently while maintaining high availability.
We are seeking SREs who are passionate about building and maintaining resilient systems at scale. Your mission will be to design and implement robust monitoring solutions, automate operational tasks, and continuously improve our infrastructure's reliability and performance.
You will:
Design and Implement Observability Solutions: Develop comprehensive monitoring and alerting systems using modern observability tools. Create dashboards and metrics that provide real-time visibility into system health and performance. Implement logging strategies that enable quick problem identification and resolution.
Drive Automation and Infrastructure as Code: Architect and implement infrastructure automation solutions using tools like Terraform, Ansible, or Pulumi. Design and maintain CI/CD pipelines that enable reliable and consistent deployments. Create self-healing systems that can automatically respond to common failure scenarios.
Establish SLOs and SLIs: Work with product and engineering teams to define and implement Service Level Objectives (SLOs) and Service Level Indicators (SLIs). Build systems to track and report on these metrics, ensuring we maintain high reliability standards while balancing innovation speed.
Incident Management and Response: Lead incident response efforts, conduct thorough post-mortems, and implement improvements to prevent recurrence. Develop and maintain runbooks for critical services. Build tools and processes that reduce Mean Time To Recovery (MTTR).
Performance Optimization: Identify and resolve performance bottlenecks across our infrastructure. Implement capacity planning strategies and optimize resource utilization. Work on reducing latency and improving system efficiency across global regions.
Required skills and experience:
4-8 years of experience in Site Reliability Engineering or similar roles (DevOps, Systems Engineering, Infrastructure Engineering)
Strong programming skills in languages commonly used for automation (Python, Go, or similar)
Deep understanding of distributed systems
Experience with container orchestration platforms (Kubernetes) and cloud-native technologies
Proven track record of implementing and maintaining monitoring/observability solutions
Strong incident management skills with experience leading incident response
Experience with infrastructure as code and configuration management tools
Bonus Points:
Experience with Google Cloud Platform (GCP) services and tools
Knowledge of modern observability platforms (Prometheus, Grafana, Datadog, etc.)
What we value:
Problem-solving mindset: Ability to approach complex operational challenges systematically and devise effective solutions
Self-directed and autonomous: Capable of working independently while collaborating effectively with cross-functional teams
Strong communication skills: Ability to explain complex technical concepts to both technical and non-technical audiences
Continuous learning: Passion for staying current with industry best practices and new technologies
Focus on automation: Strong belief in automating repetitive tasks and building self-healing systems
Full-Time Employee Benefits Include:
Competitive Salary & Equity
401(k) Program with a 4% match (US Only)
Health, Dental, Vision and Life Insurance
Short Term and Long Term Disability
Paid Parental, Medical, Caregiver Leave
Flexible Time Off (FTO) + Holidays
Commuter Benefits (In-Office Only)
Monthly Wellness Stipend
Autonomous Work Environment
In-Office Set-Up Reimbursement (In-Office Only)
Quarterly Team Gatherings
In-Office Amenities (In-Office Only)
Want to learn more about what we are up to?
Meet the Replit Agent
Replit: Make an app for that
Replit Blog
Amjad TED Talk
Interviewing + Culture at Replit
Operating Principles
Reasons not to work at Replit
To achieve our mission of making programming more accessible around the world, we need our team to be representative of the world. We welcome your unique perspective and experiences in shaping this product. We encourage people from all kinds of backgrounds to apply, including and especially candidates from underrepresented and non-traditional backgrounds.


