Site Reliability Engineer II
Backblaze External Website
About Backblaze
Backblaze is the object storage leader in the open cloud movement, fueling customer success with cloud storage built purposefully to unlock budgets, unburden administrators, and unleash innovators. Together with our partners, we’re helping customers break free from the restrictive, overpriced legacy solutions that hold them back, and blaze forward with the full power of the open cloud in their hands.Founded in 2007, we scaled the business with less than $3 million in outside funding until 2021, when we did a traditional IPO on the Nasdaq stock exchange. Today, Backblaze generates over $100m in revenue and is the leading specialized storage cloud - managing over three billion gigabytes of data storage for 500K+ customers in 175+ countries, including businesses, developers, IT professionals, and individuals.
About the Role
We are seeking a Site Reliability Engineer II (SRE II) to help ensure the stability, scalability, and reliability of our services and infrastructure. This role focuses on building automation, maintaining observability, and supporting incident response to keep customer-facing systems performing at their best. The SRE will collaborate with engineering, product, and operations teams to embed reliability practices into day-to-day development and operations while contributing to tools and processes that improve efficiency and reduce manual effort.
Key Responsibilities
Service Reliability & Operations
- Support the availability and durability of critical services across production environments.
- Monitor service health using SLIs, SLOs, and error budgets, and escalate issues when thresholds are at risk.
- Participate in on-call rotations, incident response, and post-incident reviews to drive service improvements.
- Follow established ITIL/OSS processes (incident, change, problem, and capacity management).
Automation & Tooling
- Develop automation for common operational tasks, reducing manual intervention and toil.
- Contribute to monitoring, logging, and alerting frameworks (e.g., Prometheus, Grafana, Catchpoint,ELK).
- Work with CI/CD pipelines, configuration management, and infrastructure as code tools (Terraform, Ansible, Jenkins).
- Write scripts (Bash, Python, Go, etc.) to improve system reliability and efficiency.
Collaboration
- Partner with engineering, product, and operations teams to support resilient system design and operations.
- Assist in capacity planning and disaster recovery exercises.
- Work with vendors and service providers to troubleshoot service issues and track SLA performance.
- Document systems, share learnings, and help grow a reliability-minded engineering culture.
Continuous Improvement
- Contribute to playbooks, runbooks, and operational documentation.
- Identify recurring issues and propose long-term improvements.
- Promote reliability-focused practices within development and operations teams.
Qualifications
Education & Experience
- Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience).
- 2–4 years of experience in site reliability, systems engineering, or operations.
- Exposure to large-scale, production-grade systems.
Technical Skills
- Solid Linux systems administration and troubleshooting skills.
- Familiarity with service reliability concepts - monitoring, alerting, incident response, and root cause analysis.
- Proficiency in at least one scripting language (Python, Bash, or Go).
- Understanding of containers (Kubernetes, Docker) and microservices concepts.
- Knowledge of incident response and operational best practices.
Preferred Attributes
- Experience in a SaaS, service provider, or distributed systems environment.
- Familiarity with ITIL/OSS practices and SLO/SLA’s
- Strong problem-solving skills and willingness to learn new technologies.
- Experience with cloud platforms (AWS, GCP, or Azure).
- Ability to work independently, take ownership, and drive projects from problem discovery through resolution.
At this point, we hope you're feeling excited about the job description you're reading. Even if you don't meet every requirement, we still encourage you to apply. Learning, developing, and growing are key parts of our culture. We're eager to meet people who believe in our mission and can contribute to our team in various ways. We want people to feel comfortable expressing their true selves and to come, stay, and do their best work here.
At Backblaze, we value being fair and good to our customers, partners, and employees. That’s why diversity, equity, and inclusion are at the core of our values. We are committed to fostering a workforce where all employees feel a sense of belonging regardless of race, ethnicity, nationality, gender, sexual orientation, age, religion, socio-economic status, ability, veteran status, and education. We believe that our dedication to cultivating a diverse workspace not only allows us to better serve our customers in over 175 countries, but further reinforces our commitment to doing the right thing. We are proud to be an Equal Opportunity Employer.To understand more about the data we collect and process as part of your application, please view our Backblaze Employee Privacy Notice.
$76k - $127k
...products and services that help people, businesses and governments realize their greatest potential. Title and Summary Site Reliability Engineer II Site Reliability Engineer II Who is Mastercard? At Mastercard technology, we work to connect and power an...SuggestedFull timePart timeWorldwideFlexible hours$75k - $120k
...headquarters in Denver, Colorado, and offices across the U.S., Canada, and India. Role Summary We are seeking a Site Reliability Engineer II to support the reliability, scalability, and performance of critical production services. This role contributes to the...SuggestedContract workTemporary workWork at officeWork from homeFlexible hours$130k - $145k
...your desire to team up with some of the best and brightest in technology and entertainment. The Role The Site Reliability Engineer (SRE) II is responsible for designing, implementing, and maintaining scalable and reliable systems and applications. Focus on...SuggestedFull timeLocal areaWorldwideFlexible hours$123k - $165k
...Site Reliability Engineer II Job Posting ID: 10143234 Department: Engineering Fleet – Reliability Engineering & Operational Support to backend service development teams. We build world‑class products that enable Disney, ESPN, Hulu, and other media brands to reach millions...SuggestedFull timeWorldwide$98.58k - $138.02k
...role requires a hybrid work schedule based out of one of our office locations: Austin, TX; Irvine, CA; or Akron, OH. Role Site Reliability Engineer II will be responsible for supporting, enhancing, and maintaining Restaurant365’s cloud infrastructure and applications....SuggestedWork at office- ...Under general supervision, the Site Reliability Systems Administrator II is responsible for improving system reliability and resilience. This role focuses... ...incidents. The SRE combines software and systems engineering to build and support large-scale, distributed, fault-tolerant...
$145.7k - $218.5k
...Site Reliability Engineer II United States, Aliso Viejo, CA Sony Interactive Entertainment isn't just the Best Place to Play — it's also the Best Place to Work. Sony Interactive Entertainment (SIE) is the company behind the PlayStation brand. As a subsidiary of...Work experience placementShift work- ...Site Reliability Engineer II Join the leader in providing smarter solutions for a safer world. The property technology space is growing rapidly, and Kastle Systems is leading the way. Kastle Systems is the leader in managed security, with a track record of introducing...Remote work
$93.9k - $156.5k
Site Reliability Engineer II page is loaded## Site Reliability Engineer IIlocations: Chicago - 20 S. Wackertime type: Full timeposted on: Posted Todayjob requisition id: 33998**Note: This position follows a hybrid work model, requiring 2 days per week on-site at our corporate...Work at officeLocal areaWorldwide2 days per week$95k - $171k
.... Opportunities exist to focus on GPU infrastructure, Kubernetes, and ensuring reliability for AI workloads within Akamai's serverless inference platform. As an Site Reliability Engineer II, you will be responsible for: Building and maintaining dashboards, alerts, and...Permanent employmentWork experience placementWork at officeWork from homeWorldwideFlexible hours- Our client, a leading organization in the financial services industry, is seeking a Site Reliability Engineer II to join their team. As a Site Reliability Engineer II, you will be part of the Infrastructure Support Department supporting the SRE team. The ideal candidate...Weekly pay
- ManpowerGroup Global, Inc. is seeking a Site Reliability Engineer II to join their team in Town of Norway, Wisconsin. As a part of the Infrastructure Support Department, you will design and implement critical Service Level Indicators (SLIs) and Service Level Objectives...Weekly pay
$180k - $225k
Your Impact You are a Sr. Site Reliability Engineer II who will help define how Axon builds and operates its core platforms, with a primary focus on Zero Touch, our controlled, compliant execution framework, and the identity and security foundations that sit around it....Work at officeImmediate startRemote work- Site Reliability Engineer II About the Role This role focuses on enhancing system reliability and scalability for PROS’s platform, contributing to automation and self‑service tool development. The engineer will optimize performance, monitor service reliability, implement...
$93.9k - $156.5k
...requiring 2 days per week on‑site at our corporate office 20 S Wacker... .... CME Group is seeking a SRE II to help build, operate and... ...latency performance and rock‑solid reliability to seamlessly handle the world... ...product teams and senior engineers to assist with building out observability...Work at officeLocal area2 days per week$103.5k - $150k
...together. Bring your whole self. The Role and Team The Site Reliability Engineering organization at Medallia brings together the infrastructure... ...that power a highly reliable global SaaS platform. As an SRE II, you will help operate and improve the reliability,...Temporary workWork experience placementLocal area3 days per week$95k - $171k
A leading cloud computing company seeks a Site Reliability Engineer II to join their Inference Cloud Team. The role involves building dashboards, writing automation in Python or Go, and collaborating with engineering teams to ensure AI infrastructure reliability. Candidates...Flexible hours$93.9k - $156.5k
CME Group Inc. is looking for a Site Reliability Engineer II in Chicago to assist in building, operating, and scaling systems. This role requires a keen interest in SRE and skills in Linux, programming, and problem-solving. Candidates will work with senior engineers and...$111k - $130k
QUEST DIAGNOSTICS INC is seeking a Performance II‑Epic to provide reliability engineering services through observability and performance engineering techniques. The role requires collaboration with product owners, ensuring optimal operation through monitoring system performance...Remote job$111k - $130k
Job Description As a Performance II‑Epic, your role is to provide reliability engineering services through observability and performance engineering techniques.... ...for optimizing operational efficiency. You will use Site Reliability Engineering practices to deliver a seamless...Full timePart timeWork experience placementRemote workFlexible hours$207k - $300k
Software Engineering Manager II, Site Reliability Engineering corporate_fare Google Sunnyvale, CA, USA Bachelor’s degree in Computer Science, a related field, or equivalent practical experience. 8 years of experience with software development in one or more programming...Full time$207k - $300k
Software Engineering Manager II, Site Reliability Engineering Location: Seattle, WA, USA; Sunnyvale, CA, USA; +4 more; +3 more Advanced Experience owning outcomes and decision making, solving ambiguous problems and influencing stakeholders; deep expertise in domain....Full timeTemporary work- ...generative AI and cloud-native platforms to advanced release engineering practices, our teams are redefining how financial technology... ...AI-driven solutions that accelerate development and improve reliability. Your work will directly influence how GM Financial leverages...Full timeWork at officeRemote workFlexible hours2 days per week
- ...job is responsible for partnering with engineering and technology teams to implement measures... ...teams to automate services and improve reliability and efficiency. Job expectations include... ...reliability. Position Summary: Site Reliability Engineer (SRE) focused on building...Work at officeShift workDay shift
$67 per hour
...High School Diploma or GED and eleven (11) years of related experience Or Bachelor's degree in Computer Science, Computer Engineering or a related field and seven (7) years of related experience Skills and Competencies Ability to collaborate with programmers...Immediate startRemote work$165k - $225k
...operationalize AI and run it as a true business performance engine delivering measurable value. For more, visit the Dataiku blog... ...taking a look here. How you'll make an impact: As a Site Reliability Engineer (SRE) with advanced expertise in networking and security...Work at officeFlexible hours$93.9k - $156.5k
Hybrid role , 2 days on site. Role is located in NYC with alternative location Chicago... .... Working hours: 9am‑5pm EST. Site Reliability EngineerII (Tuesday‑Saturday). CME Group... ...successful candidate will work alongside senior engineers to learn how we observe, monitor,...Local area- ABOUT THIS POSITIONWe are looking for a talented and driven Sr. Site Reliability Engineering (SRE) to support our engineering team, which manages the infrastructure and services that power our Waystar products. This role is ideal for an experienced engineer who thrives...Live outLocal areaFlexible hours
$135k - $154k
...Axon we are on a mission to Protect Life. As the APX platform engineering organization works on our CloudNet team, we build and operate... ...mission‑critical services and maintain the high quality and reliability that our customers demand. You will work closely with sovereign...- ...cybersecurity will depend on you Learn how Illumio approaches AI with integrity — view our Transparency Statement. Senior Backend Software Engineer (Python (Golang a plus)) Hybrid: 2 days in office/week in Sunnyvale, CA In this role, you will focus on the Azure Firewall...Work at office2 days per week
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Site Reliability Engineer II. Be the first to apply!
- site reliability engineering manager United States
- site reliability engineer remote United States
- lead site reliability engineer United States
- site reliability engineer sre United States
- site reliability engineer United States
- on-site clinical research associate (traveling/remote) United States
- junior website developer United States
- site merchandiser United States
- IT site lead United States
- site acquisition specialist United States


