Site Reliability Engineer II

Backblaze External Website

About Backblaze

Backblaze is the object storage leader in the open cloud movement, fueling customer success with cloud storage built purposefully to unlock budgets, unburden administrators, and unleash innovators. Together with our partners, we’re helping customers break free from the restrictive, overpriced legacy solutions that hold them back, and blaze forward with the full power of the open cloud in their hands.

Founded in 2007, we scaled the business with less than $3 million in outside funding until 2021, when we did a traditional IPO on the Nasdaq stock exchange. Today, Backblaze generates over $100m in revenue and is the leading specialized storage cloud - managing over three billion gigabytes of data storage for 500K+ customers in 175+ countries, including businesses, developers, IT professionals, and individuals.

About the Role

We are seeking a Site Reliability Engineer II (SRE II) to help ensure the stability, scalability, and reliability of our services and infrastructure. This role focuses on building automation, maintaining observability, and supporting incident response to keep customer-facing systems performing at their best. The SRE will collaborate with engineering, product, and operations teams to embed reliability practices into day-to-day development and operations while contributing to tools and processes that improve efficiency and reduce manual effort.

Key Responsibilities

Service Reliability & Operations

Support the availability and durability of critical services across production environments.
Monitor service health using SLIs, SLOs, and error budgets, and escalate issues when thresholds are at risk.
Participate in on-call rotations, incident response, and post-incident reviews to drive service improvements.
Follow established ITIL/OSS processes (incident, change, problem, and capacity management).

Automation & Tooling

Develop automation for common operational tasks, reducing manual intervention and toil.
Contribute to monitoring, logging, and alerting frameworks (e.g., Prometheus, Grafana, Catchpoint,ELK).
Work with CI/CD pipelines, configuration management, and infrastructure as code tools (Terraform, Ansible, Jenkins).
Write scripts (Bash, Python, Go, etc.) to improve system reliability and efficiency.

Collaboration

Partner with engineering, product, and operations teams to support resilient system design and operations.
Assist in capacity planning and disaster recovery exercises.
Work with vendors and service providers to troubleshoot service issues and track SLA performance.
Document systems, share learnings, and help grow a reliability-minded engineering culture.

Continuous Improvement

Contribute to playbooks, runbooks, and operational documentation.
Identify recurring issues and propose long-term improvements.
Promote reliability-focused practices within development and operations teams.

Qualifications

Education & Experience

Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience).
2–4 years of experience in site reliability, systems engineering, or operations.
Exposure to large-scale, production-grade systems.

Technical Skills

Solid Linux systems administration and troubleshooting skills.
Familiarity with service reliability concepts - monitoring, alerting, incident response, and root cause analysis.
Proficiency in at least one scripting language (Python, Bash, or Go).
Understanding of containers (Kubernetes, Docker) and microservices concepts.
Knowledge of incident response and operational best practices.

Preferred Attributes

Experience in a SaaS, service provider, or distributed systems environment.
Familiarity with ITIL/OSS practices and SLO/SLA’s
Strong problem-solving skills and willingness to learn new technologies.
Experience with cloud platforms (AWS, GCP, or Azure).
Ability to work independently, take ownership, and drive projects from problem discovery through resolution.

At this point, we hope you're feeling excited about the job description you're reading. Even if you don't meet every requirement, we still encourage you to apply. Learning, developing, and growing are key parts of our culture. We're eager to meet people who believe in our mission and can contribute to our team in various ways. We want people to feel comfortable expressing their true selves and to come, stay, and do their best work here.

At Backblaze, we value being fair and good to our customers, partners, and employees. That’s why diversity, equity, and inclusion are at the core of our values. We are committed to fostering a workforce where all employees feel a sense of belonging regardless of race, ethnicity, nationality, gender, sexual orientation, age, religion, socio-economic status, ability, veteran status, and education. We believe that our dedication to cultivating a diverse workspace not only allows us to better serve our customers in over 175 countries, but further reinforces our commitment to doing the right thing. We are proud to be an Equal Opportunity Employer.

To understand more about the data we collect and process as part of your application, please view our Backblaze Employee Privacy Notice.

Apply

Vacancy posted 2 days ago

Similar jobs that could be interesting for youBased on the Site Reliability Engineer II in United States vacancy

Site Reliability Engineer II
$76k - $127k
...products and services that help people, businesses and governments realize their greatest potential. Title and Summary Site Reliability Engineer II Site Reliability Engineer II Who is Mastercard? At Mastercard technology, we work to connect and power an...
Suggested
Full time
Part time
Worldwide
Flexible hours
Mastercard
O Fallon, MO
1 day ago
Site Reliability Engineer II
$75k - $120k
...headquarters in Denver, Colorado, and offices across the U.S., Canada, and India. Role Summary We are seeking a Site Reliability Engineer II to support the reliability, scalability, and performance of critical production services. This role contributes to the...
Suggested
Contract work
Temporary work
Work at office
Work from home
Flexible hours
Vertafore
Denver, CO
3 days ago
Site Reliability Engineer II
$130k - $145k
...your desire to team up with some of the best and brightest in technology and entertainment. The Role The Site Reliability Engineer (SRE) II is responsible for designing, implementing, and maintaining scalable and reliable systems and applications. Focus on...
Suggested
Full time
Local area
Worldwide
Flexible hours
AXS
Los Angeles, CA
1 day ago
Site Reliability Engineer II
$123k - $165k
...Site Reliability Engineer II Job Posting ID: 10143234 Department: Engineering Fleet – Reliability Engineering & Operational Support to backend service development teams. We build world‑class products that enable Disney, ESPN, Hulu, and other media brands to reach millions...
Suggested
Full time
Worldwide
5014 Disney Entertainment & Sports LLC
New York, NY
3 days ago
Site Reliability Engineer II
$98.58k - $138.02k
...role requires a hybrid work schedule based out of one of our office locations: Austin, TX; Irvine, CA; or Akron, OH. Role Site Reliability Engineer II will be responsible for supporting, enhancing, and maintaining Restaurant365’s cloud infrastructure and applications....
Suggested
Work at office
Restaurant365
Austin, TX
1 day ago
Site Reliability Engineer II
...Under general supervision, the Site Reliability Systems Administrator II is responsible for improving system reliability and resilience. This role focuses... ...incidents. The SRE combines software and systems engineering to build and support large-scale, distributed, fault-tolerant...
Genuine Parts Company
Birmingham, AL
3 days ago
Site Reliability Engineer II
$145.7k - $218.5k
...Site Reliability Engineer II United States, Aliso Viejo, CA Sony Interactive Entertainment isn't just the Best Place to Play — it's also the Best Place to Work. Sony Interactive Entertainment (SIE) is the company behind the PlayStation brand. As a subsidiary of...
Work experience placement
Shift work
Sony Interactive Entertainment
Aliso Viejo, CA
1 day ago
Site Reliability Engineer II
...Site Reliability Engineer II Join the leader in providing smarter solutions for a safer world. The property technology space is growing rapidly, and Kastle Systems is leading the way. Kastle Systems is the leader in managed security, with a track record of introducing...
Remote work
Kastle Systems
Falls Church, VA
4 days ago
Site Reliability Engineer II
$93.9k - $156.5k
Site Reliability Engineer II page is loaded## Site Reliability Engineer IIlocations: Chicago - 20 S. Wackertime type: Full timeposted on: Posted Todayjob requisition id: 33998**Note: This position follows a hybrid work model, requiring 2 days per week on-site at our corporate...
Work at office
Local area
Worldwide
2 days per week
CME Group Inc.
Chicago, IL
1 day ago
Site Reliability Engineer II
$95k - $171k
.... Opportunities exist to focus on GPU infrastructure, Kubernetes, and ensuring reliability for AI workloads within Akamai's serverless inference platform. As an Site Reliability Engineer II, you will be responsible for: Building and maintaining dashboards, alerts, and...
Permanent employment
Work experience placement
Work at office
Work from home
Worldwide
Flexible hours
Akamai Technologies
Cambridge, MA
4 days ago
Site Reliability Engineer II
Our client, a leading organization in the financial services industry, is seeking a Site Reliability Engineer II to join their team. As a Site Reliability Engineer II, you will be part of the Infrastructure Support Department supporting the SRE team. The ideal candidate...
Weekly pay
ManpowerGroup Global, Inc.
Plano, TX
5 days ago
Site Reliability Engineer II - Build Reliability & SLOs
ManpowerGroup Global, Inc. is seeking a Site Reliability Engineer II to join their team in Town of Norway, Wisconsin. As a part of the Infrastructure Support Department, you will design and implement critical Service Level Indicators (SLIs) and Service Level Objectives...
Weekly pay
ManpowerGroup Global, Inc.
Plano, TX
5 days ago
Sr. Site Reliability Engineer II
$180k - $225k
Your Impact You are a Sr. Site Reliability Engineer II who will help define how Axon builds and operates its core platforms, with a primary focus on Zero Touch, our controlled, compliant execution framework, and the identity and security foundations that sit around it....
Work at office
Immediate start
Remote work
Koitecc Solutions
Boston, MA
5 days ago
Site Reliability Engineer II
Site Reliability Engineer II About the Role This role focuses on enhancing system reliability and scalability for PROS’s platform, contributing to automation and self‑service tool development. The engineer will optimize performance, monitor service reliability, implement...
PROS Holdings, Inc.
Nashville, TN
3 days ago
Site Reliability Engineer II
$93.9k - $156.5k
...requiring 2 days per week on‑site at our corporate office 20 S Wacker... .... CME Group is seeking a SRE II to help build, operate and... ...latency performance and rock‑solid reliability to seamlessly handle the world... ...product teams and senior engineers to assist with building out observability...
Work at office
Local area
2 days per week
CME Chicago Mercantile Exchange Inc.
Chicago, IL
1 day ago
Site Reliability Engineer II
$103.5k - $150k
...together. Bring your whole self. The Role and Team The Site Reliability Engineering organization at Medallia brings together the infrastructure... ...that power a highly reliable global SaaS platform. As an SRE II, you will help operate and improve the reliability,...
Temporary work
Work experience placement
Local area
3 days per week
Medallia
Mc Lean, VA
2 days ago
Site Reliability Engineer II - Build Resilient Systems
$95k - $171k
A leading cloud computing company seeks a Site Reliability Engineer II to join their Inference Cloud Team. The role involves building dashboards, writing automation in Python or Go, and collaborating with engineering teams to ensure AI infrastructure reliability. Candidates...
Flexible hours
Akamai Technologies
Cambridge, MA
5 days ago
Site Reliability Engineer II — Low-Latency Trading SRE
$93.9k - $156.5k
CME Group Inc. is looking for a Site Reliability Engineer II in Chicago to assist in building, operating, and scaling systems. This role requires a keen interest in SRE and skills in Linux, programming, and problem-solving. Candidates will work with senior engineers and...
CME Group Inc.
Chicago, IL
3 days ago
Remote Site Reliability Engineer II - Observability
$111k - $130k
QUEST DIAGNOSTICS INC is seeking a Performance II‑Epic to provide reliability engineering services through observability and performance engineering techniques. The role requires collaboration with product owners, ensuring optimal operation through monitoring system performance...
Remote job
QUEST DIAGNOSTICS INC
Secaucus, NJ
4 days ago
Epic Site Reliability Engineer II
$111k - $130k
Job Description As a Performance II‑Epic, your role is to provide reliability engineering services through observability and performance engineering techniques.... ...for optimizing operational efficiency. You will use Site Reliability Engineering practices to deliver a seamless...
Full time
Part time
Work experience placement
Remote work
Flexible hours
QUEST DIAGNOSTICS INC
Secaucus, NJ
4 days ago
Software Engineering Manager II, Site Reliability Engineering
$207k - $300k
Software Engineering Manager II, Site Reliability Engineering corporate_fare Google Sunnyvale, CA, USA Bachelor’s degree in Computer Science, a related field, or equivalent practical experience. 8 years of experience with software development in one or more programming...
Full time
Google Inc.
Sunnyvale, CA
3 days ago
Software Engineering Manager II, Site Reliability Engineering
$207k - $300k
Software Engineering Manager II, Site Reliability Engineering Location: Seattle, WA, USA; Sunnyvale, CA, USA; +4 more; +3 more Advanced Experience owning outcomes and decision making, solving ambiguous problems and influencing stakeholders; deep expertise in domain....
Full time
Temporary work
Google Inc.
Durham, NC
2 days ago
Site Reliability Engineer II
...generative AI and cloud-native platforms to advanced release engineering practices, our teams are redefining how financial technology... ...AI-driven solutions that accelerate development and improve reliability. Your work will directly influence how GM Financial leverages...
Full time
Work at office
Remote work
Flexible hours
2 days per week
GMAC Financial Services
Irving, TX
2 days ago
Site Reliability Engineer II
...job is responsible for partnering with engineering and technology teams to implement measures... ...teams to automate services and improve reliability and efficiency. Job expectations include... ...reliability. Position Summary: Site Reliability Engineer (SRE) focused on building...
Work at office
Shift work
Day shift
Bank of America
Charlotte, NC
4 days ago
Site Reliability Engineer II
$67 per hour
...High School Diploma or GED and eleven (11) years of related experience Or Bachelor's degree in Computer Science, Computer Engineering or a related field and seven (7) years of related experience Skills and Competencies Ability to collaborate with programmers...
Immediate start
Remote work
United IT Solutions
Pampa, TX
4 days ago
Site Reliability Engineer II
$165k - $225k
...operationalize AI and run it as a true business performance engine delivering measurable value. For more, visit the Dataiku blog... ...taking a look here. How you'll make an impact: As a Site Reliability Engineer (SRE) with advanced expertise in networking and security...
Work at office
Flexible hours
Dataiku
New York, NY
2 days ago
Site Reliability Engineer II
$93.9k - $156.5k
Hybrid role , 2 days on site. Role is located in NYC with alternative location Chicago... .... Working hours: 9am‑5pm EST. Site Reliability EngineerII (Tuesday‑Saturday). CME Group... ...successful candidate will work alongside senior engineers to learn how we observe, monitor,...
Local area
CME Chicago Mercantile Exchange Inc.
New York, NY
1 day ago
Senior Site Reliability Engineer II
ABOUT THIS POSITIONWe are looking for a talented and driven Sr. Site Reliability Engineering (SRE) to support our engineering team, which manages the infrastructure and services that power our Waystar products. This role is ideal for an experienced engineer who thrives...
Live out
Local area
Flexible hours
Waystar, Inc
Louisville, KY
5 days ago
Site Reliability Engineer II
$135k - $154k
...Axon we are on a mission to Protect Life. As the APX platform engineering organization works on our CloudNet team, we build and operate... ...mission‑critical services and maintain the high quality and reliability that our customers demand. You will work closely with sovereign...
Accreditation Council for Graduate Medical Education
Seattle, WA
3 days ago
Site Reliability Engineer II
...cybersecurity will depend on you Learn how Illumio approaches AI with integrity — view our Transparency Statement. Senior Backend Software Engineer (Python (Golang a plus)) Hybrid: 2 days in office/week in Sunnyvale, CA In this role, you will focus on the Azure Firewall...
Work at office
2 days per week
Illumio
Los Angeles, CA
2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Site Reliability Engineer II. Be the first to apply!