Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Site Reliability Engineer II

Backblaze External Website

About Backblaze

Backblaze is the object storage leader in the open cloud movement, fueling customer success with cloud storage built purposefully to unlock budgets, unburden administrators, and unleash innovators. Together with our partners, we’re helping customers break free from the restrictive, overpriced legacy solutions that hold them back, and blaze forward with the full power of the open cloud in their hands.

Founded in 2007, we scaled the business with less than $3 million in outside funding until 2021, when we did a traditional IPO on the Nasdaq stock exchange. Today, Backblaze generates over $100m in revenue and is the leading specialized storage cloud - managing over three billion gigabytes of data storage for 500K+ customers in 175+ countries, including businesses, developers, IT professionals, and individuals.

About the Role

We are seeking a Site Reliability Engineer II (SRE II) to help ensure the stability, scalability, and reliability of our services and infrastructure. This role focuses on building automation, maintaining observability, and supporting incident response to keep customer-facing systems performing at their best. The SRE will collaborate with engineering, product, and operations teams to embed reliability practices into day-to-day development and operations while contributing to tools and processes that improve efficiency and reduce manual effort.

Key Responsibilities
Service Reliability & Operations
  • Support the availability and durability of critical services across production environments.
  • Monitor service health using SLIs, SLOs, and error budgets, and escalate issues when thresholds are at risk.
  • Participate in on-call rotations, incident response, and post-incident reviews to drive service improvements.
  • Follow established ITIL/OSS processes (incident, change, problem, and capacity management).
Automation & Tooling
  • Develop automation for common operational tasks, reducing manual intervention and toil.
  • Contribute to monitoring, logging, and alerting frameworks (e.g., Prometheus, Grafana, Catchpoint,ELK).
  • Work with CI/CD pipelines, configuration management, and infrastructure as code tools (Terraform, Ansible, Jenkins).
  • Write scripts (Bash, Python, Go, etc.) to improve system reliability and efficiency.
Collaboration
  • Partner with engineering, product, and operations teams to support resilient system design and operations.
  • Assist in capacity planning and disaster recovery exercises.
  • Work with vendors and service providers to troubleshoot service issues and track SLA performance.
  • Document systems, share learnings, and help grow a reliability-minded engineering culture.
Continuous Improvement
  • Contribute to playbooks, runbooks, and operational documentation.
  • Identify recurring issues and propose long-term improvements.
  • Promote reliability-focused practices within development and operations teams.
Qualifications
Education & Experience
  • Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience).
  • 2–4 years of experience in site reliability, systems engineering, or operations.
  • Exposure to large-scale, production-grade systems.
Technical Skills
  • Solid Linux systems administration and troubleshooting skills.
  • Familiarity with service reliability concepts - monitoring, alerting, incident response, and root cause analysis.
  • Proficiency in at least one scripting language (Python, Bash, or Go).
  • Understanding of containers (Kubernetes, Docker) and microservices concepts.
  • Knowledge of incident response and operational best practices.
Preferred Attributes
  • Experience in a SaaS, service provider, or distributed systems environment.
  • Familiarity with ITIL/OSS practices and SLO/SLA’s
  • Strong problem-solving skills and willingness to learn new technologies.
  • Experience with cloud platforms (AWS, GCP, or Azure).
  • Ability to work independently, take ownership, and drive projects from problem discovery through resolution. 

At this point, we hope you're feeling excited about the job description you're reading. Even if you don't meet every requirement, we still encourage you to apply. Learning, developing, and growing are key parts of our culture. We're eager to meet people who believe in our mission and can contribute to our team in various ways. We want people to feel comfortable expressing their true selves and to come, stay, and do their best work here.

At Backblaze, we value being fair and good to our customers, partners, and employees. That’s why diversity, equity, and inclusion are at the core of our values. We are committed to fostering a workforce where all employees feel a sense of belonging regardless of race, ethnicity, nationality, gender, sexual orientation, age, religion, socio-economic status, ability, veteran status, and education. We believe that our dedication to cultivating a diverse workspace not only allows us to better serve our customers in over 175 countries, but further reinforces our commitment to doing the right thing. We are proud to be an Equal Opportunity Employer.

To understand more about the data we collect and process as part of your application, please view our Backblaze Employee Privacy Notice.

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Site Reliability Engineer II in United States vacancy
  • $76k - $127k

     ...products and services that help people, businesses and governments realize their greatest potential. Title and Summary Site Reliability Engineer II Site Reliability Engineer II Who is Mastercard? At Mastercard technology, we work to connect and power an... 
    Suggested
    Full time
    Part time
    Worldwide
    Flexible hours

    Mastercard

    O Fallon, MO
    1 day ago
  • $75k - $120k

     ...headquarters in Denver, Colorado, and offices across the U.S., Canada, and India. Role Summary We are seeking a Site Reliability Engineer II to support the reliability, scalability, and performance of critical production services. This role contributes to the... 
    Suggested
    Contract work
    Temporary work
    Work at office
    Work from home
    Flexible hours

    Vertafore

    Denver, CO
    3 days ago
  • $130k - $145k

     ...your desire to team up with some of the best and brightest in technology and entertainment. The Role The Site Reliability Engineer (SRE) II is responsible for designing, implementing, and maintaining scalable and reliable systems and applications. Focus on... 
    Suggested
    Full time
    Local area
    Worldwide
    Flexible hours

    AXS

    Los Angeles, CA
    1 day ago
  • $123k - $165k

     ...Site Reliability Engineer II Job Posting ID: 10143234 Department: Engineering Fleet – Reliability Engineering & Operational Support to backend service development teams. We build world‑class products that enable Disney, ESPN, Hulu, and other media brands to reach millions... 
    Suggested
    Full time
    Worldwide

    5014 Disney Entertainment & Sports LLC

    New York, NY
    3 days ago
  • $98.58k - $138.02k

     ...role requires a hybrid work schedule based out of one of our office locations: Austin, TX; Irvine, CA; or Akron, OH. Role Site Reliability Engineer II will be responsible for supporting, enhancing, and maintaining Restaurant365’s cloud infrastructure and applications.... 
    Suggested
    Work at office

    Restaurant365

    Austin, TX
    1 day ago
  •  ...Under general supervision, the Site Reliability Systems Administrator II is responsible for improving system reliability and resilience. This role focuses...  ...incidents. The SRE combines software and systems engineering to build and support large-scale, distributed, fault-tolerant... 

    Genuine Parts Company

    Birmingham, AL
    3 days ago
  • $145.7k - $218.5k

     ...Site Reliability Engineer II United States, Aliso Viejo, CA Sony Interactive Entertainment isn't just the Best Place to Play — it's also the Best Place to Work. Sony Interactive Entertainment (SIE) is the company behind the PlayStation brand. As a subsidiary of... 
    Work experience placement
    Shift work

    Sony Interactive Entertainment

    Aliso Viejo, CA
    1 day ago
  •  ...Site Reliability Engineer II Join the leader in providing smarter solutions for a safer world. The property technology space is growing rapidly, and Kastle Systems is leading the way. Kastle Systems is the leader in managed security, with a track record of introducing... 
    Remote work

    Kastle Systems

    Falls Church, VA
    4 days ago
  • $93.9k - $156.5k

    Site Reliability Engineer II page is loaded## Site Reliability Engineer IIlocations: Chicago - 20 S. Wackertime type: Full timeposted on: Posted Todayjob requisition id: 33998**Note: This position follows a hybrid work model, requiring 2 days per week on-site at our corporate... 
    Work at office
    Local area
    Worldwide
    2 days per week

    CME Group Inc.

    Chicago, IL
    1 day ago
  • $95k - $171k

     .... Opportunities exist to focus on GPU infrastructure, Kubernetes, and ensuring reliability for AI workloads within Akamai's serverless inference platform. As an Site Reliability Engineer II, you will be responsible for: Building and maintaining dashboards, alerts, and... 
    Permanent employment
    Work experience placement
    Work at office
    Work from home
    Worldwide
    Flexible hours

    Akamai Technologies

    Cambridge, MA
    4 days ago
  • Our client, a leading organization in the financial services industry, is seeking a Site Reliability Engineer II to join their team. As a Site Reliability Engineer II, you will be part of the Infrastructure Support Department supporting the SRE team. The ideal candidate... 
    Weekly pay

    ManpowerGroup Global, Inc.

    Plano, TX
    5 days ago
  • ManpowerGroup Global, Inc. is seeking a Site Reliability Engineer II to join their team in Town of Norway, Wisconsin. As a part of the Infrastructure Support Department, you will design and implement critical Service Level Indicators (SLIs) and Service Level Objectives... 
    Weekly pay

    ManpowerGroup Global, Inc.

    Plano, TX
    5 days ago
  • $180k - $225k

    Your Impact You are a Sr. Site Reliability Engineer II who will help define how Axon builds and operates its core platforms, with a primary focus on Zero Touch, our controlled, compliant execution framework, and the identity and security foundations that sit around it.... 
    Work at office
    Immediate start
    Remote work

    Koitecc Solutions

    Boston, MA
    5 days ago
  • Site Reliability Engineer II About the Role This role focuses on enhancing system reliability and scalability for PROS’s platform, contributing to automation and self‑service tool development. The engineer will optimize performance, monitor service reliability, implement... 

    PROS Holdings, Inc.

    Nashville, TN
    3 days ago
  • $93.9k - $156.5k

     ...requiring 2 days per week on‑site at our corporate office 20 S Wacker...  .... CME Group is seeking a SRE II to help build, operate and...  ...latency performance and rock‑solid reliability to seamlessly handle the world...  ...product teams and senior engineers to assist with building out observability... 
    Work at office
    Local area
    2 days per week

    CME Chicago Mercantile Exchange Inc.

    Chicago, IL
    1 day ago
  • $103.5k - $150k

     ...together. Bring your whole self. The Role and Team The Site Reliability Engineering organization at Medallia brings together the infrastructure...  ...that power a highly reliable global SaaS platform. As an SRE II, you will help operate and improve the reliability,... 
    Temporary work
    Work experience placement
    Local area
    3 days per week

    Medallia

    Mc Lean, VA
    2 days ago
  • $95k - $171k

    A leading cloud computing company seeks a Site Reliability Engineer II to join their Inference Cloud Team. The role involves building dashboards, writing automation in Python or Go, and collaborating with engineering teams to ensure AI infrastructure reliability. Candidates... 
    Flexible hours

    Akamai Technologies

    Cambridge, MA
    5 days ago
  • $93.9k - $156.5k

    CME Group Inc. is looking for a Site Reliability Engineer II in Chicago to assist in building, operating, and scaling systems. This role requires a keen interest in SRE and skills in Linux, programming, and problem-solving. Candidates will work with senior engineers and... 

    CME Group Inc.

    Chicago, IL
    3 days ago
  • $111k - $130k

    QUEST DIAGNOSTICS INC is seeking a Performance II‑Epic to provide reliability engineering services through observability and performance engineering techniques. The role requires collaboration with product owners, ensuring optimal operation through monitoring system performance... 
    Remote job

    QUEST DIAGNOSTICS INC

    Secaucus, NJ
    4 days ago
  • $111k - $130k

    Job Description As a Performance II‑Epic, your role is to provide reliability engineering services through observability and performance engineering techniques....  ...for optimizing operational efficiency. You will use Site Reliability Engineering practices to deliver a seamless... 
    Full time
    Part time
    Work experience placement
    Remote work
    Flexible hours

    QUEST DIAGNOSTICS INC

    Secaucus, NJ
    4 days ago
  • $207k - $300k

    Software Engineering Manager II, Site Reliability Engineering corporate_fare Google Sunnyvale, CA, USA Bachelor’s degree in Computer Science, a related field, or equivalent practical experience. 8 years of experience with software development in one or more programming... 
    Full time

    Google Inc.

    Sunnyvale, CA
    3 days ago
  • $207k - $300k

    Software Engineering Manager II, Site Reliability Engineering Location: Seattle, WA, USA; Sunnyvale, CA, USA; +4 more; +3 more Advanced Experience owning outcomes and decision making, solving ambiguous problems and influencing stakeholders; deep expertise in domain.... 
    Full time
    Temporary work

    Google Inc.

    Durham, NC
    2 days ago
  •  ...generative AI and cloud-native platforms to advanced release engineering practices, our teams are redefining how financial technology...  ...AI-driven solutions that accelerate development and improve reliability. Your work will directly influence how GM Financial leverages... 
    Full time
    Work at office
    Remote work
    Flexible hours
    2 days per week

    GMAC Financial Services

    Irving, TX
    2 days ago
  •  ...job is responsible for partnering with engineering and technology teams to implement measures...  ...teams to automate services and improve reliability and efficiency. Job expectations include...  ...reliability. Position Summary: Site Reliability Engineer (SRE) focused on building... 
    Work at office
    Shift work
    Day shift

    Bank of America

    Charlotte, NC
    4 days ago
  • $67 per hour

     ...High School Diploma or GED and eleven (11) years of related experience Or Bachelor's degree in Computer Science, Computer Engineering or a related field and seven (7) years of related experience Skills and Competencies Ability to collaborate with programmers... 
    Immediate start
    Remote work

    United IT Solutions

    Pampa, TX
    4 days ago
  • $165k - $225k

     ...operationalize AI and run it as a true business performance engine delivering measurable value. For more, visit the Dataiku blog...  ...taking a look here. How you'll make an impact: As a Site Reliability Engineer (SRE) with advanced expertise in networking and security... 
    Work at office
    Flexible hours

    Dataiku

    New York, NY
    2 days ago
  • $93.9k - $156.5k

    Hybrid role , 2 days on site. Role is located in NYC with alternative location Chicago...  .... Working hours: 9am‑5pm EST. Site Reliability EngineerII (Tuesday‑Saturday). CME Group...  ...successful candidate will work alongside senior engineers to learn how we observe, monitor,... 
    Local area

    CME Chicago Mercantile Exchange Inc.

    New York, NY
    1 day ago
  • ABOUT THIS POSITIONWe are looking for a talented and driven Sr. Site Reliability Engineering (SRE) to support our engineering team, which manages the infrastructure and services that power our Waystar products. This role is ideal for an experienced engineer who thrives... 
    Live out
    Local area
    Flexible hours

    Waystar, Inc

    Louisville, KY
    5 days ago
  • $135k - $154k

     ...Axon we are on a mission to Protect Life. As the APX platform engineering organization works on our CloudNet team, we build and operate...  ...mission‑critical services and maintain the high quality and reliability that our customers demand. You will work closely with sovereign... 

    Accreditation Council for Graduate Medical Education

    Seattle, WA
    3 days ago
  •  ...cybersecurity will depend on you Learn how Illumio approaches AI with integrity — view our Transparency Statement. Senior Backend Software Engineer (Python (Golang a plus)) Hybrid: 2 days in office/week in Sunnyvale, CA In this role, you will focus on the Azure Firewall... 
    Work at office
    2 days per week

    Illumio

    Los Angeles, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Site Reliability Engineer II. Be the first to apply!