Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Site Reliability Engineer

HostPapa

Position Summary With team members and customers in 39 countries around the globe, HostPapa is currently one of the fastest-growing web hosting companies with a wide range of products available. At its core, we provide individuals and small and medium-sized businesses with access to valuable tools and services critical to their online success, including a Website Builder service for making website creation an ultra-easy task for anyone. Tailored to meet every user's unique needs, our award-winning customer support, email, and cloud-based solutions keep HostPapa at the cutting edge of the web hosting industry and innovation by putting our customers first. This role focuses on CloudBlue, a HostPapa business that powers cloud commerce for many of the world’s largest service providers, including major Telcos, distributors, and MSPs. CloudBlue enables partners to monetize and manage cloud services and subscriptions at scale, combining the agility of a high-growth business with the backing of a global organization. As the Site Reliability Engineer, you will help ensure the reliability, scalability, and observability of CloudBlue’s multi-tenant SaaS platforms used by service providers worldwide. You will focus on improving system stability and performance through monitoring, high availability, and incident response, while working closely with DevOps, Platform, and Engineering teams to build and operate resilient production systems. What you’ll do Define and implement SLIs, SLOs, and error budgets for critical CloudBlue services to ensure reliability and performance Influence system architecture with a strong focus on reliability, scalability, and operability, designing systems for fault tolerance, graceful degradation, and self-healing Reduce operational toil by identifying opportunities for automation and process improvement Design and operate CloudBlue’s observability stack across metrics, logs, and traces using tools such as Datadog, Grafana, and Elastic Stack Develop actionable alerting strategies and dashboards that provide clear insight into platform and business health Design and maintain high-availability architectures, implementing redundancy, failover, and disaster recovery strategies across regions and availability zones Conduct capacity planning, load testing, and performance optimization to ensure platform stability and scalability Act as a senior responder during production incidents, leading incident coordination, communication, and service restoration Own blameless postmortems and drive improvements that reduce incident frequency, MTTR, and customer impact Improve reliability of Kubernetes-based platforms through health checks, autoscaling strategies, rollout safety, and resilience testing Partner with engineering and DevOps teams to improve deployment safety, rollback strategies, and platform reliability Maintain runbooks and operational documentation, and promote SRE best practices across engineering teams Support other tasks or projects as assigned to meet team and business needs About you 3+ years of experience as an SRE, DevOps Engineer, or Production Engineer, with strong ownership of production systems Proven experience operating highly available, enterprise-grade, multi-tenant SaaS platforms Hands-on experience with observability and monitoring tools such as Datadog, Grafana, and Elasticsearch/Kibana Solid understanding of Linux, networking, and distributed systems fundamentals Experience working with containerized environments such as Docker and Kubernetes Strong scripting and automation skills using Python and/or Bash Experience participating in on-call rotations and incident response in production environments Strong written and spoken English Experience defining SLIs/SLOs and managing error budgets at scale will be considered a plus Exposure to hyperscale or service-provider-grade platforms is an advantage Cloud experience, preferably with Azure; experience with AWS and/or GCP will also be valued Experience working with hybrid or on-premises integrations is beneficial Familiarity with chaos engineering and resilience testing will be considered an asset What We Offer Work from anywhere - this is a remote opportunity A competitive salary that values you and your unique skill sets Career advancement & professional development opportunities to help you reach your full potential Flexible work arrangements to support work/life balance About Us At HostPapa, we’ve been committed to providing a complete array of enterprise-grade cloud services solutions to every business owner since 2006. These services, traditionally out of reach to smaller businesses, are offered in a one-stop shop, making it quick and easy for customers to select the services they need to grow. We back these offerings with 24/7 award‑winning customer support in four languages. Our HostPapa team values diversity and inclusion. We have a friendly company culture built on trust and respect. With the acquisition of several companies into our product portfolio, we’re growing at an incredible rate and have ample opportunities for career growth. Come join our talented team of enthusiastic, hard-working, passionate, driven people engaged in meaningful, innovative work. We can’t wait to meet you! HostPapa is an equal-opportunity employer committed to diversity and inclusion. As a multicultural organization, we encourage individual achievement and recognize the strength of our diverse team. HostPapa is committed to providing accommodations for people with disabilities. If you require accommodation, please let us know, and we will work with you to meet your needs. Accommodation may be provided in all parts of the hiring process. It is anticipated that this position will be performed outside of Ontario. #J-18808-Ljbffr

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Site Reliability Engineer in New York, NY vacancy
  •  ...New York, United States | Posted on 11/13/2025 Title: Senior Site Reliability Engineer (SRE) Location: Remote AboutJanuary AtJanuary, we’re transforming the lives of borrowers by bringing humanity to consumer finance. Our data-driven products empower financial institutions... 
    Suggested
    Remote work

    Govserviceshub

    New York, NY
    3 days ago
  •  ...Job Summary Minio is seeking a Remote Site Reliability Engineer to enhance the performance and reliability of its cloud-native storage solutions. In this role, you will be responsible for monitoring systems, troubleshooting incidents, and implementing automation to improve... 
    Suggested
    Remote work
    Flexible hours

    DevOpsChat

    New York, NY
    1 day ago
  •  ...remote role, we will consider applicants based in LATAM. Our Engineering team is having a blast while delivering the most...  ...engineers building and maintaining Kraken's infrastructure. As a Site Reliability Engineer, you will keep one of the fastest growing companies... 
    Suggested
    Local area
    Remote work

    Framework Ventures

    New York, NY
    1 day ago
  • $165k - $235k

     ...and the SDF team is expanding to support the rapidly growing and changing Stellar ecosystem. SDF is looking for a Senior Site Reliability Engineer to help build and operate the foundation that powers our engineering teams. You’ll ensure the reliability and scalability... 
    Suggested
    Temporary work
    Work at office
    Worldwide
    Flexible hours

    Crypto Pro Network

    New York, NY
    1 day ago
  • $150k - $170k

     ...Senior Site Reliability Engineer – Zip Co Join to apply for the Senior Site Reliability Engineer role at Zip Co At Zip, we build cloud‑native software applications that serve millions of customers and process billions of dollars in payments. We’re looking for a seasoned... 
    Suggested
    Casual work
    Work at office
    Remote work
    Flexible hours

    ZIP

    New York, NY
    3 days ago
  •  ...It's designed so Stellar's ecosystem can make a real-world, lasting impact. About the Role SDF is looking for a Senior Site Reliability Engineer to help build and operate the foundation that powers our engineering teams. You'll ensure the reliability and scalability... 

    TechChain Talent

    New York, NY
    5 hours ago
  •  ...hatch I.T. is partnering with CardioOne to find a Site Reliability Engineer (SRE) to join their team. See deteails below: About the Role: CardioOne is seeking a highly skilled Site Reliability Engineer (SRE) to ensure the reliability, scalability, security, and performance... 
    Full time

    Hatchit Co

    New York, NY
    1 day ago
  • $148.32k - $185.4k

     ...professionals, we’re proud of where we’ve been and even more excited about where we’re going. We’re looking for a senior Site Reliability Engineer to join our small, high-ownership SRE team. In this hands-on individual contributor role, you\'ll own the reliability, scalability... 
    Remote work
    Flexible hours

    AbsenceSoft

    New York, NY
    1 day ago
  • $7.5k

     ...and benefits packages, technology talks by our experts, a beautiful modern office, daily catered lunches, and more. As a Site Reliability Engineer (SRE), you will work at the intersection of production operations and software development as you improve, manage, and monitor... 
    Work at office
    Local area

    The Voleon Group

    New York, NY
    1 day ago
  • $182.3k - $220k

     ...patients first - and that mission depends on reliable, secure, and scalable systems. As a...  ...infrastructure and building tools that empower our engineers to ship safely and confidently....  ...the year (i.e., during team on-sites).   At Ro, we believe that our diverse... 
    Local area
    Flexible hours

    Ro

    New York, NY
    5 hours ago
  • $150k - $200k

     ...Join to apply for the Senior Site Reliability Engineer role at Gradle Inc. Develocity is a first‑of‑its‑kind toolchain observability and acceleration platform that helps software teams adopt and improve DORA capabilities (including continuous delivery) in order to achieve... 
    Full time
    Local area
    Remote work
    Work from home

    Gradle Inc.

    New York, NY
    1 day ago
  •  ...Senior Site Reliability Engineer – Azure Cloud Join to apply for the Senior Site Reliability Engineer role at Concord Technologies Concord Technologies is growing! Currently seeking a full‑time Senior Site Reliability Engineer (Sr. SRE) , with experience engineering solutions... 
    Full time
    Local area
    Immediate start
    Remote work
    Flexible hours

    Concord Technologies

    New York, NY
    1 day ago
  • $185k - $227k

     ...united by this common purpose and we are hiring the world’s best engineers, scientists, designers, product managers, operations experts...  ...on for more details. ROLE AND RESPONSIBILITIES A Senior Site Reliability Engineer (SRE) is expected to own the operational stability... 
    Remote work

    JUUL Labs

    New York, NY
    1 day ago
  • $150k - $200k

     ...parts of eye care and continue shaping the future of practice management. About the Role We are looking for a seasoned Senior Site Reliability Engineer to join our dynamic team in a foundational role, owning reliability and infrastructure as our first SRE. This role will... 
    Work experience placement
    Remote work

    Barti

    New York, NY
    1 day ago
  •  ...public cloud platform from scratch? Would you like to own critical services in a new public cloud platform? Join our IaaS Site Reliability Engineering (SRE) team. We design, develop, and operate infrastructure and services that power the backbone of our cloud platform.... 
    Work at office
    Remote work

    Akamai

    New York, NY
    1 day ago
  • $153k - $190k

     ...interconnected health network and we want you to join us to change healthcare for the better! Job Description As a Senior Site Reliability Engineer you will be tasked with making sure we build a reliable, secure and efficient platform for the b. Well network. You will be... 
    Full time
    Contract work
    Live in
    Remote work

    b.well Connected Health

    New York, NY
    1 day ago
  •  ...Overview Discover exciting DevOps job opportunities and connect with 28,396 DevOps professionals. Responsibilities The Site Reliability Engineer (SRE) role involves ensuring the reliability, availability, and performance of core services. Successful candidates will collaborate... 
    Remote work

    DevOpsChat

    New York, NY
    1 day ago
  •  ...We’re on the lookout for a Site Reliability Engineer ! 45-65K EUR | Full Remote (Latam) | Series A startup backed by top US VCs. At Agentero we believe in simple and smart solutions for complex problems. We are building cutting‑edge technology to help insurance agents... 
    Remote work
    Home office
    Night shift

    Agentero

    New York, NY
    1 day ago
  •  ...enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world’s...  ...company is founder‑led, profitable, and growing. We are hiring a Site Reliability / Gitops Engineer to our Information Systems (IS) team. This... 
    Work at office
    Remote work
    Work from home
    Flexible hours

    Canonical Group Ltd

    New York, NY
    1 day ago
  • $65 - $75 per hour

     ...virtualization technologies. Knowledge of ITIL frameworks, Jira, Confluence, and IT Service Management tools. Description: As an Engineer 2, you will collaborate with management, departments, and customers to identify end-user requirements for infrastructure monitoring... 
    Contract work
    Remote work

    SBS Creatix

    New York, NY
    1 day ago
  •  ...Senior Site Reliability Engineer (Tax free & based in GCC) The ambition is to create a global leader in space – driving innovation globally for a better world, while transforming and inspiring Saudi society. Much attention has turned to the space sector in recent years... 
    Local area

    Firstaff Personnel Consultants Ltd

    New York, NY
    1 day ago
  • $133.11k - $148.04k

     ...As a Site Reliability Engineer at Weedmaps you will work cross‑departmentally with your partners on the application, infrastructure and quality teams to enhance the performance, reliability, resilience and scalability of the web services that make up Weedmaps.com. We are... 

    Weedmaps

    New York, NY
    2 days ago
  • $127k - $249k

     ...The Team Platform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational...  ...fleet, alongside the critical components that ensure cluster reliability and security (e.g., CoreDNS, cert-manager, and Gatekeeper). As... 
    Work at office
    Local area
    Remote work
    Worldwide
    Flexible hours

    MongoDB

    New York, NY
    1 day ago
  •  ...A leading technology firm is seeking a Sr. Site Reliability Engineer in the United States. The ideal candidate will enhance system reliability and stability and should possess over 8 years of relevant experience in site reliability engineering. The position covers cloud... 

    Jobgether

    New York, NY
    1 day ago
  • $70 per hour

     ...resolve system failures in real time. Build and manage resilient systems for stability and performance optimization. Collaborate with engineering teams to improve CI/CD pipelines and automation. Manage filesystem structures, storage, and process scheduling in containerized... 
    Remote work

    Crossing Hurdles

    New York, NY
    1 day ago
  •  ...environment where each team member plays a significant and impactful role. Overall Purpose and Responsibilities of the Role As a Site Reliability Engineer, you will help build and support a technology platform while working closely with support staff and developers. You will... 
    Full time
    For contractors
    Remote work
    Work from home
    Monday to Friday

    Manila Recruitment

    New York, NY
    1 day ago
  •  ...obsessed about achieving the high quality and reliability our customers demand. You will work...  ...deliverables will reach the entire engineering organization to enable product teams to...  ...secure cloud platforms and tools. Apply site reliability engineering principles to improve... 
    Remote work

    BOSTON TRUST WALDEN COMPANY

    New York, NY
    1 day ago
  •  ...Job Description Job Description Forhyre is looking for engineers who can bring unique perspectives and innovative ideas to all areas...  ...evangelize cloud best practices while building a culture of reliability and observability Engage in and improve the end to end lifecycle... 

    forhyre.com

    New York, NY
    5 hours ago
  • $150k - $200k

     .... But while there is a lot to celebrate in our past, there is almost as much opportunity ahead of us. We’re seeking a Sr. Site Reliability Engineer to join our team! About the Role We are seeking a Senior Site Reliability Engineer (SRE) to help ensure the stability, scalability... 
    Full time
    Remote work
    Flexible hours

    Backblaze

    New York, NY
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Site Reliability Engineer. Be the first to apply!