Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Site Reliability Engineer - Hosting

Full-time

Are you looking for an exciting opportunity?

Join a specialist technology provider delivering advanced provisioning, management, and security solutions for data centers. The organization helps operators enhance customer experience, streamline day-to-day operations, and stay ahead of the competition through innovative products and services, allowing them to focus on their core strengths in hardware and infrastructure.

If you would like to learn more about this opportunity, feel free to reach out and apply today!

Responsibilities:

  • Install and integrate Hydra’s Brokkr software with new datacenters and onboarded servers 
  • Maintain integrated datacenter and inventory, respond to L2 and L3 requests and alerts, and improve monitoring and other supporting infrastructure 
  • Monitor system performance and uptimes, ensuring the highest level of systems and infrastructure availability. 
  • Liaise with vendors and other IT personnel for problem resolution. 
  • Install, configure, test, and maintain operating systems, application software, and system management tools. 
  • Maintain security, backup, and redundancy strategies. 
  • Write and maintain custom scripts to increase system efficiency and lower human intervention time on any tasks. 
  • Participate in the design of information and operational support systems. 

Required Skills/Qualifications:

  • BS/MS degree in Computer Science, Engineering, or a related subject. Equivalent experience accepted.  
  • Proven working experience in installing, configuring, and troubleshooting UNIX/Linux-based environments. 
  • Solid experience in the administration and performance tuning of application stacks (e.g., Apache, MySQL, NGINX). 
  • Experience with virtualization and containerization (e.g., QEMU/KVM, Docker). 
  • Experience with monitoring systems (e.g., Nagios, Zabbix). 
  • Experience with automation software (e.g., Puppet, Chef, Ansible). 
  • Solid scripting skills (e.g., shell scripts, Perl, Ruby, Python). 
  • Solid networking knowledge (OSI network layers, TCP/IP, DNS, DHCP). 

Desirable Skills:

  • Certification in relevant fields (e.g., Linux Certifications, Cisco Certified Network Associate - CCNA, Microsoft Certified Systems Engineer - MCSE) are a plus. 
  • Experience with cloud services (AWS, Microsoft Azure) is a plus. 
  • Strong problem-solving skills and the ability to work under pressure is a must. 
  • Strong communication skills and the ability to collaborate and be proactive in asking questions is a must.  

Benefits :

  • Flexible working hours and remote work opportunities. 
  • A supportive team environment with an emphasis on learning and growth. 
  • Access to cutting-edge technology and tools.

Salary:

  • Competitive salary and comprehensive benefits package.
Vacancy posted 27 days ago
Similar jobs that could be interesting for youBased on the Site Reliability Engineer - Hosting in San Francisco, CA vacancy
  •  ...in the design of information and operational support systems. Required Skills/Qualifications BS/MS degree in Computer Science, Engineering, or a related subject. Equivalent experience accepted. Proven working experience in installing, configuring, and troubleshooting... 
    Suggested
    Work experience placement
    Start working today
    Remote work
    Flexible hours

    Hamilton Barnes Associates Limited

    San Francisco, CA
    5 days ago
  • $151.5k - $252.5k

     .... About The Role We are looking for an experienced Senior Site Reliability Engineer to join the Veeam Data Cloud (VDC) engineering team. You will...  ...DB, Storage services, Azure Functions, static website hosting, Azure security, etc.) IaC tools (Azure ARM templates, AWS... 
    Suggested
    Base plus commission
    Local area
    Worldwide

    Veeam

    San Francisco, CA
    3 days ago
  • $250k

     ...Europe, while now significantly expanding its footprint in the United States. The company is looking for a Senior / Staff Site Reliability Engineer to support and scale large-scale HPC and cloud environments powering GPU-intensive workloads. The role involves working... 
    Suggested
    Permanent employment
    Remote work
    San Francisco, CA
    5 days ago
  • $15 per hour

    Summary The Wikimedia Foundation is looking for a Senior Site Reliability Engineer to support and develop the platform serving the world’s favorite...  ...should be able to access that knowledge freely. We host Wikipedia and the Wikimedia projects, build software experiences... 
    Suggested
    Permanent employment
    For contractors
    Remote work

    Nerdleveltech

    San Francisco, CA
    1 day ago
  • $181k - $263k

     ...line operational support. We are looking for a Senior Staff Site Reliability Engineer who will set the technical direction for reliability...  ...collaborative, and friendly people who love what they do.Fun: We host in-person and virtual events such as game nights, happy hours... 
    Suggested
    Work from home
    Flexible hours
    Night shift

    Liveramp

    San Francisco, CA
    5 days ago
  • $232k - $319k

     ...millions of users a day. The service is hosted on Amazon Web Services (AWS) across multiple...  ...scale the service with great people and reliable, cost-effective, and efficient...  ...partnership with architects and product engineering Build a world-class observability platform... 
    Permanent employment
    Local area
    Worldwide
    Flexible hours

    Okta, Inc.

    San Francisco, CA
    2 days ago
  •  ...seeking an expert to help build their open superintelligence infrastructure in San Francisco. You will lead efforts in developing a hosted training platform that enables users to launch LoRA and fine-tuning runs on managed GPU clusters. Ideal candidates will have strong... 
    Flexible hours

    Prime-Intellect

    San Francisco, CA
    5 days ago
  • $104k - $130k

     ...infrastructure as well as help improve the reliability, quality of services and overall...  ...recovery.  You’ll collaborate or embed with engineering teams, helping them to improve the...  ...more about our locations by visiting our site. Compensation & Benefits The base... 
    Full time
    Work experience placement

    AppFolio

    San Francisco, CA
    18 hours ago
  • $163k - $203k

     ...will be a senior technical contributor on the SRE team, responsible for the reliability, scalability, and security of Prosper’s Cloud Platform portfolio. This is as much of a platform engineering role as it is SRE role — you will maintain the applications that run on our... 
    Work experience placement
    Work at office
    Local area
    Remote work
    Flexible hours
    2 days per week

    Prosper.com

    San Francisco, CA
    1 day ago
  • $150k

     ...Job Description Job Description About The Role We are seeking an experienced Site Reliability Engineer (SRE) with a strong focus on DevSecOps to join our growing engineering team. In this role, you will oversee and maintain the reliability, security posture, and... 
    Full time

    VantageScore

    San Francisco, CA
    13 days ago
  • $197.3k - $225.1k

     ...Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been...  ...information available through this site. Capital One Financial is made up of... 
    Full time
    Part time
    Local area

    Capital One Financial Corp

    San Francisco, CA
    1 day ago
  • $163k - $203k

     ...will be a senior technical contributor on the SRE team, responsible for the reliability, scalability, and security of Prosper’s Cloud Platform portfolio. This is as much a platform engineering role as it is an SRE role— you will maintain the applications that run on our... 
    Work experience placement
    Work at office
    Remote work
    Flexible hours
    2 days per week

    GoTo Meeting

    San Francisco, CA
    2 days ago
  • $125k - $165k

    Position Site Reliability Engineer Location Lincoln, NE, San Francisco, CA, or Remote Job ID 434 Openings 1 Job Summary The Site Reliability Engineer will help ensure the reliability, scalability, and performance of the systems that power our AI products. This role... 
    Temporary work
    Remote work
    Visa sponsorship
    Work visa
    Flexible hours

    TELCOR Inc

    San Francisco, CA
    5 days ago
  • $165k - $225k

     ...and the SDF team is expanding to support the rapidly growing and changing Stellar ecosystem. SDF is looking for a Senior Site Reliability Engineer to help build and operate the foundation that powers our engineering teams. You’ll ensure the reliability and scalability... 
    Temporary work
    Work at office
    Local area
    Worldwide
    Flexible hours

    Stellar

    San Francisco, CA
    5 days ago
  •  ...customer acquisition, and Connor was a machine learning research engineer at Scale AI. The rest of our team comes from companies like...  ...of-the-art AI. As a Senior SRE, you'll tackle the scaling and reliability challenges that come with adding terabytes of data monthly and... 

    Unify

    San Francisco, CA
    2 days ago
  • $160k - $250k

    Responsibilities Automate manual operational processes Improve workflows of developer, data, and machine learning teams Manage secure integration and deployment tooling Create, maintain, monitor, and audit secure infrastructure Manage a diverse array of technology platforms...

    I did my part and supported the Regular Toilet

    San Francisco, CA
    1 day ago
  •  ...co-founders with PhDs in AI, Math, and Computer Science — is poised to redefine computing. About the Role We're seeking a Site Reliability Engineer to ensure Hyperbolic's GPU marketplace and AI infrastructure operate with exceptional reliability, performance, and... 

    Hyperbolic Labs

    San Francisco, CA
    4 days ago
  •  ...shape the future of healthcare, we’d love to meet you. About the role We’re hiring an SRE to join our engineering team at Plenful and take ownership of the reliability and performance of the systems that power our product. You’ll work across our distributed workflow... 
    Work at office
    Remote work
    Flexible hours
    2 days per week

    Plenful

    San Francisco, CA
    3 days ago
  • # Senior Site Reliability EngineerHybrid - San Francisco**Our Mission & Values:** At Drata, we help companies earn and keep the trust of...  ...**Job Summary:**Drata's SRE team operates as both a central engineering function and an embedded reliability practice. You'll be part... 
    Work at office
    Immediate start
    Worldwide
    Monday to Friday
    Flexible hours

    Careers at Drata

    San Francisco, CA
    3 days ago
  • $175k - $250k

     ...fully distributed across North American time zones and supports a fast‑growing customer base of SaaS companies. About the Site Reliability Engineering Team The Site Reliability Engineering (SRE) team ensures the WorkOS platform remains fast, reliable, and resilient at... 
    Remote work

    I did my part and supported the Regular Toilet

    San Francisco, CA
    2 days ago
  • $60 per hour

    Senior Site Reliability Engineer (Copy) Seattle Hybrid (Hybrid location). Full-time. About Us Supio is a trusted AI platform purpose-built for law firms, reshaping how data drives impactful outcomes. Our innovative approach blends technology with deep legal expertise,... 
    Full time
    Work at office
    Flexible hours

    Bonfirevc

    San Francisco, CA
    2 days ago
  •  ...millions of daily users while enabling our engineering teams to ship fast. You'll own the...  ...building automation and tooling that improves reliability and partnering with engineering to...  ...services What you'll bring 5+ years in Site Reliability Engineering, DevOps, or systems... 
    Work at office
    Work from home

    gamma.app

    San Francisco, CA
    5 days ago
  • $125k - $165k

    Position: Site Reliability Engineer Location: San Francisco, CA Job Id: 434 # of Openings: 1 TELCOR Inc, a leading innovator in laboratory software, is looking for a Site Reliability Engineer to join our TELCOR AI Systems team! Do you have strong experience in cloud... 
    Temporary work
    Work at office
    Visa sponsorship
    Work visa
    Relocation package
    Flexible hours

    TELCOR

    San Francisco, CA
    5 days ago
  •  ...alongside clinicians to make that possible. We’re a team of doctors, engineers, designers, researchers, and creatives building tools that...  ...for leading incidents end-to-end. Improve operational reliability: Identify recurring issues and reliability risks, and drive fixes... 
    Work at office
    Worldwide

    Heidi Health Ltd

    San Francisco, CA
    2 days ago
  • The role We're looking for a world-class Site Reliability Engineer to ensure the reliability, performance, and scalability of our AI infrastructure platform. You’ll be building and operating the core systems that power agentic AI at scale. Your mission: keep our ultra-... 

    Blaxel

    San Francisco, CA
    3 days ago
  •  ...and enthusiasm for building a great culture and product, you will find a home at Fieldguide. About the Role As a Senior Site Reliability Engineer (SRE) at Fieldguide, you will be responsible for ensuring the reliability, scalability, and observability of our production... 
    Remote work
    Work from home
    Flexible hours

    Fieldguide

    San Francisco, CA
    5 days ago
  • $140.3k - $191.55k

     ...WriteMed.AI helps Biopharma and Life Sciences companies reduce time to write medical publications and regulatory paperwork. Site Reliability Engineer Location: Atlanta, GA; Miami, FL; Cambridge, MA; San Francisco, CA; Towson, MD Role Overview Our technical team supports... 
    Temporary work
    Work experience placement

    Writemed

    San Francisco, CA
    2 days ago
  •  ...manifesto. About the Role We're looking for an Infrastructure Engineer to take the lead on scaling our operational resilience as we...  ...This is a high-impact, high-trust role where you’ll shape how reliability is done - reducing incident load, building internal tooling, and... 
    Worldwide
    Shift work

    Happyrobot Inc.

    San Francisco, CA
    2 days ago
  • $166.9k - $225.9k

    Job Summary Drata's SRE team operates as both a central engineering function and an embedded reliability practice. You'll be part of a close-knit SRE team...  ...organization. What you’ll bring 6+ years of experience in Site Reliability Engineering, Cloud Engineering, or... 
    Flexible hours

    Drata

    San Francisco, CA
    3 days ago
  •  ...work from home day is currently Tuesday. Engineering at Lambda is responsible for building...  ...observability adoptable and improve product reliability. Lead members of other engineering teams...  ...in Go Have 5+ years of experience in Site Reliability Engineering practices Possess... 
    Work at office
    Local area
    Work from home

    Lambda

    San Francisco, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Site Reliability Engineer - Hosting. Be the first to apply!