Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Site Reliability Engineer - Hosting

Hamilton Barnes Associates Limited

Are you looking for an exciting opportunity? Join a specialist technology provider delivering advanced provisioning, management, and security solutions for data centers. The organization helps operators enhance customer experience, streamline day‑to‑day operations, and stay ahead of the competition through innovative products and services, allowing them to focus on their core strengths in hardware and infrastructure. Your next opportunity starts here—apply today. Responsibilities Install and integrate Hydra’s Brokkr software with new datacenters and onboarded servers Maintain integrated datacenter and inventory, respond to L2 and L3 requests and alerts, and improve monitoring and other supporting infrastructure Monitor system performance and uptimes, ensuring the highest level of systems and infrastructure availability. Liaise with vendors and other IT personnel for problem resolution. Install, configure, test, and maintain operating systems, application software, and system management tools. Maintain security, backup, and redundancy strategies. Write and maintain custom scripts to increase system efficiency and lower human intervention time on any tasks. Participate in the design of information and operational support systems. Required Skills/Qualifications BS/MS degree in Computer Science, Engineering, or a related subject. Equivalent experience accepted. Proven working experience in installing, configuring, and troubleshooting UNIX/Linux-based environments. Solid experience in the administration and performance tuning of application stacks (e.g., Apache, MySQL, NGINX). Experience with virtualization and containerization (e.g., QEMU/KVM, Docker). Experience with monitoring systems (e.g., Nagios, Zabbix). Experience with automation software (e.g., Puppet, Chef, Ansible). Solid scripting skills (e.g., shell scripts, Perl, Ruby, Python). Solid networking knowledge (OSI network layers, TCP/IP, DNS, DHCP). Desirable Skills Certification in relevant fields (e.g., Linux Certifications, Cisco Certified Network Associate - CCNA, Microsoft Certified Systems Engineer - MCSE) are a plus. Experience with cloud services (AWS, Microsoft Azure) is a plus. Strong problem‑solving skills and the ability to work under pressure is a must. Strong communication skills and the ability to collaborate and be proactive in asking questions is a must. Benefits Flexible working hours and remote work opportunities. A supportive team environment with an emphasis on learning and growth. Access to cutting‑edge technology and tools. Salary Competitive salary and comprehensive benefits package. #J-18808-Ljbffr Hamilton Barnes Associates Limited

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Site Reliability Engineer - Hosting in San Francisco, CA vacancy
  • $151.5k - $252.5k

     .... About The Role We are looking for an experienced Senior Site Reliability Engineer to join the Veeam Data Cloud (VDC) engineering team. You will...  ...DB, Storage services, Azure Functions, static website hosting, Azure security, etc.) IaC tools (Azure ARM templates, AWS... 
    Suggested
    Base plus commission
    Local area
    Worldwide

    Veeam

    San Francisco, CA
    1 day ago
  • $181k - $263k

    ## Senior Staff Site Reliability EngineerApplylocations: San Franciscotime type: Full timeposted...  ...for a Senior Staff Site Reliability Engineer who will set the technical direction for...  ...people who love what they do.* Fun: We host in-person and virtual events such as game... 
    Suggested
    Work from home
    Flexible hours
    Night shift

    LiveRamp

    San Francisco, CA
    19 hours ago
  •  ...shape the future of healthcare, we’d love to meet you. About the role We’re hiring an SRE to join our engineering team at Plenful and take ownership of the reliability and performance of the systems that power our product. You’ll work across our distributed workflow... 
    Suggested
    Work at office
    Remote work
    Flexible hours
    2 days per week

    Plenful

    San Francisco, CA
    3 days ago
  •  ...seeking an expert to help build their open superintelligence infrastructure in San Francisco. You will lead efforts in developing a hosted training platform that enables users to launch LoRA and fine-tuning runs on managed GPU clusters. Ideal candidates will have strong... 
    Suggested
    Flexible hours

    Prime-Intellect

    San Francisco, CA
    3 days ago
  • $232k - $319k

     ...millions of users a day. The service is hosted on Amazon Web Services (AWS) across multiple...  ...scale the service with great people and reliable, cost-effective, and efficient...  ...Accelerate the velocity of SRE and product engineering by developing robust platforms, powerful... 
    Suggested
    Permanent employment
    Local area
    Worldwide
    Flexible hours

    Okta, Inc.

    San Francisco, CA
    19 hours ago
  • $266k

     ...integrate emerging technologies, and develop the host-side systems software needed to make these systems performant, reliable, and production-ready. About the Role...  ...re looking for an experienced systems software engineer to help define and build the host software stack... 

    OpenAI

    San Francisco, CA
    19 hours ago
  •  ...Staff Software Engineer, Listings & Host Tools and AI Airbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every country across... 
    Work experience placement

    airbnb, Inc.

    San Francisco, CA
    19 hours ago
  • OpenAI in San Francisco is seeking an experienced systems software engineer to define and build host software for next-generation AI systems. This role focuses on low-level device interfaces and system optimization across hardware and software boundaries. Ideal candidates... 

    OpenAI

    San Francisco, CA
    4 days ago
  • AI Chopping Block, Inc. in San Francisco is looking for an experienced systems software engineer to develop the host software stack for next-generation AI systems. You will work on performance-critical software, including Linux kernel drivers and system-scale networking... 

    AI Chopping Block, Inc.

    San Francisco, CA
    4 days ago
  • $150k

     ...Job Description Job Description About The Role We are seeking an experienced Site Reliability Engineer (SRE) with a strong focus on DevSecOps to join our growing engineering team. In this role, you will oversee and maintain the reliability, security posture, and... 

    VantageScore

    San Francisco, CA
    3 days ago
  • $163k - $203k

     ...will be a senior technical contributor on the SRE team, responsible for the reliability, scalability, and security of Prosper’s Cloud Platform portfolio. This is as much of a platform engineering role as it is SRE role — you will maintain the applications that run on our... 
    Work experience placement
    Work at office
    Local area
    Remote work
    Flexible hours
    2 days per week

    Prosper

    San Francisco, CA
    24 days ago
  • # Senior Site Reliability EngineerHybrid - San Francisco**Our Mission & Values:** At Drata, we help companies earn and keep the trust of...  ...**Job Summary:**Drata's SRE team operates as both a central engineering function and an embedded reliability practice. You'll be part... 
    Work at office
    Immediate start
    Worldwide
    Monday to Friday
    Flexible hours

    Careers at Drata

    San Francisco, CA
    1 day ago
  • $60 per hour

    Senior Site Reliability Engineer (Copy) Seattle Hybrid (Hybrid location). Full-time. About Us Supio is a trusted AI platform purpose-built for law firms, reshaping how data drives impactful outcomes. Our innovative approach blends technology with deep legal expertise,... 
    Full time
    Work at office
    Flexible hours

    Bonfirevc

    San Francisco, CA
    19 hours ago
  • $175k - $250k

     ...fully distributed across North American time zones and supports a fast‑growing customer base of SaaS companies. About the Site Reliability Engineering Team The Site Reliability Engineering (SRE) team ensures the WorkOS platform remains fast, reliable, and resilient at... 
    Remote work

    I did my part and supported the Regular Toilet

    San Francisco, CA
    19 hours ago
  • $125k - $165k

    Position Site Reliability Engineer Location Lincoln, NE, San Francisco, CA, or Remote Job ID 434 Openings 1 Job Summary The Site Reliability Engineer will help ensure the reliability, scalability, and performance of the systems that power our AI products. This role... 
    Temporary work
    Remote work
    Visa sponsorship
    Work visa
    Flexible hours

    TELCOR Inc

    San Francisco, CA
    3 days ago
  •  ...co-founders with PhDs in AI, Math, and Computer Science — is poised to redefine computing. About the Role We're seeking a Site Reliability Engineer to ensure Hyperbolic's GPU marketplace and AI infrastructure operate with exceptional reliability, performance, and... 

    Hyperbolic Labs

    San Francisco, CA
    2 days ago
  • $163k - $203k

     ...will be a senior technical contributor on the SRE team, responsible for the reliability, scalability, and security of Prosper’s Cloud Platform portfolio. This is as much a platform engineering role as it is an SRE role— you will maintain the applications that run on our... 
    Work experience placement
    Work at office
    Remote work
    Flexible hours
    2 days per week

    GoTo Meeting

    San Francisco, CA
    19 hours ago
  •  ...customer acquisition, and Connor was a machine learning research engineer at Scale AI. The rest of our team comes from companies like...  ...of-the-art AI. As a Senior SRE, you'll tackle the scaling and reliability challenges that come with adding terabytes of data monthly and... 

    Unify

    San Francisco, CA
    19 hours ago
  •  ...work from home day is currently Tuesday. Engineering at Lambda is responsible for building...  ...observability adoptable and improve product reliability. Lead members of other engineering teams...  ...in Go Have 5+ years of experience in Site Reliability Engineering practices Possess... 
    Work at office
    Local area
    Work from home

    Lambda

    San Francisco, CA
    4 days ago
  • $166.9k - $225.9k

    Job Summary Drata's SRE team operates as both a central engineering function and an embedded reliability practice. You'll be part of a close-knit SRE team...  ...organization. What you’ll bring 6+ years of experience in Site Reliability Engineering, Cloud Engineering, or... 
    Flexible hours

    Drata

    San Francisco, CA
    1 day ago
  •  ...manifesto. About the Role We're looking for an Infrastructure Engineer to take the lead on scaling our operational resilience as we...  ...This is a high-impact, high-trust role where you’ll shape how reliability is done - reducing incident load, building internal tooling, and... 
    Worldwide
    Shift work

    Happyrobot Inc.

    San Francisco, CA
    19 hours ago
  • $140k - $220k

    About the Job You’ll own reliability and operational excellence for Pylon's production systems. This means designing and implementing...  ...scale as we grow. You'll build tooling that makes the entire engineering team more effective, establish on-call rotations and runbooks... 

    Pylon

    San Francisco, CA
    3 days ago
  • A dynamic tech firm located in San Francisco is seeking a Site Reliability Engineer to enhance operational health across their production systems. This high-impact role demands expertise in AWS and strong programming skills. You will manage production systems' reliability... 

    gamma.app

    San Francisco, CA
    3 days ago
  •  ...advanced algorithms that significantly outperforms individual engineers. We combine language models with human ingenuity to push the...  ...and quality. The Role We are seeking an experienced Site Reliability Engineer to join our Platform Engineering team in the Bay Area... 

    CodeRabbit

    San Francisco, CA
    19 hours ago
  • US Corp. is seeking a Lead Site Reliability Engineer to spearhead our mission of delivering highly available and performant systems. With an average of over 12 years of industry experience, the successful candidate will bridge the gap between software development and systems... 

    Axiom Pursuits

    San Francisco, CA
    19 hours ago
  •  ...Responsibilities Lead and onboard services and teams to the reliability tenets. Establish and maintain Service Level Objectives (...  ...Science or equivalent. 6+ years of experience in Site Reliability Engineering, managing infrastructure and services at scale. History of... 

    OutSystems, Inc.

    San Francisco, CA
    19 hours ago
  •  ...alongside clinicians to make that possible. We’re a team of doctors, engineers, designers, researchers, and creatives building tools that...  ...for leading incidents end-to-end. Improve operational reliability: Identify recurring issues and reliability risks, and drive fixes... 
    Work at office
    Worldwide

    Heidi Health Ltd

    San Francisco, CA
    19 hours ago
  •  ...millions of daily users while enabling our engineering teams to ship fast. You'll own the...  ...building automation and tooling that improves reliability and partnering with engineering to...  ...services What you'll bring 5+ years in Site Reliability Engineering, DevOps, or systems... 
    Work at office
    Work from home

    gamma.app

    San Francisco, CA
    3 days ago
  • $125k - $165k

    Position: Site Reliability Engineer Location: San Francisco, CA Job Id: 434 # of Openings: 1 TELCOR Inc, a leading innovator in laboratory software, is looking for a Site Reliability Engineer to join our TELCOR AI Systems team! Do you have strong experience in cloud... 
    Temporary work
    Work at office
    Visa sponsorship
    Work visa
    Relocation package
    Flexible hours

    TELCOR

    San Francisco, CA
    3 days ago
  • The role We're looking for a world-class Site Reliability Engineer to ensure the reliability, performance, and scalability of our AI infrastructure platform. You’ll be building and operating the core systems that power agentic AI at scale. Your mission: keep our ultra-... 

    Blaxel

    San Francisco, CA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Site Reliability Engineer - Hosting. Be the first to apply!