Site Reliability Engineer - Hosting
Are you looking for an exciting opportunity?
Join a specialist technology provider delivering advanced provisioning, management, and security solutions for data centers. The organization helps operators enhance customer experience, streamline day-to-day operations, and stay ahead of the competition through innovative products and services, allowing them to focus on their core strengths in hardware and infrastructure.
If you would like to learn more about this opportunity, feel free to reach out and apply today!
Responsibilities:
- Install and integrate Hydra’s Brokkr software with new datacenters and onboarded servers
- Maintain integrated datacenter and inventory, respond to L2 and L3 requests and alerts, and improve monitoring and other supporting infrastructure
- Monitor system performance and uptimes, ensuring the highest level of systems and infrastructure availability.
- Liaise with vendors and other IT personnel for problem resolution.
- Install, configure, test, and maintain operating systems, application software, and system management tools.
- Maintain security, backup, and redundancy strategies.
- Write and maintain custom scripts to increase system efficiency and lower human intervention time on any tasks.
- Participate in the design of information and operational support systems.
Required Skills/Qualifications:
- BS/MS degree in Computer Science, Engineering, or a related subject. Equivalent experience accepted.
- Proven working experience in installing, configuring, and troubleshooting UNIX/Linux-based environments.
- Solid experience in the administration and performance tuning of application stacks (e.g., Apache, MySQL, NGINX).
- Experience with virtualization and containerization (e.g., QEMU/KVM, Docker).
- Experience with monitoring systems (e.g., Nagios, Zabbix).
- Experience with automation software (e.g., Puppet, Chef, Ansible).
- Solid scripting skills (e.g., shell scripts, Perl, Ruby, Python).
- Solid networking knowledge (OSI network layers, TCP/IP, DNS, DHCP).
Desirable Skills:
- Certification in relevant fields (e.g., Linux Certifications, Cisco Certified Network Associate - CCNA, Microsoft Certified Systems Engineer - MCSE) are a plus.
- Experience with cloud services (AWS, Microsoft Azure) is a plus.
- Strong problem-solving skills and the ability to work under pressure is a must.
- Strong communication skills and the ability to collaborate and be proactive in asking questions is a must.
Benefits :
- Flexible working hours and remote work opportunities.
- A supportive team environment with an emphasis on learning and growth.
- Access to cutting-edge technology and tools.
Salary:
- Competitive salary and comprehensive benefits package.
- ...in the design of information and operational support systems. Required Skills/Qualifications BS/MS degree in Computer Science, Engineering, or a related subject. Equivalent experience accepted. Proven working experience in installing, configuring, and troubleshooting...SuggestedWork experience placementStart working todayRemote workFlexible hours
$151.5k - $252.5k
.... About The Role We are looking for an experienced Senior Site Reliability Engineer to join the Veeam Data Cloud (VDC) engineering team. You will... ...DB, Storage services, Azure Functions, static website hosting, Azure security, etc.) IaC tools (Azure ARM templates, AWS...SuggestedBase plus commissionLocal areaWorldwide$250k
...Europe, while now significantly expanding its footprint in the United States. The company is looking for a Senior / Staff Site Reliability Engineer to support and scale large-scale HPC and cloud environments powering GPU-intensive workloads. The role involves working...SuggestedPermanent employmentRemote work$15 per hour
Summary The Wikimedia Foundation is looking for a Senior Site Reliability Engineer to support and develop the platform serving the world’s favorite... ...should be able to access that knowledge freely. We host Wikipedia and the Wikimedia projects, build software experiences...SuggestedPermanent employmentFor contractorsRemote work$181k - $263k
...line operational support. We are looking for a Senior Staff Site Reliability Engineer who will set the technical direction for reliability... ...collaborative, and friendly people who love what they do.Fun: We host in-person and virtual events such as game nights, happy hours...SuggestedWork from homeFlexible hoursNight shift$232k - $319k
...millions of users a day. The service is hosted on Amazon Web Services (AWS) across multiple... ...scale the service with great people and reliable, cost-effective, and efficient... ...partnership with architects and product engineering Build a world-class observability platform...Permanent employmentLocal areaWorldwideFlexible hours- ...seeking an expert to help build their open superintelligence infrastructure in San Francisco. You will lead efforts in developing a hosted training platform that enables users to launch LoRA and fine-tuning runs on managed GPU clusters. Ideal candidates will have strong...Flexible hours
$104k - $130k
...infrastructure as well as help improve the reliability, quality of services and overall... ...recovery. You’ll collaborate or embed with engineering teams, helping them to improve the... ...more about our locations by visiting our site. Compensation & Benefits The base...Full timeWork experience placement$163k - $203k
...will be a senior technical contributor on the SRE team, responsible for the reliability, scalability, and security of Prosper’s Cloud Platform portfolio. This is as much of a platform engineering role as it is SRE role — you will maintain the applications that run on our...Work experience placementWork at officeLocal areaRemote workFlexible hours2 days per week$150k
...Job Description Job Description About The Role We are seeking an experienced Site Reliability Engineer (SRE) with a strong focus on DevSecOps to join our growing engineering team. In this role, you will oversee and maintain the reliability, security posture, and...Full time$197.3k - $225.1k
...Lead AI Engineer (FM Hosting, LLM Inference) Overview At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years, Capital One has been... ...information available through this site. Capital One Financial is made up of...Full timePart timeLocal area$163k - $203k
...will be a senior technical contributor on the SRE team, responsible for the reliability, scalability, and security of Prosper’s Cloud Platform portfolio. This is as much a platform engineering role as it is an SRE role— you will maintain the applications that run on our...Work experience placementWork at officeRemote workFlexible hours2 days per week$125k - $165k
Position Site Reliability Engineer Location Lincoln, NE, San Francisco, CA, or Remote Job ID 434 Openings 1 Job Summary The Site Reliability Engineer will help ensure the reliability, scalability, and performance of the systems that power our AI products. This role...Temporary workRemote workVisa sponsorshipWork visaFlexible hours$165k - $225k
...and the SDF team is expanding to support the rapidly growing and changing Stellar ecosystem. SDF is looking for a Senior Site Reliability Engineer to help build and operate the foundation that powers our engineering teams. You’ll ensure the reliability and scalability...Temporary workWork at officeLocal areaWorldwideFlexible hours- ...customer acquisition, and Connor was a machine learning research engineer at Scale AI. The rest of our team comes from companies like... ...of-the-art AI. As a Senior SRE, you'll tackle the scaling and reliability challenges that come with adding terabytes of data monthly and...
$160k - $250k
Responsibilities Automate manual operational processes Improve workflows of developer, data, and machine learning teams Manage secure integration and deployment tooling Create, maintain, monitor, and audit secure infrastructure Manage a diverse array of technology platforms...- ...co-founders with PhDs in AI, Math, and Computer Science — is poised to redefine computing. About the Role We're seeking a Site Reliability Engineer to ensure Hyperbolic's GPU marketplace and AI infrastructure operate with exceptional reliability, performance, and...
- ...shape the future of healthcare, we’d love to meet you. About the role We’re hiring an SRE to join our engineering team at Plenful and take ownership of the reliability and performance of the systems that power our product. You’ll work across our distributed workflow...Work at officeRemote workFlexible hours2 days per week
- # Senior Site Reliability EngineerHybrid - San Francisco**Our Mission & Values:** At Drata, we help companies earn and keep the trust of... ...**Job Summary:**Drata's SRE team operates as both a central engineering function and an embedded reliability practice. You'll be part...Work at officeImmediate startWorldwideMonday to FridayFlexible hours
$175k - $250k
...fully distributed across North American time zones and supports a fast‑growing customer base of SaaS companies. About the Site Reliability Engineering Team The Site Reliability Engineering (SRE) team ensures the WorkOS platform remains fast, reliable, and resilient at...Remote work$60 per hour
Senior Site Reliability Engineer (Copy) Seattle Hybrid (Hybrid location). Full-time. About Us Supio is a trusted AI platform purpose-built for law firms, reshaping how data drives impactful outcomes. Our innovative approach blends technology with deep legal expertise,...Full timeWork at officeFlexible hours- ...millions of daily users while enabling our engineering teams to ship fast. You'll own the... ...building automation and tooling that improves reliability and partnering with engineering to... ...services What you'll bring 5+ years in Site Reliability Engineering, DevOps, or systems...Work at officeWork from home
$125k - $165k
Position: Site Reliability Engineer Location: San Francisco, CA Job Id: 434 # of Openings: 1 TELCOR Inc, a leading innovator in laboratory software, is looking for a Site Reliability Engineer to join our TELCOR AI Systems team! Do you have strong experience in cloud...Temporary workWork at officeVisa sponsorshipWork visaRelocation packageFlexible hours- ...alongside clinicians to make that possible. We’re a team of doctors, engineers, designers, researchers, and creatives building tools that... ...for leading incidents end-to-end. Improve operational reliability: Identify recurring issues and reliability risks, and drive fixes...Work at officeWorldwide
- The role We're looking for a world-class Site Reliability Engineer to ensure the reliability, performance, and scalability of our AI infrastructure platform. You’ll be building and operating the core systems that power agentic AI at scale. Your mission: keep our ultra-...
- ...and enthusiasm for building a great culture and product, you will find a home at Fieldguide. About the Role As a Senior Site Reliability Engineer (SRE) at Fieldguide, you will be responsible for ensuring the reliability, scalability, and observability of our production...Remote workWork from homeFlexible hours
$140.3k - $191.55k
...WriteMed.AI helps Biopharma and Life Sciences companies reduce time to write medical publications and regulatory paperwork. Site Reliability Engineer Location: Atlanta, GA; Miami, FL; Cambridge, MA; San Francisco, CA; Towson, MD Role Overview Our technical team supports...Temporary workWork experience placement- ...manifesto. About the Role We're looking for an Infrastructure Engineer to take the lead on scaling our operational resilience as we... ...This is a high-impact, high-trust role where you’ll shape how reliability is done - reducing incident load, building internal tooling, and...WorldwideShift work
$166.9k - $225.9k
Job Summary Drata's SRE team operates as both a central engineering function and an embedded reliability practice. You'll be part of a close-knit SRE team... ...organization. What you’ll bring 6+ years of experience in Site Reliability Engineering, Cloud Engineering, or...Flexible hours- ...work from home day is currently Tuesday. Engineering at Lambda is responsible for building... ...observability adoptable and improve product reliability. Lead members of other engineering teams... ...in Go Have 5+ years of experience in Site Reliability Engineering practices Possess...Work at officeLocal areaWork from home
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Site Reliability Engineer - Hosting. Be the first to apply!
- site reliability engineer San Francisco, CA
- site reliability engineer remote San Francisco, CA
- site reliability engineer sre San Francisco, CA
- website content developer San Francisco, CA
- site services specialist San Francisco, CA
- site recruiter San Francisco, CA
- IT site lead San Francisco, CA
- on-site clinical research associate (traveling/remote) San Francisco, CA
- on site coordinator San Francisco, CA
- website coordinator San Francisco, CA


