Remote Site Reliability Engineer
$130k - $180kGrabJobs
Why work at Nebius
Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in-house AI/ML teams. Our employees work at the cutting edge of AI cloud infrastructure alongside some of the most experienced and innovative leaders and engineers in the field.
Where we work
Headquartered in Amsterdam and listed on Nasdaq, Nebius has a global footprint with R&D hubs across Europe, North America, and Israel. The team of over 1400 employees includes more than 400 highly skilled engineers with deep expertise across hardware and software engineering, as well as an in-house AI R&D team.
The role
Nebius is looking for a Site Reliability Engineer in Hardware Infrastructure team. You’re welcome to work in our office in Amsterdam.
Hardware Infrastructure team designs, develops and supports systems involved in the data-centers lifecycle:
Serving functional and load testing system.
Monitoring of engineering equipment located in our data centers (power supply, air and water cooling, etc.)
Monitoring of IT equipment: racks, servers, JBODs, JBOGs, power shelves, network devices, etc.
Asset tracking.
Hardware repairs tasks tracking.
Server production.
In this position, your responsibility will be to :
Ensure fault-tolerance, scale and uninterrupted operations for our services.
Use cutting-edge technology to solve a variety of infrastructure problems.
Implement and improve CI/CD processes.
We expect you to have :
Proficiency in Linux systems, with expertise in Python and Bash scripting for automation.
Demonstrated ability to troubleshoot complex system issues, including hardware, software and networking problems.
Strong analytical and problem-solving skills, with a focus on optimizing system performance.
Working proficiency in English.
It would be an added bonus if you had :
Desire to be involved in backend development.
Experience designing, developing and running high-load distributed systems.
Working conditions:
Primarily remote
Occasional travel to data centers required, especially if not located near one
Collaboration with globally distributed engineering and operations teams
Key employee benefits:
Health insurance: 100% company-paid medical, dental, and vision coverage for employees and families
401(k) plan: up to 4% company match with immediate vesting
Parental leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers
Remote work reimbursement: up to $85/month for mobile and internet
Disability & life insurance: company-paid short-term, long-term, and life insurance coverage
Compensation
We offer competitive salaries, ranging from $130k- $180k base + quarterly performance bonuses.
Join Nebius and help operate the systems that power next-generation AI
infrastructure.
What we offer:
Competitive salary and comprehensive benefits package.
Opportunities for professional growth within Nebius.
Flexible working arrangements.
A dynamic and collaborative work environment that values initiative and innovation.
We’re growing and expanding our products every day. If you’re up to the challenge and are excited about AI and ML as much as we are, join us!
Equal Opportunity Statement:
Nebius is an equal opportunity employer. We are committed to fostering an inclusive and diverse workplace and to providing equal employment opportunities in all aspects of employment. We do not discriminate on the basis of race, color, religion, sex (including pregnancy), national origin, ancestry, age, disability, genetic information, marital status, veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by applicable law.
Applicants must be authorized to work in the country in which they apply, and will be required to provide proof of employment eligibility as a condition of hire.
- ...quality days at home. About the job Were looking for a Site Reliability Engineer (E3) to help build and operate the infrastructure that powers... ...care for more quality days at home. This position is remote and requires working East Coast business hours (EST). What...Remote workLocal area
- ...Site Reliability Engineer OXIO is the first NeoTelco. We are building the world’s largest, most accessible, and insightful Telecom network. Our platform empowers anyone to spin up their own carrier from a browser, scaling and supporting you as you scale your network to...Remote work
- Alegeus is seeking a Site Reliability Engineer I in New York to drive technology vision and solve high-impact technical challenges for critical... ..., and a Bachelor’s degree. This hybrid role allows for 75% remote work and offers competitive salary and benefits including paid...Remote job
- ...is looking for an experienced infrastructure engineer to join their infra team. In this role, you will ensure production reliability for their Kubernetes-based platform,... ...ideal candidate will have over four years in site reliability engineering, familiarity with operational...Remote job
$111k - $130k
...DIAGNOSTICS INC is seeking a Performance II‑Epic to provide reliability engineering services through observability and performance engineering... ...range of $111,000‑130,000, advantages for both hybrid and remote work. Best-in-class benefits include medical, vacation, financial...Remote job- Hetzner Online is seeking a Software Developer for Site Reliability Engineering (m/f/d) in Gunzenhausen, Germany. The role involves ensuring website performance, protecting systems from attacks, and designing caching solutions. Ideal candidates have strong programming...Remote jobFlexible hours
$160k - $300k
Overview Site Reliability Engineer (SRE) - Remote, Full‑time. Base pay $160K-$300K/year. Responsibilities Build and automate systems that ensure platform reliability, scalability, and performance. Lead incident response, including triage, resolution, and post‑mortem analysis...Remote jobFull time- We are looking for a Site Reliability Engineer to join our team at Rustici Software. Come work alongside our software development teams, deploying... ...we create in our AWS hosted infrastructure. We are a remote/in-office hybrid company located in Franklin, TN. While we...Remote jobTemporary workWork at officeLocal areaHome officeFlexible hours
- ...About the role We’re hiring an SRE to join our engineering team at Plenful and take ownership of the reliability and performance of the systems that power our product... .... Employees based in other cities enjoy a fully remote work environment with the ability to travel for...Remote workWork at officeFlexible hours2 days per week
- ...To build the technology infrastructure for an innovative insurance platform, the remote Site Reliability Engineer II will manage internal tooling, implement security best practices, and reduce operational toil through automation. Key Responsibilities Build internal tooling...Remote work
- ...Seeking a Site Reliability Engineer Specialist to work remotely in a full-time capacity, responsible for leading observability and incident response efforts, defining instrumentation standards, and mentoring engineers across teams. Key responsibilities Own the technical...Remote workFull time
- ...Joining a high-performing team remotely, the full-time Senior Site Reliability Engineer will own the reliability and automation of critical AI infrastructure, ensuring systems are resilient and secure while building automation tools to streamline operational workflows...Remote workFull time
- ...Site Reliability Engineers are responsible for ensuring the availability, reliability, scalability, and performance of the firm’s most critical... ...default. This is an on-site position located in Springfield, MO. Remote work is not an option for this position. Primary...Remote workLocal areaFlexible hoursShift work
- ...Leading the transition to a global, AI-powered Intelligent Agreement Management platform, the full-time remote Director of Site Reliability Engineering will take full ownership of global availability, manage the development of reliability tools, and mentor senior SREs...Remote workFull time
- ...Owning the reliability of large-scale GPU infrastructure, the full-time Staff Site Reliability Engineer will manage incident leadership, production operations, and observability systems in a remote environment, ensuring optimal performance for demanding AI workloads....Remote workFull time
- ...To enhance reliability across the organization, the full-time Lead Site Reliability Engineer will analyze incidents, prioritize improvements, and collaborate with engineering teams remotely to implement effective solutions and best practices. Key responsibilities Identify...Remote workFull time
- ...Seeking a Principal Site Reliability Engineer for a hybrid or remote role, this full-time position will design and implement scalable infrastructure across multiple cloud environments while driving an "automation-first" culture to enhance system reliability and observability...Remote workFull time
- ...Working remotely or in a hybrid capacity, the full-time Associate Site Reliability Engineer will monitor and support the live production environment, manage incident responses, and assist with release management while developing essential skills in a collaborative global...Remote workFull timeInternship
- ...organizations work. We are currently looking for a Senior Site Reliability Engineer to join our SRE team in the Platform Engineering organization... ...millions of user endpoints. Location We are flexible on remote work from home for candidates located in the USA in the following...Remote workPermanent employmentWork from homeFlexible hours
- The Site Reliability Engineer will be responsible for ensuring the availability, reliability, and performance of our customer-facing software applications... ...preferred Ability to travel up to 25% #LI-JB1 #LI-REMOTE This amount is what we reasonably believe we will pay for...Remote workFull timeWork at officeImmediate startWorldwideShift work
$150k - $200k
...Join to apply for the Senior Site Reliability Engineer role at Gradle Inc. Develocity is a first‑of‑its‑kind toolchain observability and acceleration... ...speed up, troubleshoot, and optimize local developer and remote CI feedback loops. Our software is used by some of the...Remote workFull timeLocal areaWork from home- ...careers, resources, tips and trends from the DevOps World. The Site Reliability Engineer position at Remotive revolves around ensuring the... ...continuous deployment (CI/CD) pipelines. As part of a fast-paced, remote team, strong communication skills and a proactive approach...Remote work
- ...critical services in a new public cloud platform? Join our IaaS Site Reliability Engineering (SRE) team. We design, develop, and operate infrastructure... ...Wherever you are in India is your office. We are a 100% remote-first team. We will support you, take care of you, and...Remote workWork at office
$160k - $240k
...organizations work. We are currently looking for a Senior Site Reliability Engineer to join our SRE team in the Platform Engineering organization... ...availability of our services. Location We are flexible on remote working from home if you are located in the USA and reside...Remote workPermanent employmentFull timeWork from homeRelocationFlexible hours- ...Job Title: Site Reliability Engineer (SRE) Job Location: Germany (Remote) Job Type: Fixed Term Contract (12 Months) Responsibilities Design, develop, and maintain observability platform component and integrations across Prometheus, Thanos, Grafana, OpenTelemetry, and streaming...Remote workFixed term contract
- ...things on Windows/Biz Tech – but they are shifting towards Linux – (70% Windows, 30% Linux) Remote access technology protocols are a plus Job Description: Site Reliability Engineer Periodic updates and maintenance of Windows-based golden image for ESX & AWS. Patching of...Remote workShift work
$125k - $150k
...display a cookie banner on the external site. You must specify the message in the... ...Denver office.**Role Overview:** As a Site Reliability Engineer (SRE) at Litera, you will play a key... ...Experience working with cloud platforms and remote collocated systems•Deep knowledge in...Remote workWork experience placementWork at officeShift work3 days per week$175k - $250k
...Senior Cloud Infrastructure Engineer Location: San Francisco, CA. Remote unavailable. Modality: On‑Site only. Must live within commuting distance of San Francisco... ...while ensuring scalability, performance, and reliability across environments. What You’ll Do Design...Remote workFull timeRelocationRelocation package$65 - $75 per hour
...Card Holders only Must be on our W2- no C2C, no exceptions Fully remote Key Responsibilities: Process customer requests to add, change,... ..., and IT Service Management tools. Description: As an Engineer 2, you will collaborate with management, departments, and customers...Remote workContract work$150k - $170k
...Senior Site Reliability Engineer – Zip Co Join to apply for the Senior Site Reliability Engineer role at Zip Co At Zip, we build cloud‑native software... ...) initiatives and mentor our engineering team. We offer a remote‑first opportunity for US‑based employees with the option to...Remote workCasual workWork at officeFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Remote Site Reliability Engineer. Be the first to apply!
- site reliability engineer remote United States
- site reliability engineer United States
- lead site reliability engineer United States
- site reliability engineer sre United States
- site reliability engineering manager United States
- program coordinator remote United States
- procurement specialist remote United States
- event manager remote United States
- remote optometrist United States
- remote prior authorization pharmacist United States


