Reliability Lead, Common Services
$206k - $303k
CoreWeave
What You’ll Do

The Common Services organization at CoreWeave is responsible for the shared platforms, APIs, and foundational services that power our AI cloud products and internal engineering teams. From authentication and authorization to core platform primitives and developer experience tooling, this organization ensures that the rest of CoreWeave can build, ship, and operate reliably at scale.

As Reliability Lead, Common Services, you will establish and lead the Reliability Engineering and production operations practice for this organization. You’ll partner closely with engineering leaders and teams across Common Services to define how we build, release, monitor, and operate critical services, raising the bar on reliability, availability, and operational excellence across the board.

About the Role

You will be responsible for defining the reliability strategy, processes, and standards for the Common Services portfolio and driving consistent, high‑quality operational practices across multiple teams. You’ll monitor production incidents within Common Services, and work directly with your partner teams to design systems that are reliable, observable, and supportable. Your day‑to‑day will blend hands‑on technical work and cross‑functional leadership to drive continuous improvement of Common Services production operations.

In This Role

- Establish and lead the SRE / production engineering practice for the Common Services organization, including standards for reliability, incident management, and on‑call, in partnership with the central Product Engineering organization.
- Develop an Operational Excellence strategy that focuses not only on improving system performance but also on monitoring and reducing operational toil.
- Partner with engineering and product teams to define SLOs, SLIs, and error budgets for critical Common Services, and ensure these become part of how teams plan and make trade‑offs.
- Own and improve the incident management lifecycle for Common Services, including on‑call rotations, escalation paths, incident tooling, post‑incident reviews, and follow‑through on corrective actions.
- Drive the observability strategy (metrics, logs, traces, dashboards, alerts) for Common Services, ensuring we have actionable visibility into the health, performance, and capacity of key systems.
- Collaborate with engineering leads to design and review architectures for reliability, scalability, resilience, and operability, including failure modes, redundancy, and graceful degradation.
- Lead efforts to automate and harden operational workflows, including deployments, rollbacks, configuration management, change management, and routine maintenance tasks.
- Build strong, trust‑based relationships with partner teams and stakeholders, becoming a go‑to leader for production readiness and operational risk within Common Services.
- Hire, mentor, and develop SRE and production engineering talent, fostering a culture of continuous improvement, learning from incidents, and humane on‑call.
- Partner with other SRE and production engineering leaders across CoreWeave to align on global practices, tools, and reliability goals, representing the needs and constraints of Common Services.

Who You Are

- 7+ years of experience in Site Reliability Engineering, Production Engineering, or similar roles working on distributed systems or cloud/platform services.
- 2+ years of technical leadership experience (team lead, staff/principal engineer, or people manager) where you drove reliability and operational improvements across multiple services or teams.
- Strong background in Linux‑based production environments, containers, and orchestration technologies (e.g., Kubernetes), including debugging complex issues in live systems.
- Hands‑on experience with observability stacks (metrics, logging, tracing) and alerting systems, and a track record of designing meaningful SLIs/SLOs and alert strategies.
- Proven experience running on‑call rotations and incident response, including leading high‑severity incidents and driving high‑quality post‑incident reviews.
- Demonstrated ability to design for reliability (capacity planning, redundancy, failover, backoff, circuit breaking, graceful degradation, etc.) in large‑scale or mission‑critical systems.
- Comfortable working with infrastructure‑as‑code and automation tooling (e.g., Terraform, Ansible, Helm, CI/CD pipelines) to make operations repeatable, auditable, and safe.
- Strong cross‑functional communication skills: you can translate between engineering, product, and business stakeholders and influence without relying solely on authority.
- A bias toward data‑driven decision making, using production data, capacity signals, and incident trends to inform priorities and investments.

Preferred

- Background working with GPU workloads, high‑performance computing, or latency/throughput‑sensitive systems.
- Experience with multi‑tenant, multi‑region, or highly regulated environments, and the associated reliability considerations.
- Familiarity with service ownership models and strong opinions on how to align ownership, on‑call, and accountability in a scalable way.
- Experience mentoring or managing senior engineers and building high‑performing teams through coaching, feedback, and clear expectations.

What We Offer

The base salary range for this role is $206,000 to $303,000. The starting salary will be determined based on job‑related knowledge, skills, experience, and market location. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility).
- Medical, dental, and vision insurance – 100% paid for by CoreWeave
- Company‑paid Life Insurance
- Voluntary supplemental life insurance
- Short and long‑term disability insurance
- Flexible Spending Account
- Health Savings Account
- Tuition Reimbursement
- Employee Stock Purchase Program (ESPP)
- Mental Wellness Benefits through Spring Health
- Family‑Forming support provided by Carrot
- Paid Parental Leave
- Flexible, full‑service childcare support with Kinside
- 401(k) with a generous employer match
- Flexible PTO
- Catered lunch each day in our office and data center locations
- A casual work environment
- A work culture focused on innovative disruption

Our Workplace

While we prioritize a hybrid work environment, remote work may be considered for candidates located more than 30 miles from an office, based on role requirements for specialized skill sets. New hires will be invited to attend onboarding at one of our hubs within their first month. Teams also gather quarterly to support collaboration.

Export Control Compliance

This position requires access to export controlled information. To conform to U.S. Government export regulations applicable to that information, applicant must either be (A) a U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii) refugee under 8 U.S.C. 1158, or (iv) asylee under 8 U.S.C. 1159; (B) eligible to access the export controlled information without a required export authorization; or (C) eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency. CoreWeave may, for legitimate business reasons, decline to pursue any export licensing process.

CoreWeave is an equal‑opportunity employer, committed to fostering an inclusive and supportive workplace.
All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information. As part of this commitment and consistent with the Americans with Disabilities Act (ADA), CoreWeave will ensure that qualified applicants and candidates with disabilities are provided reasonable accommodations for the hiring process, unless such accommodation would cause an undue hardship.

