Site Reliability Engineer: Platform & Observability
Apple Inc.
A leading technology company is seeking a Site Reliability Engineer in Cupertino, California. The role involves owning the reliability of AWS and Kubernetes services, designing systems, and collaborating with engineering teams for observability and automation. Candidates should have substantial experience with distributed systems, Kubernetes, and AWS, along with strong communication skills. A commitment to innovation and a collaborative environment is crucial. Competitive base salary and comprehensive benefits offered. #J-18808-Ljbffr Apple Inc.
$176k - $276k
Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production... ...and reliability aspects of large scale Observability & Telemetry collection platform with a focus on performance at scale, real time...Suggested$250k
...source of truth—explainable, reliable, and maintainable—that... ...Overview As Director of Site Reliability Engineering, you will ensure that eGain... ...s AI knowledge management platform operates with the... ...strategy and execution for observability, incident management, capacity...SuggestedWork at office$109k - $145k
...Software Engineer, Observability CoreWeave is The Essential Cloud for AI™.... ...pioneers, CoreWeave delivers a platform of technology, tools, and... ..., while improving system reliability through enhanced monitoring... ...experience in Software Engineering, Site Reliability Engineering,...SuggestedPermanent employmentTemporary workCasual workWork at officeFlexible hours- ...California seeks an experienced Staff Software Engineer to lead the technical direction of their data collection platforms. You will design systems for high-quality... ...with multiple teams, and oversee integrations, observability tools, and best engineering practices. A strong...Suggested
$150k - $195k
...operational efficiency of the Lacework platform Design, build and improve... ...best practices alongside engineering/operations teams to improve the scalability and reliability of internal processes.... ...Experience with monitoring and observability systems and tools (Prometheus...SuggestedFull timeWorldwide$147.4k - $272.1k
Site Reliability Engineer, Enterprise Technology Services Sunnyvale, California, United States Software... ...groundbreaking, world-changing platforms and services. Our ETS applications play... ...of SRE principles, including observability, error budgeting, service reliability...Relocation$147.4k - $272.1k
.... We are a team of software engineers developing web-based tools and... ...day. We’re looking for a Site Reliability Engineer who thinks like a systems... ...— you’ll shape how our platform evolves. Our team operates 5... ...services communicate, how we observe production behavior, and how...RelocationShift work$147.4k - $272.1k
Site Reliability Engineer (Edge Services), Infrastructure Services Sunnyvale, California, United States Software and Services We are seeking... ...role in ensuring our services are resilient, scalable, and observable, bridging the gap between complex distributed systems and...RelocationShift work$172.1k - $258.6k
Site Reliability Engineer, Physical Infrastructure Cupertino, California, United States Software and Services We are looking for a creative and... ...: Design, build, and maintain robust, scalable, and observable systems for our core infrastructure services Automate: Reduce...WorldwideRelocation$188k - $250k
...Staff Software Engineer, Observability CoreWeave is The Essential Cloud for... ...pioneers, CoreWeave delivers a platform of technology, tools, and... ...highly scalable, reliable, and secure systems. The... ...experience in Software Engineering, Site Reliability Engineering,...Permanent employmentTemporary workCasual workWork at officeFlexible hours$120.3k - $194.53k
...infrastructure across multiple public clouds. As a Site Reliability Engineer on the Internet Security Platform team, you will be part of a team supporting... ...troubleshoot complex distributed systems Experience with observability tools (OpenTelemetry, Chronosphere, etc.)...Full timeWork at officeVisa sponsorshipWork visa$235k - $295k
A data and AI company in Mountain View seeks a Software Engineer for the Observability team. This role involves developing solutions for product performance insights and managing cloud infrastructure. The ideal candidate has over 15 years of experience in software development...$184k - $287.5k
...organization is seeking a Senior System Software Engineer to lead the evolution of our next-generation Data & Observability Platform. We serve and collaborate directly with... ...distributed pipelines, and ensure platform reliability. What you’ll be doing: Architect High-...- ...Europe Role Overview Seeking a Senior Site Reliability Engineer / DevOps Engineer to design, scale,... ...from operating Kubernetes and cloud platforms at scale. The ideal candidate has... ...and rollback failures Reliability & Observability Establish SLIs/SLOs and production...
- ...a seasoned professional to join their engineering team. The candidate will design and build... ..., ensuring scalability and reliability. A solid background in software development... ...contributing to the development of advanced observability tools for AI solutions. This position...
$184k - $287.5k
...Gruppe is seeking a Senior System Software Engineer to lead the development of their next-generation Data & Observability Platform in Santa Clara, California. This role... ...experience improvements while ensuring platform reliability. The ideal candidate will have over 8...$200k - $287.5k
...Senior Software Engineer — Streaming Data Products At... ...how work gets done. Observe by Snowflake is an AI-powered observability platform built on the Snowflake... ...while maintaining reliability at enterprise scale. As... ...the Snowflake Careers Site for salary and benefits...Temporary workFlexible hours$200k - $287.5k
...how work gets done. Observe by Snowflake is an AI-powered observability platform built on the Snowflake AI Data Cloud and engineered for scale. We ingest and... ...while maintaining reliability at enterprise scale. As... ...the Snowflake Careers Site for salary and benefits...Flexible hours$152k - $241.5k
...generation of our global services platform. At NVIDIA, you’ll keep critically... ...host lifecycle management, fleet reliability/auto‑healing, E2E observability or data‑driven operations (AIOps/ML... ...Go, Perl, or Ruby. Mentored other engineers and influenced technical direction...- Sanas is looking for a skilled Production Engineer to manage the infrastructure for its high-scale, real-time speech AI platform. The candidate will design and implement robust... ...excellence, developer velocity, and deep observability across systems. #J-18808-Ljbffr Sanas
$200k - $287.5k
...Senior Software Engineer At Snowflake, we are powering the era of the... ...future of how work gets done. Observe by Snowflake is an AI-powered observability platform engineered for scale — ingesting... ...posting on the Snowflake Careers Site for salary and benefits...Flexible hours$200k - $287.5k
...We are looking for a Senior Engineer in Observability to help define and build... ...for Snowflake's global data platform. This role sits at the intersection... ..., debugging, and reliability engineering principles... ...posting on the Snowflake Careers Site for salary and benefits information...Flexible hours$86.33k - $191.9k
...going safely. Identifying reliability anti-patterns and solving them... .... Leveraging AI tools and platforms in your daily work to... ...reduce toil, and improve system observability. Contributing to the definition... ...and platforms to increase engineering productivity, enforce code...Local areaFlexible hours- Education Requirements, Ideal Experience: Associate’s degree in Industrial Engineering or IT related field Minimum of 0-3 years’ relevant experience Knowledge of the application of tools/techniques Experience in one coding language (Preferred) Experience in Database (Preferred...
$147.4k - $220.9k
Site Reliability Engineer, Customer Systems Sunnyvale, California, United States Software and Services Imagine what you could do here. Apple is a place where extraordinary people gather to do their best work. Together we craft products and experiences people once couldn...Relocation- About the Role CrowdStrike's engineering organization depends on shared infrastructure platforms that power critical product capabilities... ...ownership to operate reliably, scale safely, harden for... ...infrastructure costs in check. Build observability - Set up metrics, dashboards,...Work at officeLocal area2 days per week
- ...Graph, our breach containment platform identifies and contains... ...world running. Location: 5 on-site days a week in Sunnyvale, CA... ...Headquarters. Our Team's Vision: Our Engineering team is shaping the future of... ...an experienced Senior Site Reliability Engineer (SRE) with a strong...Work experience placement
- Site Reliability Engineer Onsite- Bay Area, CA Skills Relevant Skills and Experience What You’ll Do (Day... ...performance using Grafana and other observability tools. Ensure high availability, reliability, and uptime across platforms. Handle infrastructure maintenance, upgrades...
- ...building an AI Data Center AIOps platform that turns raw, high‑volume telemetry into reliable, job‑centric insights and... .... Join our team of innovative engineers who are building this platform... ...ownership of reliability for an observability/AIOps platform: SLOs/SLIs, on‑...
- The Role We're looking for a Senior Site Reliability Engineer to own the reliability, scalability,... ...production systems that power Nectar's platform. We run high-volume data ingestion... ..., and error budgets, and build the observability, alerting, and on‑call practices to...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Site Reliability Engineer: Platform & Observability. Be the first to apply!
- platform developer Cupertino, CA
- platform engineer Cupertino, CA
- on-site clinical research associate (traveling/remote) Cupertino, CA
- junior website developer Cupertino, CA
- platform product manager Cupertino, CA
- platform manager Cupertino, CA
- site reliability engineer remote
- site reliability engineer sre
- site reliability engineer
- site reliability engineering manager


