Site Reliability Engineer - Kafka
$139.5k - $258.1k
Apple Inc.
Seattle, Washington, United States
Software and Services

The Apple Services Engineering (ASE) Data Streaming SRE team is looking for Site Reliability Engineers with experience developing processes, tools, and automation for managing distributed systems in production environments. Our SRE team combines software engineering, systems engineering, and DevOps practices to build and run large-scale, massively distributed, fault-tolerant systems. Our software ensures that Apple's services are reliable, scalable, and secure, and we leverage both open-source and homegrown technologies to provide managed data infrastructure services.

You will help build next-generation Kafka infrastructure and platform services, collaborating cross-functionally with various ASE teams, from store and commerce to search and recommendations. You'll create platforms that can rapidly scale to serve data with very low latencies. You should be someone who isn't afraid to question assumptions, thrives as a collaborative partner under tight deadlines, and tackles complex problems with elegant technical solutions.

Description

The Data Service SRE team develops applications and tooling that are safe, reliable, scalable, and fast. This work requires an innovative spirit and an extraordinary degree of care and difficulty in engineering. Team members contribute to all major components of Kafka deployment infrastructure, including maintenance automation, control plane enhancements, monitoring and alerting tooling and dashboards, and advanced deployment architecture, with a focus on safety, stability, performance, and scaling.

Come join us at Apple Services Engineering and help us deliver services and applications that are fluid and responsive. You will collaborate with engineers from across Apple to define the metrics, set targets, uncover optimization opportunities, and ship a service that will delight our customers.
This role is for engineers who enjoy deep technical engineering that spans large cross-organizational projects. Your openness to learning and implementing new technologies will contribute to the continuous evolution of our organization. Good ideas are valued and rewarded.

Responsibilities

- Understanding of core SRE concepts: monitoring, alerting, and incident management
- Deep and wide performance engineering (design concepts, profile-guided optimization)
- Service lifecycle management across bare-metal, virtualized (EC2), and Kubernetes platforms
- Prepare alert handling procedures and runbooks, and collaborate with other SRE team members
- Excellent communication and a high degree of customer focus when engaging with internal platform customers
- As a distributed team, the ability to work optimally with colleagues based in other locations is essential
- Prior experience with development or maintenance of Kafka infrastructure or a similar data service is highly recommended

Minimum Qualifications

- Support of internet-facing production services and distributed systems via deployments, on-call, and incident management
- Experience running large-scale infrastructure with a heavy reliance on automation tooling
- Excellent troubleshooting and performance deep-dive analysis
- Real operational experience managing services at scale on Kubernetes
- Proficient in one or more of the following programming languages: Java, Go (golang), Python
- Operational experience deploying in and running on datacenter and cloud architectures (networking topologies, host placement strategies, and failure modes); design of multi-datacenter systems; failure domains; and wide-area networking
- Self-motivated and inquisitive, with an aptitude to learn new technologies quickly and effectively
- Demonstrated expertise developing and troubleshooting distributed systems and database storage engines
- Experience developing critical internet services and/or platform infrastructure
- Experience with AWS, GCP, and IaC tools such as Terraform

Preferred Qualifications

- Experience managing messaging services such as Kafka or other data services

At Apple, base pay is one part of our total compensation package and is determined within a range. The base pay range for this role is between $139,500 and $258,100, and your base pay will depend on your skills, qualifications, experience, and location.

Apple employees also have the opportunity to become Apple shareholders through participation in Apple's discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple's Employee Stock Purchase Plan. You'll also receive benefits including comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and, for formal education related to advancing your career at Apple, reimbursement for certain educational expenses, including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation.

Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.

