Site Reliability Engineer - Kafka

$139.5k - $258.1k

Apple Inc.

Seattle, Washington, United States Software and Services The Apple Service Engineering – Data Streaming SRE team is looking for Site Reliability Engineers with experience developing processes, tools, and automation for managing distributed systems in production environments. Our SRE team combines software engineering, systems engineering, and Devops practices to build and run large-scale, massively distributed, fault‑tolerant systems. Our software ensures that Apple's services are reliable, scalable, and secure, and we leverage both open‑source and homegrown technologies to provide managed data infrastructure services. You will help build next‑generation Kafka infrastructure and platform services, collaborating cross‑functionally with various ASE teams— from store and commerce to search and recommendations. You'll create platforms that can rapidly scale to serve data with very low latencies. You should be someone who isn't afraid to question assumptions, thrives as a collaborative partner under tight deadlines, and tackles complex problems with elegant technical solutions. Description The Data Service SRE team develops applications and tooling that are safe, reliable, scalable, and fast. This work requires an innovative spirit and an extraordinary degree of care and difficulty in engineering. Team members contribute to all major components of Kafka deployment infrastructure, including maintenance automation, control plane enhancements, monitoring and alerting tooling/dashboards, advanced deployment architecture, focused on safety, stability, performance, and scaling. Come join us at Apple Services Engineering and help us deliver services and applications that are fluid and responsive. You will collaborate with engineers from across Apple to define the metrics, set targets, uncover optimization opportunities, and ship a service that will delight our customers. This role is for engineers who enjoy deep technical engineering that spans large cross‑organizational projects. Your openness to learning and implementing new technologies will contribute to the continuous evolution of our organization. Good ideas are valued and rewarded. Responsibilities Understanding of core SRE concepts – Monitoring, Alerting, Incident management Deep and wide performance engineering (design concepts, profile‑guided optimization) Service lifecycle management across bare metal, and virtualized (EC2), Kubernetes platforms Prepare alert handling procedures, run‑books, and collaborate with other SRE team members. Excellent communication and a high degree of customer focus when engaging with internal platform customers As a distributed team, ability to work optimally with colleagues based in other locations is essential Prior experience with development or maintenance of Kafka infrastructure or similar data service is highly recommended Minimum Qualifications Support of internet‑facing production services and distributed systems via deployments, On Call and Incident Management. Experience running large scale infrastructure with a heavy reliance on automation tooling Excellent troubleshooting and performance deep dive analysis Real operational experience managing services at scale on Kubernetes Proficient in one or more of the following programming languages: Java, Go (golang), Python Operational experience deploying in and running on Datacenter and Cloud architectures (networking topologies, host placement strategies, and failure modes); design of multi‑datacenter systems; failure domains; and wide‑area networking. Self‑motivated, inquisitive with an aptitude to learn new technologies quickly and effectively. Demonstrated expertise developing and troubleshooting distributed systems and database storage engines. Experience developing critical internet services and/or platform infrastructure. Experience with AWS, GCP and IaC such as Terraform Preferred Qualifications Experience managing messaging services such as Kafka or other Data services Apple: base pay is one part of our total compensation package and is determined within a range. The base pay range for this role is between $139,500 and $258,100, and your base pay will depend on your skills, qualifications, experience, and location. Apple employees also have the opportunity to become Apple shareholders through participation in Apple’s discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple’s Employee Stock Purchase Plan. You’ll also receive benefits including comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses—including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program. Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant. #J-18808-Ljbffr Apple Inc.

Apply

Vacancy posted 5 days ago

Similar jobs that could be interesting for youBased on the Site Reliability Engineer - Kafka in Seattle, WA vacancy

Senior Site Reliability Engineer
$139.5k - $258.1k
...States Software and Services Apple Services Engineering team is one of the most exciting... ...Cloud Service Infrastructure team, as a Site Reliability Engineer, to help support and scale cloud... ...as Kubernetes, Cassandra, Zookeeper, Kafka, Redis, etc, alongside internally developed...
Suggested
Relocation
Apple Inc.
Seattle, WA
4 days ago
Ground System Site Reliability Engineer II
$134.96k - $188.95k
## Ground System Site Reliability Engineer IIApplylocations: Greater Seattle Areatime type: Full timeposted on: Posted Yesterdayjob requisition... ...complex applications in Python* Experience with REST APIs, Kafka, GRPC* Experience in web development and/or database development...
Suggested
Permanent employment
Temporary work
Local area
Blue Origin LLC
Seattle, WA
3 days ago
Java Developer (Kafka)
...Java Microservices Developer Cloud experience with AWS Senior Java development experience Strong troubleshooting skillset Kafka & REST API development for services with high transaction rates Container/Kubernetes microservices using SpringBoot on AWS...
Suggested
Samprasoft
Seattle, WA
1 day ago
Senior Site Reliability Engineer (SRE)
...APPIT Software Solutions is hiring a Senior Site Reliability Engineer (SRE) in Seattle, USA . Lead site reliability engineering efforts for large-scale distributed systems, driving 99.99% availability targets through advanced observability, automation, and resilience engineering...
Suggested
Flexible hours
Appit LLC
Seattle, WA
2 days ago
Sr. Site Reliability Engineer
...Role Overview We are seeking a high-caliber Site Reliability Engineer (SRE) to join our Forward Engineering team. You will be the guardian of our production ecosystems, ensuring that our complex, data-driven AI platforms remain resilient, scalable, and highly performant...
Suggested
Local area
Tiger Analytics
Seattle, WA
2 days ago
Staff Software Engineer - Java | Kafka | Kubernetes
$140.6k - $173.1k
Staff Software Engineer - Java | Kafka | Kubernetes page is loaded## Staff Software Engineer - Java | Kafka | Kuberneteslocations: New Jersey... ...make an impact*** Design, develop, and implement scalable and reliable software solutions using Kafka, ElasticSearch, and other...
Work at office
Remote work
WEX
Seattle, WA
5 days ago
Site Reliability Engineer IV
$135.6k - $230.5k
## Site Reliability Engineer IVApplylocations: Mountlake Terrace WAtime type: Full timeposted on: Posted 2 Days Agojob requisition id: R28853**Workforce Classification:**Hybrid**Join Our Team: Do Meaningful Work and Improve People’s Lives**Our purpose, to improve customers...
Premera
Mountlake Terrace, WA
3 days ago
Senior Site Reliability Engineer I
$134.25k - $214.8k
...matters at a company where you matter. Your Impact Are you an engineer who gets excited about the challenge of making complex... ...it. You will be part of the Observability team within Axon's Site Reliability organization - a focused team responsible for Axon's metrics,...
Work experience placement
Work at office
Remote work
Koitecc Solutions
Seattle, WA
2 days ago
Senior Site Reliability Engineer
...role, we encourage you to apply. The Role As a Senior Platform Engineer, you are a champion for DevOps and SRE culture and industry... ...goals are met. What You Will Be Doing Improving production reliability and system resilience within an SRE scoped team Championing high...
Flexible hours
Megaport
Seattle, WA
6 days ago
Site Reliability Engineer II
$135k - $154k
...Axon we are on a mission to Protect Life. As the APX platform engineering organization works on our CloudNet team, we build and operate... ...mission‑critical services and maintain the high quality and reliability that our customers demand. You will work closely with sovereign...
Accreditation Council for Graduate Medical Education
Seattle, WA
2 days ago
Senior Site Reliability Engineer I Seattle, Washington, United States Seattle, Washington
$150k - $180k
...tooling that improve cloud infrastructure reliability, scalability, and operational efficiency... ...platforms and tools that enable engineering teams to provision services rapidly, consistently... ...engineering, cloud infrastructure, or site reliability engineering. Experience...
Axon Enterprise
Seattle, WA
5 days ago
Site Reliability Engineering - Sr. Software Development Engineer
$177.57k - $248.59k
Site Reliability Engineering - Sr. Software Development Engineer Implement and manage the infrastructure for rapid development and deployment of software at Blue Origin. Location: Seattle Compensation: $177,567 - 248,593 USD / year Job Tags: Software About the Role...
Permanent employment
Temporary work
Local area
jobs.frontdoordefense.com - Jobboard
Seattle, WA
4 days ago
Cleared Site Reliability Engineer - Database
Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible...
Ll Oefentherapie
Seattle, WA
6 days ago
Data Migration Backend Engineer — Real-Time with Java & Kafka
MongoDB, Inc. is seeking a Software Engineer to join its North America team. The role involves designing and optimizing data structures... ...backend experience and familiarity with Java, Spring Boot, and Kafka. Position includes collaboration with AI researchers and contributing...
jobr.pro
Seattle, WA
6 days ago
Site Reliability Engineer III - DevOps Engineer
...and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems. As a Site Reliability Engineer III - DevOps Engineer at JPMorgan Chase within the Commercial and Investment Bank, you will solve complex and broad...
Next Frontier Capital
Seattle, WA
2 days ago
Director, Site Reliability Engineering
...infrastructure and service and provides input on best practices for reliability and functionality. Establishes direction to ensure accurate... ...with new technology, executing improvements, building site reliability knowledge, and providing clear data. #J-18808-Ljbffr...
Ll Oefentherapie
Seattle, WA
2 days ago
Principal Site Reliability Engineer
$163.62k - $212.71k
...maintaining the tools, platforms, and processes that improve our engineering teams’ productivity and streamline the software development... ...: We are seeking a seasoned and strategic Lead/Principal Site Reliability Engineer to drive the reliability, scalability, and...
Permanent employment
Full time
Part time
Work experience placement
Work at office
Local area
Immediate start
Remote work
Work from home
Flexible hours
Shift work
3 days per week
1 day per week
iSpot.tv, Inc.
Bellevue, WA
4 days ago
Infrastructure & Technology Site Reliability Engineer Professional Bellevue, US
Site Reliability Engineer Your role and responsibilities Manage deployments of Apptio services to AWS GovCloud. Monitor KPIs of services running in AWS GovCloud. Author and maintain documentation of deployment and monitoring processes. Use runbooks to troubleshoot and...
Temporary work
Remote work
IBM
Bellevue, WA
5 days ago
Site Reliability Engineer (Senior or Staff), Infrastructure Security
$127k - $249k
Senior / Staff Engineer - SRE, InfraSec We are looking for an experienced Senior or Staff Engineer for our SRE, InfraSec team to guide the security of our cloud‑based infrastructure. You will be highly hands‑on technically while also mentoring a small team of SREs. The...
Local area
Remote work
The Consulting Solutions
Seattle, WA
3 days ago
Storage Site Reliability Engineer - Apple Service Engineering
$139.5k - $258.1k
...Washington, United States Software and Services Apple Services Engineering (ASE) designs, builds, and operates the cloud infrastructure... ...a highly skilled, collaborative, and pragmatic Storage Site Reliability Engineer to join our team. In this role, you will help build...
Relocation
Apple Inc.
Seattle, WA
5 days ago
Senior Site Reliability Engineer - Cloud Platforms & Automation
Blue Origin is seeking a Site Reliability Engineer to enhance the digital infrastructure supporting safe human spaceflight. This role involves improving the software lifecycle from design to deployment, particularly in cloud environments. The ideal candidate will possess...
jobs.frontdoordefense.com - Jobboard
Seattle, WA
4 days ago
Principal Site Reliability Engineer
Designs and architects infrastructure and service to ensure reliability and functionality. Forecasts demands and responds to capacity needs... ...new tools and develops and maintains advanced knowledge of site reliability trends. Only Oracle brings together the data, infrastructure...
Full time
Flexible hours
Oracle
Seattle, WA
1 day ago
Site Reliability Manager, GCE Node, Site Reliability Engineering
$207k - $301k
Site Reliability Manager, GCE Node, Site Reliability Engineering Google Seattle, WA, USA Benefits Health, dental, vision, life, disability insurance Retirement Benefits: 401(k) with company match Paid Time Off: 20 days of vacation per year, accruing at a rate of 6....
Temporary work
Google Inc.
Seattle, WA
3 days ago
Sr. Software Engineer, Systems Infrastructure
$139k - $229k
...production environment a platform that enables search quality engineers to rapidly innovate, experiment and improve relevance-while at... ...such as Hadoop, Spark, Kubernetes, Feather, GraphQL, GRPC, Apache Kafka, Pinot, Samza or Venice Experience with open-source project...
For contractors
Work experience placement
Work at office
Flexible hours
LinkedIn
Bellevue, WA
4 days ago
System Software Engineer
...Cloud Storage Engineer The Cloud Storage team is in charge of design and implementation of New Generation Cloud Storage services: including... ...and experience on containerization (docker/Swarm, K8s) and/or Big data analytics (Spark, Flink, Kafka, etc) is a plus....
Netpace
Bellevue, WA
5 days ago
Platform Engineer
...Platform Engineer Location: Onsite in Seattle, WA Visa: GC and USC only - W2 Role Location: - Seattle... .... Examples include Java/Spring Boot, CQRS, event streaming, Kafka, Spark Knowledge in databases Desired Skills & Experience...
Local area
Equiliem
Seattle, WA
4 days ago
Software Engineer II - Ad Platform, CRM Analytics
$120.3k - $161.3k
...Software Engineer II - Ad Platform, CRM Analytics Disney Entertainment and ESPN Product & Technology Technology is at the heart... ...persistence and data streaming technology experience (Snowflake, Kafka). Experience with Python and Databricks is highly desirable....
Work experience placement
Local area
Disney France
Seattle, WA
5 days ago
Cloud Platform Engineer - Privacy‑Focused Distributed Systems
$139.5k - $258.1k
Software Engineer, Apple Services Engineering Seattle, Washington, United States Software and Services We are looking for an experienced... ...Familiar with event streaming and queueing systems, e.g., Kafka Experience using AWS, GCP, and cloud-native technologies (Containers...
Relocation
Apple Inc.
Seattle, WA
5 days ago
Senior Software Engineer, Loans Platform
...lending landscape. SoFi is seeking enthusiastic Senior Software Engineers who are ready to lead the development of key advancement to the... ...compute, storage, and infrastructure. ~ Experience with Kafka, Docker, Kubernetes, and Spring Boot or similar frameworks. ~...
Full time
Work experience placement
Remote work
SoFi
Seattle, WA
5 days ago
Software Engineer II - Ad Platform, CRM Analytics
...Software Engineer Disney Entertainment and ESPN Product & Technology is a global organization of engineers, product developers, designers... ...and data streaming technology experience (Snowflake, Kafka). Experience with Python and Databricks is highly desirable....
Work experience placement
The Walt Disney Studios
Seattle, WA
4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Site Reliability Engineer - Kafka. Be the first to apply!