Senior Site Reliability Engineering
DAREDE
Job description
Darede aims to keep revolutionizing the Cloud business in Brazil and to become the most relevant consultancy in the segment; after all, THE FUTURE IS CLOUD! We are looking for a Senior SRE who is passionate about stability, performance, and automation to join a strategic Resilience Engineering squad. The main challenge is to work proactively in a complex, highly critical ecosystem, moving operations from a reactive model to a culture of reliability. You will be responsible for designing and implementing solutions that prevent failures, ensuring that the systems that sustain the business's revenue run with maximum availability. If you are curious and eager to learn new tools, platforms, and technologies, are data-driven and hands-on, and keep up with what is new in the cloud world, this role is for you! Apply, we want to meet you!

Responsibilities and duties
- Incident Leadership: Act as Incident Response Lead in war rooms, coordinating technical resolution and communication with stakeholders.
- Observability Engineering: Design and evolve telemetry in Datadog (logs, APM, traces, and business metrics) to reduce MTTD and the team's cognitive load.
- Workload Management on AWS Amplify: Ensure the resilience and scalability of hosted front-end applications and critical APIs.
- SRE Governance: Define and monitor SLIs, SLOs, and SLAs, managing the error budget to balance delivery speed with stability.
- Mitigation Automation: Develop auto-healing tools and scripts (automatic rollback, controlled restart, component isolation).
- Root Cause Analysis: Run blameless post-mortems and ensure that structural improvements are implemented to prevent recurrence.
- Systems Modernization: Work with development teams to implement resilience patterns (Circuit Breakers, Bulkheads, and Rate Limiting) in both modern architectures and legacy systems.
- AI in Operations: Implement anomaly detection and intelligent response solutions using AIOps (Datadog Bits AI or AWS DevOps Agent).

Requirements and qualifications
- Proven seniority in SRE or DevOps: Solid experience in high-scale, mission-critical environments.
- Deep AWS expertise: Advanced experience with EC2, RDS, S3, IAM, EKS, and Amplify.
- Command of observability tooling: Solid experience with monitoring, logs, and APM (preferably using Datadog).
- Containers & Orchestration: Solid knowledge of Docker and Kubernetes (EKS/GKE).
- Infrastructure as Code (IaC): Proficiency in Terraform.
- Development/Scripting: Fluency in Python, Go, or Shell Script for automation.
- Incident Management: Hands-on experience with on-call rotations and real-time problem resolution.

Nice to have (desirable)
- Analytical profile for legacy systems: Experience troubleshooting .NET Framework applications and Oracle or PostgreSQL databases.
- Chaos Engineering: Experience running controlled stress and resilience tests.
- Certifications: AWS Certified DevOps Engineer - Professional or official Datadog certifications.

Behavioral competencies
- Technical leadership profile and resilience under pressure.
- Excellent communication skills for interacting with business and technology areas.
- Ownership and a sense of responsibility for resolving problems definitively.

Additional information
BENEFITS
- Educational incentives (partnerships with educational institutions)
- Paid vacation
- TotalPass
- Birthday off
- Health insurance
- Dental insurance
- Maternity leave
- Paternity leave
- Reimbursement for AWS certifications

