Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Site Reliability Engineering Sênior

DAREDE

Descrição da vaga A Darede tem o objetivo de continuar revolucionando os negócios em Cloud no Brasil e nos tornarmos a mais relevante consultoria do segmento, afinal, THE FUTURE IS CLOUD! Buscamos um SRE Sênior apaixonado por estabilidade, performance e automação para integrar uma Squad estratégica de Engenharia de Resiliência. O desafio principal é atuar de forma proativa em um ecossistema complexo e de alta criticidade, movendo a operação de um modelo reativo para uma cultura de confiabilidade. Você será responsável por projetar e implementar soluções que previnam falhas, garantindo que sistemas que sustentam a receita do negócio operem com máxima disponibilidade. Se você tem curiosidade e vontade de aprender novas ferramentas, plataformas e tecnologias, é Data Driven e HandsOn, é uma pessoa sempre antenada nas novidades do Mundo Cloud, essa vaga é para você! Candidate-se, queremos te conhecer! Responsabilidades e atribuições Liderança em Incidentes: Atuar como Líder de Resposta a Incidentes em War Rooms, coordenando a resolução técnica e a comunicação com stakeholders. Engenharia de Observabilidade: Projetar e evoluir a telemetria no Datadog (Logs, APM, Traces e métricas de negócio) para reduzir o MTTD e o esforço cognitivo do time. Gestão de Workloads em AWS Amplify: Garantir a resiliência e a escalabilidade de aplicações front‑end e APIs críticas hospedadas. Governança de SRE: Definir e monitorar SLIs, SLOs e SLAs, gerindo o Error Budget para equilibrar a velocidade de entrega com a estabilidade. Automação de Mitigação: Desenvolver ferramentas e scripts de auto‑healing (rollback automático, restart controlado, isolamento de componentes). Análise de Causa Raiz: Conduzir processos de Post‑mortem blameless e garantir a implementação de melhorias estruturais para evitar reincidências. Modernização de Sistemas: Atuar junto aos times de desenvolvimento para implementar padrões de resiliência (Circuit Breakers, Bulkheads e Rate Limiting) tanto em arquiteturas modernas quanto em sistemas legados. IA na Operação: Implementar soluções de detecção de anomalias e resposta inteligente utilizando AIOps (Datadog Bits AI ou AWS DevOps Agent). Requisitos e qualificações Senioridade comprovada em SRE ou DevOps: Experiência sólida em ambientes de alta escala e missão crítica. Domínio Profundo de AWS: Experiência avançada em EC2, RDS, S3, IAM, EKS e Amplify. Domínio de ferramentas de Observabilidade: Sólida experiência em monitoramento, logs e APM (preferencialmente utilizando Datadog). Containers & Orquestração: Sólidos conhecimentos em Docker e Kubernetes (EKS/GKE). Infraestrutura como Código (IaC): Domínio de Terraform. Desenvolvimento/Scripts: Fluidez em Python, Go ou Shell Script para automação. Gestão de Incidentes: Experiência real com plantões on‑call e resolução de problemas em tempo real. Diferenciais (Desejáveis) Perfil Analítico para Sistemas Legados: Experiência em troubleshooting de aplicações em .NET Framework e bancos de dados Oracle ou PostgreSQL. Chaos Engineering: Experiência na execução de testes de estresse e resiliência controlados. Certificações: AWS Certified DevOps Engineer – Professional ou Certificações oficiais Datadog. Competências Comportamentais Perfil de liderança técnica e resiliência sob pressão. Excelente comunicação para interagir com áreas de negócio e tecnologia. Protagonismo e senso de responsabilidade na resolução definitiva de problemas. Informações adicionais

BENEFÍCIOS

Incentivos Educacionais (Parcerias com Instituições de Ensino) Férias Remuneradas TotalPass Birthday off Assistência Médica Assistência Odontológica Licença Maternidade Licença Paternidade Reembolso em Certificações AWS #J-18808-Ljbffr DAREDE

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Site Reliability Engineering Sênior in New York, NY vacancy
  • $182.3k - $220k

     ...healthcare by putting patients first - and that mission depends on reliable, secure, and scalable systems. As a Senior SRE on the...  ...hardening infrastructure and building tools that empower our engineers to ship safely and confidently.   You will work across teams... 
    Suggested
    Local area
    Flexible hours

    Ro

    New York, NY
    27 days ago
  •  ...Job Description Job Description Forhyre is looking for engineers who can bring unique perspectives and innovative ideas to all areas...  ...evangelize cloud best practices while building a culture of reliability and observability Engage in and improve the end to end lifecycle... 
    Suggested

    Forhyre

    New York, NY
    21 days ago
  •  ...they are shifting towards Linux - (70% Windows, 30% Linux) Remote access technology protocols are a plus Job Description: Site Reliability Engineer Periodic updates and maintenance of Windows-based golden image for ESX & AWS. Patching of software, systems, appliances etc... 
    Suggested
    Remote work
    Shift work

    TechDigital Group

    New York, NY
    18 hours ago
  • $7.5k

     ...and benefits packages, technology talks by our experts, a beautiful modern office, daily catered lunches, and more. As a Site Reliability Engineer (SRE), you will work at the intersection of production operations and software development as you improve, manage, and monitor... 
    Suggested
    Work at office
    Local area

    The Voleon Group

    New York, NY
    2 days ago
  •  ...contribute to meaningful impact and be part of a team dedicated to enhancing security and fighting fraud. We are seeking a Senior Site Reliability Engineer (Senior SRE) to drive reliability improvements across our production SaaS environment. You’ll play a critical role in... 
    Suggested
    Remote work
    Flexible hours
    Night shift

    CERTIFID

    New York, NY
    1 day ago
  •  ...future of legal tech — we’re defining it. Ready to join us in building the intelligent future of law? The role As a Senior Site Reliability Engineer you'll join the founding SRE team at our new NYC engineering hub, sitting within Foundations. You'll own critical services... 
    Work at office

    Legora AB

    New York, NY
    1 day ago
  • $123k - $165k

    Department/Group Overview Our engineering fleet is a horizontal set of teams providing engineering...  .... Our specific team provides reliability engineering and operational support to backend...  ...products and brands. We are seeking a Site Reliability Engineer who will contribute... 

    The Walt Disney Company (France)

    New York, NY
    4 days ago
  •  ...the future of the Internet. Summary At Latitude.sh, the Reliability team is responsible for the health and resilience of the infrastructure...  ...that powers our global bare metal cloud. As a Senior Site Reliability Engineer (SRE), you’ll focus on building reliable, observable, and... 
    For contractors

    Latitude.sh

    New York, NY
    2 days ago
  • $150k - $170k

    Senior Site Reliability Engineer - Zip Co Join to apply for the Senior Site Reliability Engineer role at Zip Co At Zip, we build cloud‑native software applications that serve millions of customers and process billions of dollars in payments. We’re looking for a seasoned... 
    Casual work
    Work at office
    Remote work
    Flexible hours

    Zip Co

    New York, NY
    8 hours ago
  • $150k - $175k

    Site Reliability Engineer The world of digital assets is accelerating in speed, magnitude, and complexity, opening the door to new ways for leveraging the blockchain. Fireblocks’ platform and network provide the simplest and most secure way for companies to work with digital... 
    Remote work

    Fireblocks

    New York, NY
    1 day ago
  • $111k - $160k

    Join Mizuho as a Site Reliability Engineer! In this role you will play a crucial role in maintaining the reliability, scalability, and overall performance of our production systems. This position collaborates closely with development, operations, and product teams to automate... 
    Work at office
    Local area
    Remote work

    Mizuho Financial Group Inc.

    New York, NY
    18 hours ago
  • $175k - $225k

     ...from enterprises across different industries. We’re fully in‑person at our NYC HQ near Union Square and are looking for exceptional engineers who are passionate about creating great products. The Role You’ll play a key role in designing and developing the core systems... 

    I did my part and supported the Regular Toilet

    New York, NY
    1 day ago
  • $170k - $220k

    Location New York City Employment Type Full time Location Type On-site Department Engineering & Product Engineering Compensation $170K - $220K • Offers Equity The role As a Site Reliability Engineer at Legora you'll join the founding SRE team at our new NYC engineering... 
    Full time
    Work at office

    Menlo Ventures

    New York, NY
    4 days ago
  • SRE (Site Reliability Engineer) (Intern) Short-term unpaid remote internship for students. Work with a mentor on our SRE team. You'll learn how to keep systems reliable, respond to incidents, and build infrastructure that scales as the platform grows. About GoOffer Go... 
    Temporary work
    Internship
    Remote work

    Go Offer

    New York, NY
    2 days ago
  • $143k - $179k

     ...and applications ensure you can connect with your customers reliably and securely, at every step of their journey. At Sinch we...  ...and global enterprises alike. We're looking for a Senior Site Reliability Engineer to join our SRE team, the group responsible for keeping... 
    Remote work
    Flexible hours

    Sinch

    New York, NY
    2 days ago
  • We are hiring a Senior Site Reliability Engineer to help build and operate the infrastructure foundation that supports engineering teams. The role centers on reliability, scalability, cloud infrastructure, Kubernetes operations, and automation that allows developers to... 

    Rad-Hires

    New York, NY
    2 days ago
  • $170k - $230k

     ...improve the technical foundations of Perchwell while exemplifying engineering rigor and excellence across our engineering culture and...  ...responsible for building the ability to innovate faster in a safe and reliable way. Reliability, resiliency and adaptability are our north... 
    Work experience placement
    Work at office
    Flexible hours
    3 days per week
    1 day per week

    Perchwell

    New York, NY
    1 day ago
  • $180k - $200k

    Parabola is looking for a Senior Site Reliability Engineer to improve performance and reliability of its software systems in New York. This role requires 5+ years of SRE or DevOps experience and expertise in AWS and containerization tools. Offering a salary of $180,000... 
    Work at office
    3 days per week

    Parabola

    New York, NY
    2 days ago
  • $175k - $200k

     ...proudly named a 50 to Watch by Spend Matters and a Best Place to Work by BuiltIn and Inc. Magazine. The Role As a Senior Site Reliability Engineer on the Platform team, you will ensure that software systems are reliable, scalable, performant, and operationally efficient... 
    Part time
    Work at office
    Flexible hours

    Order.co

    New York, NY
    2 days ago
  • ⚡ Senior Site Reliability Engineer (Azure) The Company Storm2's client is a fast-growing software company at the centre of one of the more credible enterprise blockchain ecosystems in market, supporting a proof-of-stake public network governed by major institutions across... 

    Storm2

    New York, NY
    2 days ago
  •  ...Founded by key contributors to Bazel, we build tools that empower engineering teams—from startups to Fortune 500 companies—to enhance...  ...pipelines to monitoring and recovery Manage scalability and reliability for high-throughput, low-latency systems Implement and maintain... 
    Remote work

    EngFlow

    New York, NY
    2 days ago
  • Como SRE você vai: Contribuir e evoluir soluções que agregam à plataforma e área de tecnologia. Difundir, orientar e conscientizar o time de tecnologia sobre serviços e ferramentas internas, influenciando diretamente na DevXperience dos times. Possuir o ownership dos...
    Remote work
    Home office

    idwall

    New York, NY
    2 days ago
  • We are seeking a Site Reliability Engineer (SRE) with strong expertise in Google Cloud Platform (GCP) and Kubernetes to ensure the reliability, performance, and scalability of cloud and on-premise systems. This role focuses on building resilient infrastructure, automating... 

    Compunnel, Inc.

    New York, NY
    1 day ago
  • $93.9k - $156.5k

    Hybrid role , 2 days on site. Role is located in NYC with alternative location Chicago...  .... Working hours: 9am‑5pm EST. Site Reliability EngineerII (Tuesday‑Saturday). CME Group...  ...successful candidate will work alongside senior engineers to learn how we observe, monitor,... 
    Local area

    CME Chicago Mercantile Exchange Inc.

    New York, NY
    1 day ago
  • Overview The role of an Intermediate Site Reliability Engineer at Remotive focuses on enhancing system reliability and performance in a dynamic work environment. Applicants should possess excellent technical skills to ensure robust operations and a seamless user experience... 
    Remote work

    DevOpsChat

    New York, NY
    2 days ago
  •  ...Malaysia will be considered. Job Responsibilities System Architecture: Review architecture and software components with software engineers. Ensure best practices are consistent across all teams. Operational Excellence: Own and ensure SLOs and SLAs are met. Monitor operational... 
    Work at office
    Remote work

    CoinGecko

    New York, NY
    2 days ago
  • Position: Senior Site Reliability Engineer + MongoDB Basic Purpose The Platform Database Engineer is responsible for designing, deploying, administering, and optimizing MongoDB (Atlas and on-premise) databases within a large-scale, cloud-based enterprise ecosystem. The... 
    Remote work
    Work from home

    HCLTech

    New York, NY
    2 days ago
  •  ...forward to hearing from passionate, goal-oriented applicants ready to make their mark in the blockchain space. As a Senior Site Reliability Engineer, you'll work at the intersection of cloud infrastructure and blockchain, building the platform that our product teams... 

    SSV Labs

    New York, NY
    2 days ago
  • $170k - $225k

    About The Role Zora is looking for an experienced infrastructure / site reliability software engineer to work closely with the development team to ensure that the infrastructure / site reliability meets the needs of the business and is scalable and highly available, including... 
    Local area
    Remote work
    Home office
    Flexible hours

    Framework Ventures

    New York, NY
    2 days ago
  • $150k - $200k

    Join to apply for the Senior Site Reliability Engineer role at Gradle Inc. Develocity is a first‑of‑its‑kind toolchain observability and acceleration platform that helps software teams adopt and improve DORA capabilities (including continuous delivery) in order to achieve... 
    Full time
    Local area
    Remote work
    Work from home

    Gradle Inc.

    New York, NY
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Site Reliability Engineering Sênior. Be the first to apply!