Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

SRE (Site Reliability Engineer) Team Lead

Mattermost

Mattermost is the leading collaborative workflow platform for defense, intelligence, security, and critical infrastructure. Department of War and Fortune 500s, our platform runs on-premises and in private clouds, delivering secure messaging, file sharing, workflow automation, audio/screenshare, and project management—all with full data and operational Mattermost powers high-stakes workflows across mission planning, real-time, real-world operations, DevSecOps, incident response, and cyber defense—enabling secure collaboration from tactical edge and DDIL environments Teams operate across web, desktop, and mobile, with embedded interoperability for Microsoft Teams, Outlook, and Microsoft 365.

Mattermost is seeking an experienced and visionary Lead Site Reliability collaboration platform.

driving strategic initiatives for scalability, observability, performance, and automation across cloud and hybrid environments. and operations teams to ensure our customers in defense, government, and performance.

Define the strategy, architecture, and roadmap for Mattermost’s site reliability engineering function, aligning infrastructure initiatives with Lead the design, deployment, and optimization of production-grade containerized workloads, infrastructure-as-code, and compliant cloud Establish and evolve observability, monitoring, and alerting frameworks to ensure performance, reliability, and capacity planning at scale. * analysis, and systemic reliability improvements. * Partner with security and compliance teams to meet data sovereignty, security, and regulatory requirements. * risk, and scale operations. * Oversee cloud cost management and capacity planning to optimize infrastructure spending while meeting performance targets. * Build and maintain a developer platform that enables fast, secure software delivery and improves application stability in production. * BS in Computer Science, Cybersecurity, Software Engineering, or a related technical field, or equivalent experience, with 5+ years of relevant experience in site reliability engineering, DevOps, or cloud infrastructure Proven expertise in container orchestration platforms, ideally Kubernetes. * Strong background in cloud platforms, ideally AWS. * Demonstrated experience designing and implementing monitoring, alerting, and performance optimization strategies. * Proficiency in at least one scripting or programming language for Experience leading globally distributed teams in a remote-first environment. * S. government security clearance in the future. S. citizens and eligible under applicable Applicants must meet eligibility requirements for access to export-controlled Experience designing high-availability, disaster recovery, and scaling Exposure to GCP and Azure cloud environments. * Experience preparing, delivering, and maintaining software offerings through AWS Marketplace and other cloud provider marketplaces (e.g., Marketplace, Google Cloud Marketplace), including packaging, compliance Open-source contributions in reliability, DevOps, or infrastructure tooling. * Certifications in cloud infrastructure, reliability, or DevOps engineering (CKA, CKAD, AWS Certified Solutions Architect).

Mattermost is an EEO Employer, we are a remote-first, open-source company.

ensuring compliance with local laws and regulations, which takes time.

national origin, age, disability, pregnancy status, veteran status, or other personal characteristics.
Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the SRE (Site Reliability Engineer) Team Lead in New York, NY vacancy
  •  ...States | Posted on 11/13/2025 Title: Senior Site Reliability Engineer (SRE) Location: Remote AboutJanuary AtJanuary,...  ...pipeline reliability — enabling the engineering team to ship confidently and efficiently. KeyResponsibilities Lead incident response and develop sustainable... 
    Suggested
    Remote work

    Govserviceshub

    New York, NY
    3 days ago
  •  ...hatch I.T. is partnering with CardioOne to find a Site Reliability Engineer (SRE) to join their team. See deteails below: About the Role: CardioOne is seeking...  ...( Datadog ). Participate in on-call rotations and lead incident response efforts. Perform root-cause analysis... 
    Suggested
    Full time

    Hatchit Co

    New York, NY
    1 day ago
  •  ...Why join us: Mission-driven teams: Work alongside industry...  ...looking for a experienced SRE to take ownership of reliability across our multi-region, cloud...  ...AWS, Pulumi, Kubernetes). Lead initiatives to improve...  ...harden the platform. Mentor engineers and set best practices for... 
    Suggested
    Remote work

    Oscilar

    New York, NY
    1 day ago
  •  ...We are seeking a Site Reliability Engineer (SRE) with deep expertise in AWS cloud infrastructure , Infrastructure as Code (IaC) , and large-scale...  ...scalable, secure, and highly available infrastructure in AWS Lead large‑scale AWS deployments across multi‑account, multi‑... 
    Suggested
    Remote work
    Shift work

    GR8 People

    New York, NY
    1 day ago
  • $60 - $65 per hour

     ...SRE Engineer (W2) Jersey City, NJ (Onsite) 6 Months Contract to Hire Job Description: Proficient in application development skills for more than one technology as well as multiple design techniques. Working proficiency in development toolset to design, develop, test,... 
    Suggested
    Full time
    Contract work
    Work experience placement

    Pinnacle Group

    Jersey City, NJ
    1 day ago
  •  ...Itlearn360 is seeking an experienced SRE Engineer to manage and automate infrastructure on AWS. The role requires over 8 years of experience as an SRE or DevOps Engineer, with strong skills in Terraform, Go-Lang, and container orchestration technologies like Docker and... 
    Remote work

    Itlearn360

    New York, NY
    1 day ago
  •  ...Versana is seeking a motivated SRE/DevOps Engineer with strong observability...  ...indicators. • Improve system reliability and resiliency. • Conduct...  ...failures. • Assist teams in implementing observability...  ...5+ years of experience as a Site Reliability Engineer or similar... 
    Work experience placement
    Local area

    Versana

    New York, NY
    17 hours ago
  •  ...DroneUp, LLC is hiring an SRE - Platform Engineer in the United States, focusing on the reliability and performance of their IT infrastructure while mentoring teams. Responsibilities include managing SLOs and incident response while working with cloud technologies such... 

    DroneUp

    New York, NY
    1 day ago
  •  ...of View! Come fly with us as our team goes through our checklists that...  ...the role DroneUp is seeking an SRE - Platform Engineer who will focus on ensuring the reliability, scalability, and performance of...  ...Indicators (SLIs), and error budgets Lead incident response, including on-... 
    Contract work
    Remote work

    DroneUp

    New York, NY
    1 day ago
  •  ...As a contributor in the SRE organization, you are passionate...  ...the high quality and reliability our customers demand. You...  ...will reach the entire engineering organization to enable product teams to continuously deliver features...  ...and tools. Apply site reliability engineering principles... 
    Remote work

    BOSTON TRUST WALDEN COMPANY

    New York, NY
    1 day ago
  •  ...An established industry player is seeking a talented SRE Engineer to join their innovative team. This role focuses on leveraging infrastructure automation tools and cloud services to enhance system reliability and performance. You will work closely with cross-functional... 

    TechDigital Group

    New York, NY
    1 day ago
  •  ...The Voleon Group is seeking a Site Reliability Engineer (SRE) to enhance production operations alongside...  ...diagnosing bugs, automating workflows, and leading deployments. The ideal candidate will...  ...analytical skills. Our supportive team promotes a growth mindset and diverse... 

    The Voleon Group

    New York, NY
    1 day ago
  •  ...connect with 28,396 DevOps professionals. Responsibilities The Site Reliability Engineer (SRE) role involves ensuring the reliability, availability, and...  .... Successful candidates will collaborate with development teams to design, build, and maintain the systems that support... 
    Remote work

    DevOpsChat

    New York, NY
    1 day ago
  • $160k - $300k

     ...Overview Site Reliability Engineer (SRE) – Remote, Full‑time. Base pay $160K–$300K/year. Responsibilities...  ...reliability, scalability, and performance. Lead incident response, including triage,...  ...engineering. Partner with engineering teams to embed reliability best practices... 
    Full time
    Remote work

    Crossing Hurdles

    New York, NY
    1 day ago
  •  ...Site Reliability Engineer 2 DevOps | REMOTE (US Citizenship required) The job opening for a Site Reliability Engineer (SRE) at Jobicy emphasizes the importance of enhancing system performance...  ...SRE will collaborate with various teams to streamline application deployments... 
    Remote work

    DevOpsChat

    New York, NY
    1 day ago
  • $150k - $200k

     ...are looking for a seasoned Senior Site Reliability Engineer to join our dynamic team in a foundational role, owning reliability...  ...and infrastructure as our first SRE. This role will involve ensuring...  ...of our production systems, leading infrastructure initiatives, and mentoring... 
    Work experience placement
    Remote work

    Barti

    New York, NY
    1 day ago
  •  ...this challenge without our incredible team. We have been recognized as one of...  ...fighting fraud. We are seeking a Senior Site Reliability Engineer (Senior SRE) to drive reliability improvements...  ...Incident Response: Participate in and help lead an on-call rotation, serving as an... 
    Remote work
    Flexible hours
    Night shift

    CertifID LLC

    New York, NY
    5 hours ago
  • $182.3k - $220k

     ...mission depends on reliable, secure, and scalable...  ...systems. As a Senior SRE on the infrastructure team, you’ll sit at the...  ...tools that empower our engineers to ship safely and...  ...harness learnings, leading efforts to minimize...  ....e., during team on-sites).   At Ro, we believe... 
    Local area
    Flexible hours

    Ro

    New York, NY
    17 hours ago
  • $150k - $200k

     ...Join to apply for the Senior Site Reliability Engineer role at Gradle Inc....  ...platform that helps software teams adopt and improve DORA capabilities...  ...used by some of the world’s leading software organizations, such...  ...You Are We’re building a new SRE team and looking for... 
    Full time
    Local area
    Remote work
    Work from home

    Gradle Inc.

    New York, NY
    1 day ago
  •  ...Position Summary With team members and customers in 39 countries...  ...global organization. As the Site Reliability Engineer, you will help ensure the...  ...during production incidents, leading incident coordination,...  ...documentation, and promote SRE best practices across engineering... 
    Remote work
    Worldwide
    Flexible hours

    HostPapa

    New York, NY
    1 day ago
  • $115k - $125k

     ...Piper Companies is seeking a Site Reliability Engineer to join a Cloud Services team in a Remote (U.S.-based) role. This is a highly technical, client-facing...  ...Infrastructure-as-Code and automation tools. Partner with leading Cloud Service Providers (CSPs) to support digital... 
    Remote work

    Piper Companies

    New York, NY
    1 day ago
  •  ...The Team This is a fully remote role, we will consider...  ...based in LATAM. Our Engineering team is having a blast...  ...to define and lead the industry. As part...  ...infrastructure. As a Site Reliability Engineer, you will keep...  ...should HODL 5+ years in SRE, DevOps, or backend infrastructure... 
    Local area
    Remote work

    Framework Ventures

    New York, NY
    1 day ago
  • $127k - $249k

     ...The Team Platform Engineering is the department within SRE that is responsible for a range of critical infrastructure and operational functions that support...  ...the critical components that ensure cluster reliability and security (e.g., CoreDNS, cert-manager, and Gatekeeper... 
    Work at office
    Local area
    Remote work
    Worldwide
    Flexible hours

    MongoDB

    New York, NY
    1 day ago
  • $165k - $235k

     ...system? Since 2014, the mission-driven team at the Stellar Development Foundation (...  ...ecosystem. SDF is looking for a Senior Site Reliability Engineer to help build and operate the foundation...  ...cloud-based systems operations, as a SRE or DevOps engineer. First-hand experience... 
    Temporary work
    Work at office
    Worldwide
    Flexible hours

    Crypto Pro Network

    New York, NY
    1 day ago
  •  ...About the Role SDF is looking for a Senior Site Reliability Engineer to help build and operate the foundation that powers our engineering teams. You'll ensure the reliability and...  ...in cloud-based systems operations, as a SRE or DevOps engineer. First-hand experience... 

    TechChain Talent

    New York, NY
    17 hours ago
  •  ...To learn more, visit Role Summary As an Intermediate Site Reliability Engineer, you will support the reliability, performance, and...  ...services and database platforms. You will work with senior team members to implement SRE best practices, improve automation, enhance... 
    Remote work
    Worldwide
    Home office

    Cority Inc

    New York, NY
    1 day ago
  •  ...A leading open-source software firm is seeking a Senior Site Reliability / Gitops Engineer to enhance automation practices and manage cloud infrastructure. You will collaborate...  ...home-based work environment with opportunities for global team collaboration. #J-18808-Ljbffr... 
    Remote work
    Work from home
    Worldwide
    Flexible hours

    Canonical Group Ltd

    New York, NY
    1 day ago
  • $165k - $225k

     .... In a single environment, teams design and operate analytics...  ...governance. The world's leading companies rely on Dataiku...  ...true business performance engine delivering measurable value...  ...make an impact: As a Site Reliability Engineer (SRE) with advanced expertise in... 
    Work at office
    Flexible hours

    Dataiku

    New York, NY
    5 hours ago
  •  ...Canonical is a leading provider of open source software...  ..., data science, AI, engineering innovation, and IoT....  ...few office‑based roles. Teams meet two to four times...  .... We are hiring a Site Reliability / Gitops Engineer to our...  ...million Ubuntu users. As an SRE & Gitops engineer you’... 
    Work at office
    Remote work
    Work from home
    Flexible hours

    Canonical Group Ltd

    New York, NY
    1 day ago
  • $140k - $205k

     ...Senior Technology Site Reliability Engineer Cooley is seeking a Senior Site Reliability Engineer to...  ...Technology Site Reliability Engineer("SRE") is responsible for ensuring the reliability...  ...and the ability to work as a team towards complex and layered objectives.... 
    Full time
    Temporary work
    Work at office
    Flexible hours
    Weekend work

    Cooley

    New York, NY
    5 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to SRE (Site Reliability Engineer) Team Lead. Be the first to apply!