
Senior Lead Site Reliability Engineer

JPMorgan Chase & Co.

Job Description

Elevate your engineering prowess to unprecedented levels by joining a team of exceptionally gifted professionals and position yourself among the top echelon in site reliability.

As a Sr Lead Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking Data and Analytics team, you will help build a meaningful engineering discipline, combining software and systems to develop creative engineering solutions to operations problems. Much of our support and software development focuses on optimizing existing systems, building infrastructure and reducing work through automation. You’ll join a team of curious problem solvers with a diverse set of perspectives who are thinking big and taking risks. In this environment you’ll take the lead on relevant projects, supported by an organization that provides the support and mentorship you need to learn and grow.

Job responsibilities

  • Creates high quality designs, roadmaps, and program charters that are delivered by you or the engineers under your guidance
  • Provides advice and mentoring to other engineers and acts as a key resource for technologists seeking advice on technical and business-related issues
  • Demonstrates site reliability principles and practices every day and champions the adoption of site reliability throughout your team
  • Collaborates with others to create and implement observability and reliability designs for complex systems that are robust, stable, and do not incur additional toil or technical debt
  • Identifies application patterns and analytics in support of better service level objectives
  • Designs self-healing and resiliency patterns
  • Designs automated software and product upgrades, change management, and release management solutions
  • Works toward becoming an expert on the applications and platforms in your remit while understanding their interdependencies and limitations
  • Evolves and debugs critical components of applications and platforms
  • Uses enterprise-authorized AI capabilities within the work environment to accelerate reliability design and operational decisioning (e.g., incident/post-incident analysis and requirements traceability), validating outputs and handling operational data according to sensitivity and security requirements
  • Leads reuse-first adoption of AI-assisted reliability workflows across SDLC/toolchain practices (e.g., testing/validation automation and production readiness), ensuring traceability/auditability, resiliency, and security controls

Required qualifications, capabilities, and skills

  • Formal training or certification on site reliability engineering concepts and 5+ years of applied experience
  • Advanced knowledge in site reliability culture and principles with demonstrated ability to implement site reliability within an application or platform.
  • At least 2 years of hands-on experience in architecting, scaling, and providing SRE support for AI/ML platforms and products, including infrastructure tech stacks such as Databricks, GPU clusters, Model Serving frameworks, Feature Stores, Vector Databases, and LLM inference pipelines.
  • Demonstrated ability to apply core SRE fundamentals — including reliability patterns, capacity planning, incident management, performance tuning, and toil reduction — specifically to AI/ML and data-intensive, compute-heavy workloads.
  • Experience in defining and enforcing SLOs/SLIs tailored to AI/ML workloads (e.g., model latency, throughput, data freshness, inference availability) to drive reliability at scale.
  • Demonstrated experience using enterprise-authorized AI capabilities within the work environment to improve reliability engineering workflows with strong validation habits and awareness of data sensitivity.
  • Ability to set team practices for safe AI usage in operations (e.g., review/approval expectations and escalation paths) while maintaining resiliency, security, and auditability outcomes.
  • Proven hands-on experience in designing and implementing Agentic AI-based solutions to deliver SRE capabilities at scale, including practical expertise with AI Agents, Skills, Context Management, Retrieval-Augmented Generation (RAG), and tool-use patterns.
  • Ability to apply Agentic AI frameworks to automate and augment core SRE functions such as intelligent incident detection and remediation, automated root cause analysis, predictive alerting, self-healing infrastructure, runbook automation, and observability enrichment to reduce toil and accelerate MTTR.
  • Ability to contribute to governance and controls of AI usage, applying a site reliability mindset and principles to CCB systems and platforms.
  • Advanced knowledge and experience in observability, including white-box and black-box monitoring, service level objectives, alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, and Splunk.

Preferred qualifications, capabilities, and skills

  • Experience with cloud-based data and analytics architecture, including AWS storage, Snowflake, Kubernetes (EKS), event-driven architectures, streaming services, batch jobs, and ETL pipelines.
  • Proficiency with modern data processing frameworks such as Apache Kafka, Apache Spark, and similar tools, with a focus on ensuring scalability, reliability, and performance of data and analytics platforms.
  • Strong communication skills with ability to mentor and educate others on site reliability principles and practices.
  • Recognized as an active contributor to the engineering community.

About Us

Chase is a leading financial services firm, helping nearly half of America’s households and small businesses achieve their financial goals through a broad range of financial products. Our mission is to create engaged, lifelong relationships and put our customers at the heart of everything we do. We also help small businesses, nonprofits and cities grow, delivering solutions to solve all their financial needs. 

We offer a competitive total rewards package including base salary determined based on the role, experience, skill set and location. Those in eligible roles may receive commission-based pay and/or discretionary incentive compensation, paid in the form of cash and/or forfeitable equity, awarded in recognition of individual achievements and contributions. We also offer a range of benefits and programs to meet employee needs, based on eligibility. These benefits include comprehensive health care coverage, on-site health and wellness centers, a retirement savings plan, backup childcare, tuition reimbursement, mental health support, financial coaching and more. Additional details about total compensation and benefits will be provided during the hiring process.

We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs. Visit our FAQs for more information about requesting an accommodation.

Equal Opportunity Employer/Disability/Veterans

About the Team

Our Consumer & Community Banking division serves our Chase customers through a range of financial services, including personal banking, credit cards, mortgages, auto financing, investment advice, small business loans and payment processing. We’re proud to lead the U.S. in credit card sales and deposit growth and have the most-used digital solutions – all while ranking first in customer satisfaction.
Vacancy posted 1 day ago