Senior Lead Site Reliability Engineer - AI/ML and Data Platforms
JPMorgan Chase & Co.
Job Description
Elevate your engineering prowess to unprecedented levels by joining a team of exceptionally gifted professionals and position yourself among the top echelon in site reliability.
As a Senior Lead Site Reliability Engineer at JPMorgan Chase within the Chief Data & Analytics Office (CDAO) AI/ML & Data Platforms team, you will work with fellow stakeholders to define non-functional requirements (NFRs) and availability targets for services supporting large-scale data platforms and data lake ecosystems. You will ensure those NFRs are embedded into product design and testing phases, that service level indicators effectively measure customer and data platform performance, and that service level objectives are defined with stakeholders and implemented in production to support secure, scalable, and high-performing analytics and AI/ML workloads.
Job responsibilities
- Creates and delivers high-quality designs, roadmaps, and program charters alongside the engineering teams, including data platform and distributed systems initiatives
- Acts as a key resource and mentor for technologists in your area seeking advice on technical and business issues, and serves as a culture carrier and site reliability adoption champion for your team
- Collaborates with others to create and implement observability and reliability designs for complex systems and data platforms that are robust and stable and do not incur additional toil or technical debt
- Uses enterprise-authorized AI capabilities within the work environment to accelerate reliability design and operational decisioning (e.g., incident/post-incident analysis and requirements traceability), validating outputs and handling operational data according to sensitivity and security requirements
- Drives evolution and debugging of critical components, including platform and data system dependencies, by understanding application and infrastructure interdependencies and limitations
- Provides comprehensive and ongoing guidance, tools, and solutions to support the firm’s growth, including scalable data platform infrastructure and engineering best practices
- Makes significant contributions to JPMorganChase’s site reliability community via internal forums, communities of practice, guilds, and conferences
- Leads reuse-first adoption of AI-assisted reliability workflows across SDLC/toolchain practices (e.g., testing/validation automation and production readiness), ensuring traceability/auditability, resiliency, and security controls across application and data platform environments
Required qualifications, capabilities, and skills
- Formal training or certification in site reliability engineering concepts and 5+ years of applied experience
- Brings an advanced understanding of site reliability culture and principles and a track record of demonstrating how to implement site reliability within applications, platforms, or large-scale data systems, including strong understanding of SLI/SLO/SLA and error budgets
- Advanced knowledge and experience in observability such as white and black box monitoring, service level objectives, alerting, and telemetry collection across distributed and data platform environments, including tools such as Grafana, Dynatrace, Prometheus, Datadog, and Splunk
- Demonstrated experience using enterprise-authorized AI capabilities within the work environment to improve reliability engineering workflows with strong validation habits and awareness of data sensitivity
- Ability to set team practices for safe AI usage in operations (e.g., review/approval expectations and escalation paths) while maintaining resiliency, security, and auditability outcomes, ensuring compliance with risk controls and company-wide standards
- Advanced knowledge of software applications and technical processes, including distributed systems, system design, resiliency, testing, operational stability, and disaster recovery, with considerable depth in one or more technical disciplines
- Demonstrated ability to communicate data-based solutions with complex reporting and visualization methods and collaborate effectively across teams to drive incident resolution and improvements
- Recognized as an active contributor to the engineering community
- Strong communication skills and a desire to mentor and educate others on site reliability engineering principles and practices while building strong cross-functional relationships
Preferred qualifications, capabilities, and skills
- Experience with AWS platforms and managed data platforms such as Databricks, including platform administration and engineering support
- Experience in building and managing data pipelines using Spark or similar distributed compute frameworks
- Familiarity with big data ecosystem tools (e.g., Spark, Glue, MapReduce)
- Knowledge of containerization (Docker, Kubernetes) and orchestration frameworks
- Experience with CI/CD pipelines, automation frameworks, and infrastructure as code (e.g., Terraform)
- Proficiency in Python or similar programming languages for automation and platform development
- Familiarity with large-scale distributed systems and data processing environments
About Us
JPMorganChase, one of the oldest financial institutions, offers innovative financial solutions to millions of consumers, small businesses and many of the world’s most prominent corporate, institutional and government clients under the J.P. Morgan and Chase brands. Our history spans over 200 years and today we are a leader in investment banking, consumer and small business banking, commercial banking, financial transaction processing and asset management.
We offer a competitive total rewards package including base salary determined based on the role, experience, skill set and location. Those in eligible roles may receive commission-based pay and/or discretionary incentive compensation, paid in the form of cash and/or forfeitable equity, awarded in recognition of individual achievements and contributions. We also offer a range of benefits and programs to meet employee needs, based on eligibility. These benefits include comprehensive health care coverage, on-site health and wellness centers, a retirement savings plan, backup childcare, tuition reimbursement, mental health support, financial coaching and more. Additional details about total compensation and benefits will be provided during the hiring process.
We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs. Visit our FAQs for more information about requesting an accommodation.
JPMorgan Chase & Co. is an Equal Opportunity Employer, including Disability/Veterans
About the Team
Our professionals in our Corporate Functions cover a diverse range of areas from finance and risk to human resources and marketing. Our corporate teams are an essential part of our company, ensuring that we’re setting our businesses, clients, customers and employees up for success.

