Lead Site Reliability Engineer
Mastercard
Overview:
The role of the Business Operations organization is to be the production readiness steward for Mastercard products. As Business Operations, we are responsible for ensuring that our platform is stable and healthy. We break down barriers to running our products by fostering developer-run ownership and empowering developers to build resilient products. We support our developers during the application build phase in software run principles, including operational design, automation, capacity planning, and monitoring, that lead to fault-tolerant, scalable products. We see the big picture and help create and enforce operations standards while facilitating an agile and learning culture.
We accomplish this transformation by supporting daily operations with a hyper-focus on triage and then root cause, grounded in an understanding of the business impact of our products. The goal of every Biz Ops team is to shift left, engaging earlier in the development process, and to proactively manage production and change activities to maximize customer experience and increase the overall value of supported applications. Biz Ops teams also focus on risk management by tying all our activities together with an overarching responsibility for compliance and risk mitigation across all our environments. Biz Ops also focuses on streamlining and standardizing traditional application-specific support activities and centralizing points of interaction for both internal and external partners by communicating effectively with all key stakeholders.
• Lead and own the full lifecycle of services—from architecture and design through deployment, operations, and continuous optimization—ensuring scalability, reliability, and alignment with business objectives.
• Analyze platform-level ITSM performance and proactively establish feedback loops with engineering teams, influencing roadmap prioritization to address systemic gaps and improve resiliency.
• Define and drive production readiness standards, including operational design reviews, capacity planning, and launch governance, ensuring services meet reliability and scalability benchmarks before go-live.
• Define and evolve monitoring frameworks for availability, latency, and system health, leveraging metrics and telemetry to proactively prevent incidents and improve service performance.
• Champion automation-first principles to scale systems efficiently, reducing manual toil while improving deployment velocity and overall system reliability.
• Lead the design and governance of CI/CD pipelines, implementing robust validation, operational gates, and best practices to drive consistency, quality, and speed across environments.
• Drive best-in-class incident response practices, including rapid mitigation, stakeholder communication, and blameless postmortems, ensuring continuous improvement and resilience.
• Take a holistic, system-wide approach during critical incidents, connect
• Collaborate effectively across distributed, global teams, ensuring alignment, continuity, and high performance across time zones and technology hubs.
• Act as a technical leader and mentor, developing junior engineers, promoting best practices, and raising the overall bar for engineering excellence within the organization.
All about you
• Bachelor’s degree in Computer Science, Engineering, or a related technical field (e.g., Physics, Mathematics), or equivalent practical experience.
• 8–15 years of relevant experience in Site Reliability Engineering, Infrastructure, or DevOps roles, with a combination of hands-on technical expertise and early leadership responsibilities.
• Strong technical foundation across enterprise platforms, Linux/UNIX systems, operating systems, and database environments (Oracle/SQL, DBA), with the ability to provide technical guidance and support to the team.
• Experience with observability and monitoring tools (e.g., Splunk, Dynatrace), driving improved system visibility, performance, and reliability.
• Solid experience in DevOps and CI/CD practices, with the ability to support and guide automation, deployment pipelines, and operational improvements.
• Proficiency in one or more programming or scripting languages such as Python, Java, Go, C/C++, Perl, or Ruby, with practical application in automation or system
• Strong foundation in Security and/or Enterprise Monitoring environments, with exposure to coding and system-level design.
• Experience designing, analyzing, and troubleshooting large-scale distributed systems, with a strong focus on reliability, scalability, and performance optimization.
• Strong program management capabilities, with a track record of successfully leading large-scale, cross-functional initiatives from concept through execution.
• Extensive experience working across development, operations, and product teams to prioritize initiatives, build strong partnerships, and deliver end-to-end solutions.
• Practical knowledge of cloud platforms, preferably AWS, with familiarity in cloud-native architectures and operational best practices.
• Ability to critically assess existing processes and challenge the status quo, identifying opportunities to improve efficiency, scalability, and overall business impact.
We are seeking site reliability engineers who have an appetite for change and can push the boundaries of what can be completed through automation, while managing service levels for some of Mastercard’s most critical security services.
Corporate Security Responsibility
All activities involving access to Mastercard assets, information, and networks come with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:
- Abide by Mastercard’s security policies and practices;
- Ensure the confidentiality and integrity of the information being accessed;
- Report any suspected information security violation or breach; and
- Complete all periodic mandatory security trainings in accordance with Mastercard’s guidelines.
