Director, Infrastructure & Site Reliability Engineering
Full-time
Mastercard
Our Purpose Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we’re helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential. Title and Summary
This role is ideal for a seasoned leader who combines deep technical expertise with a passion for operational excellence, automation, and cross-functional collaboration. Key Responsibilities • Define and execute the strategic roadmap for Site Reliability Engineering across distributed platforms.
• Lead modernization efforts including hardware lifecycle management, virtualization upgrades, and infrastructure optimization.
• Champion a culture of automation, resilience, and continuous improvement.
• Build, mentor, and scale a high-impact SRE organization with a focus on technical excellence and career development.
• Establish clear objectives, performance metrics, and development plans for team members.
• Promote knowledge sharing and operational maturity through documentation and onboarding programs.
• Oversee the health and performance of VMware clusters, ESXi hosts, and Oracle Linux environments.
• Ensure robust disaster recovery and high availability strategies are in place and tested.
• Drive incident management and root cause analysis for critical infrastructure issues.
• Lead the adoption of Infrastructure-as-Code and automation frameworks using tools like Chef, Ansible, PowerCLI, Python, and Jenkins.
• Reduce operational toil through scalable automation and self-healing systems.
• Align engineering practices with DevOps principles and agile methodologies.
• Architect observability solutions using Prometheus, Grafana, Splunk, and Dynatrace.
• Define and enforce Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets.
• Optimize alerting and telemetry to support proactive incident response.
• Ensure infrastructure compliance with security baselines, OS configurations, and regulatory standards.
• Collaborate with InfoSec and audit teams to maintain a secure and compliant environment.
• Partner with application, network, and storage teams to align infrastructure capabilities with business needs.
• Communicate technical strategies, upgrade plans, and operational impacts to executive stakeholders.
• Influence enterprise architecture and platform engineering decisions. All about you • 10+ years in Infrastructure, SRE, or Platform Engineering, with 5+ years in leadership roles
• Strong expertise in VMware (ESXi, clusters) and Linux (preferably Oracle Linux)
• Proven experience driving large-scale infrastructure modernization and automation initiatives
• Hands-on experience with IaC and automation (e.g., Ansible, Chef, Python, Jenkins)
• Solid understanding of SRE practices (SLOs, SLIs, error budgets, incident management)
• Experience with observability tools (e.g., Prometheus, Grafana, Splunk, Dynatrace)
• Strong knowledge of high availability, disaster recovery, and enterprise-scale infrastructure
• Excellent leadership, stakeholder management, and executive communication skills Corporate Security Responsibility
Director, Infrastructure & Site Reliability Engineering
Who is Mastercard?
Mastercard is a global technology company in the payments industry. Our mission is to connect and power an inclusive, digital economy that benefits everyone, everywhere by making transactions safe, simple, smart, and accessible. Using secure data and networks, partnerships and passion, our innovations and solutions help individuals, financial institutions, governments, and businesses realize their greatest potential. Our decency quotient, or DQ, drives our culture and everything we do inside and outside of our company. With connections across more than 210 countries and territories, we are building a sustainable world that unlocks priceless possibilities for all. Overview Are you a visionary leader who thrives on driving transformation in complex infrastructure environments? Do you excel at building high-performing teams, fostering innovation, and aligning technology with business outcomes? The Distributed Platform Operations team is seeking a Director of Site Reliability Engineering (SRE) to lead strategic initiatives that ensure the reliability, scalability, and performance of our VMware and Oracle Linux platforms.This role is ideal for a seasoned leader who combines deep technical expertise with a passion for operational excellence, automation, and cross-functional collaboration. Key Responsibilities • Define and execute the strategic roadmap for Site Reliability Engineering across distributed platforms.
• Lead modernization efforts including hardware lifecycle management, virtualization upgrades, and infrastructure optimization.
• Champion a culture of automation, resilience, and continuous improvement.
• Build, mentor, and scale a high-impact SRE organization with a focus on technical excellence and career development.
• Establish clear objectives, performance metrics, and development plans for team members.
• Promote knowledge sharing and operational maturity through documentation and onboarding programs.
• Oversee the health and performance of VMware clusters, ESXi hosts, and Oracle Linux environments.
• Ensure robust disaster recovery and high availability strategies are in place and tested.
• Drive incident management and root cause analysis for critical infrastructure issues.
• Lead the adoption of Infrastructure-as-Code and automation frameworks using tools like Chef, Ansible, PowerCLI, Python, and Jenkins.
• Reduce operational toil through scalable automation and self-healing systems.
• Align engineering practices with DevOps principles and agile methodologies.
• Architect observability solutions using Prometheus, Grafana, Splunk, and Dynatrace.
• Define and enforce Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets.
• Optimize alerting and telemetry to support proactive incident response.
• Ensure infrastructure compliance with security baselines, OS configurations, and regulatory standards.
• Collaborate with InfoSec and audit teams to maintain a secure and compliant environment.
• Partner with application, network, and storage teams to align infrastructure capabilities with business needs.
• Communicate technical strategies, upgrade plans, and operational impacts to executive stakeholders.
• Influence enterprise architecture and platform engineering decisions. All about you • 10+ years in Infrastructure, SRE, or Platform Engineering, with 5+ years in leadership roles
• Strong expertise in VMware (ESXi, clusters) and Linux (preferably Oracle Linux)
• Proven experience driving large-scale infrastructure modernization and automation initiatives
• Hands-on experience with IaC and automation (e.g., Ansible, Chef, Python, Jenkins)
• Solid understanding of SRE practices (SLOs, SLIs, error budgets, incident management)
• Experience with observability tools (e.g., Prometheus, Grafana, Splunk, Dynatrace)
• Strong knowledge of high availability, disaster recovery, and enterprise-scale infrastructure
• Excellent leadership, stakeholder management, and executive communication skills Corporate Security Responsibility
All activities involving access to Mastercard assets, information, and networks comes with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:
- Abide by Mastercard’s security policies and practices;
- Ensure the confidentiality and integrity of the information being accessed;
- Report any suspected information security violation or breach, and
- Complete all periodic mandatory security trainings in accordance with Mastercard’s guidelines.
Vacancy posted 24 days ago
Similar jobs that could be interesting for youBased on the Director, Infrastructure & Site Reliability Engineering in Mexico vacancy
- ...realize their greatest potential. Title and Summary Senior Site Reliability Engineer The Xborder team is looking for a Senior Site... ...• Strong knowledge of operating systems, platforms, and infrastructure components. • Knowledge of Artificial Intelligence Use...SuggestedFull timeWorldwide
- ...thinking organization, apply now. We are currently seeking a Site Reliability Engineer to join our team in Guadalajara, Jalisco (MX-JAL), Mexico... .... We are one of the world's leading AI and digital infrastructure providers, with unmatched capabilities in enterprise-...SuggestedWork at officeRemote workMonday to FridayFlexible hoursRotating shiftDay shift
- ...realize their greatest potential. Title and Summary Lead Site Reliability Engineer Overview: The role of Business Operations... ...of relevant experience in Site Reliability Engineering, Infrastructure, or DevOps roles, with a combination of hands-on technical...SuggestedFull timeWorldwideShift work
- ...greatest potential. Title and Summary Business Operations Site Reliability Engineer Overview: The role of Business Operations... ...• Strong knowledge of operating systems, platforms, and infrastructure components. • Experience working through others to solve...SuggestedFull timeWorldwideShift work
- ...governments realize their greatest potential. Title and Summary Site Reliability Engineering Manager The Xborder team is looking for a Site... ...of relevant experience in Site Reliability Engineering, Infrastructure, or DevOps roles, with a combination of hands-on...SuggestedFull timeWorldwideShift work
- ...Availability Groups, and ensuring system reliability and performance in a hybrid cloud... ...of the world's leading AI and digital infrastructure providers, with unmatched capabilities... ...locally to NTT DATA offices or client sites. This ensures we can provide timely and...Work at officeRemote workFlexible hours
- ...governments realize their greatest potential. Title and Summary Director, Software Engineering Overview The CNPF Data & AI organization is looking... ...translate emerging AI and agentic concepts into secure, reliable, observable, and production-grade systems, while also...Full timeWorldwide
- ...Mexico, Ciudad de Mexico, Ciudad de Mexico Sr. Manager Software Engineering (Individual Contributor) Do you love building and... ...educational tools or other information available through this site. Capital One Financial is made up of several different entities...Local area
- ...01), Mexico, Ciudad de Mexico, Ciudad de Mexico Senior Director, Software Engineering Capital One is seeking an experienced software engineering... ...Engineering, to help us build and grow our Technology Site in Mexico City. Based in Mexico City, the Senior Director...Local areaShift work
- ...Ciudad de Mexico, Ciudad de Mexico Director, Software Engineering Capital One is seeking a... ...responsible for managing the cloud infrastructure of our critical platform and the underlying... ...~7+ years of experience with Site Reliability Engineering (SRE) At Capital...Local area
$6,000 per month
...Mobile Systems Developer , you are the engine under the hood of our app experience.... ...integrations that make our travel companion reliable in the real world. You will partner... ...tracking, and state persistence. Infrastructure Optimization: Architect the data flow...Contract workLocal area- ...Ciudad de Mexico, Ciudad de Mexico Senior Manager, Software Engineering (People Leader) Do you love building and pioneering in the... ...educational tools or other information available through this site. Capital One Financial is made up of several different entities...InternshipLocal area
- ...greatest potential. Title and Summary Director, Platform Engineering Mastercard powers economies and... ...in building and operating infrastructure and applications globally. The Director... ...infrastructure delivery, and improving reliability, consistency, and velocity across...Full timeWorldwide
- ..., Mexico, Ciudad de Mexico, Ciudad de Mexico Senior Software Engineer - Full Stack Do you love building and pioneering in the technology... ..., educational tools or other information available through this site. Capital One Financial is made up of several different...InternshipLocal area
- ...realize their greatest potential. Title and Summary Software Engineer II Overview The CNPF Data & AI organization is looking for... ...platform innovation—turning emerging technologies into secure, reliable, and reusable capabilities that create measurable business...Full timeWorldwide
- ...unique set of products and services that help people, businesses and governments realize their greatest potential. Title and Summary Director, Customer Technical Services (Contact Center Lead) Overview: Mastercard Cross-Border Services helps banks, digital providers,...Full timeWork at officeLocal areaWorldwide
- ...Coordinate activities across development, QA, infrastructure, business, and vendor teams. Track... ...with architects, developers, QA engineers, and Product Owners to ensure delivery... ...locally to NTT DATA offices or client sites. This ensures we can provide timely and...Full timeWork at officeRemote workFlexible hours
- ...management of applications, infrastructure and connectivity. We are one... ...NTT DATA offices or client sites. This ensures we can provide... ...looking for a Senior DevOps Engineer with strong experience in infrastructure... ..., security, and application reliability. The candidate should be...Work at officeRemote workFlexible hours
- ...realize their greatest potential. Title and Summary Lead Software Engineer Overview The CNPF Data & AI organization is looking for a... ...You will lead the design and delivery of secure, scalable, and reliable agentic applications that can reason, orchestrate tools,...Full timeTemporary workWorldwide
- ...innovation. We are one of the world's leading AI and digital infrastructure providers, with unmatched capabilities in enterprise-scale AI,... ...Whenever possible, we hire locally to NTT DATA offices or client sites. This ensures we can provide timely and effective support...Work at officeRemote workFlexible hours
- ...innovation. We are one of the world's leading AI and digital infrastructure providers, with unmatched capabilities in enterprise-scale AI,... ...Whenever possible, we hire locally to NTT DATA offices or client sites. This ensures we can provide timely and effective support...Work at officeRemote workFlexible hours
- ...the Job Our client is seeking a Digital Marketing & Growth Director to own the full consultant lifecycle, from acquisition to retention... ...digital innovation. You will lead multiple teams and growth engines, ensuring a fully integrated approach between digital and traditional...Permanent employment
- .... We are currently seeking a System Engineering - Azure to join our team in Guadalajara... ...trusts, forest, domain tree structures, sites, DNS, GPOs, OU, FRS, DFSR. Good... ...of the world's leading AI and digital infrastructure providers, with unmatched capabilities...Work at officeRemote workWork from homeHome officeFlexible hoursNight shiftWeekend work
- ...SAP solutions and offerings 22. Showcasing expertise in cloud infrastructure architecture, including compute, storage, networking, and... ...Whenever possible, we hire locally to NTT DATA offices or client sites. This ensures we can provide timely and effective support tailored...Work experience placementWork at officeRemote workFlexible hours
- ...currently seeking a L3 Support Engineer (Python & MongoDB) to join... ...teams to ensure system reliability and performance. Key Responsibilities... ...'s leading AI and digital infrastructure providers, with unmatched... ...NTT DATA offices or client sites. This ensures we can provide...Work at officeRemote workFlexible hours
- ...Mexico (MX). # L3 Production Support Engineer : Job Description Mandatory Qualifications... ...of the world's leading AI and digital infrastructure providers, with unmatched capabilities... ...locally to NTT DATA offices or client sites. This ensures we can provide timely and...Work at officeRemote workFlexible hours
- ...CMX), Mexico (MX). Technical Support Engineer – Azure Databricks Job Summary We... ...one of the world's leading AI and digital infrastructure providers, with unmatched capabilities... ...hire locally to NTT DATA offices or client sites. This ensures we can provide timely and...Work at officeRemote workFlexible hours
- Job Title: COBOL Unisys Developer Job Summary We are looking for an experienced COBOL Unisys Developer with 5+ years of experience to support and enhance legacy systems. The ideal candidate will be responsible for maintaining applications, troubleshooting issues,...
- NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking a Sr. Salesforce Technical Lead to join ...
- ...‑driven Full Stack Support Engineer (L4) who thrives in fast‑paced... ...stability, performance, and reliability of modern, scalable... ...world's leading AI and digital infrastructure providers, with unmatched capabilities... ...NTT DATA offices or client sites. This ensures we can provide...Work at officeRemote workFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Director, Infrastructure & Site Reliability Engineering. Be the first to apply!
Related searches
