Lead, SRE Engineer
Full-time
Mastercard
Our Purpose:
Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we’re helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential.
Mastercard powers economies and empowers people across more than 200 countries and territories worldwide.
We are committed to building an inclusive, digital economy that benefits everyone, everywhere, by making transactions safe, simple, smart, and accessible. Through secure data, trusted networks, strong partnerships, and relentless innovation, we help individuals, financial institutions, governments, and businesses unlock their greatest potential.
About the Role:
Mastercard’s program-aligned Site Reliability Engineering (SRE) teams are dedicated to delivering a seamless experience for our customers. We achieve this by maintaining every aspect of our programs’ infrastructure and technology ecosystem to the highest standards and ensuring compliance with rigorous security requirements.
Within Mastercard, SRE focuses on the reliability and performance of core infrastructure, networks, and foundational services that power our applications. Our mission is to ensure these components operate with excellence, enabling applications to deliver an outstanding customer experience.
In this role, you will join our Payments Network SRE team and take ownership of continuously assessing and elevating the end-to-end service quality of our platform. You will leverage data to drive root cause analysis and deliver strategic insights to key stakeholders on resource utilization, capacity forecasting, and performance trends, ensuring the availability, scalability, and resilience of our network.
Key Responsibilities:
• Lead continuous assessments of the application infrastructure supporting critical Mastercard applications, focusing on health, performance, monitoring and alerting, and capacity analysis.
• Collaborate with Product and Development teams to forecast growth requirements and ensure scalability and resiliency.
• Champion observability as a core principle for infrastructure services by assessing environments and technologies to uncover gaps in monitoring and alerting. Design and implement strategies to close these gaps, ensuring all infrastructure telemetry is integrated into a unified, single-pane-of-glass view.
• Build custom dashboards to investigate and perform root cause analysis on complex issues.
• Lead regular incident reviews with internal support teams to ensure root causes are identified. When patterns of failure or compatibility issues between software and infrastructure emerge, develop and implement strategies to remediate or mitigate risks.
• Leverage automation and AI technologies to enhance proactive issue detection and enable self-healing capabilities, reducing Mean Time to Detect (MTTD) and Mean Time to Mitigate (MTTM).
• Develop testing and validation plans for new environment builds, disaster recovery exercises, and post-maintenance activities to certify environment readiness before customer traffic is routed to the environment.
• Champion continuous learning, development, and knowledge sharing across networking and other infrastructure disciplines to strengthen multi-disciplinary SRE team capabilities.
• Lead training initiatives for team members and for Product and Development teams on networking aspects of the platforms.
• Evaluate vendor hardware, firmware, and software upgrade roadmaps, and conduct proof-of-concept (POC) testing to identify potential risks and opportunities for improvement in upcoming releases.
All about you:
• 5–10 years of experience in an SRE or SRE-related operations role, including 3+ years supporting e-commerce, financial services, or large-scale SaaS platforms.
• Excellent infrastructure troubleshooting and analytical problem-solving skills.
• Strong hands-on experience with observability and monitoring tools such as Splunk, Dynatrace, or equivalent, with a proven ability to triage and investigate complex issues.
• Familiarity with network telemetry tools such as SolarWinds and NetScout.
• Proficiency in packet-level debugging, including capturing traffic with tools like tcpdump and analyzing packets using Wireshark.
• Broad understanding of the end-to-end infrastructure supporting payment platforms, spanning platform services, networking, databases, and storage.
• Experience with automation and Infrastructure as Code tools such as Chef, Ansible, and Terraform, as well as structured data formats (JSON/YAML).
• Excellent communication skills with the ability to coordinate cross-functional troubleshooting efforts and lead RCA processes to closure.
• Demonstrated ability to troubleshoot complex production issues, perform root cause analysis, and drive long-term corrective actions.
• Experience partnering with development teams to shape architecture, define SLIs/SLOs, and embed reliability into services from design through operation.
• Strong understanding of monitoring and observability ecosystems, including Prometheus, Grafana, ELK/EFK, Splunk, and OpenTelemetry.
• Effective incident management skills with a structured, analytical approach to problem-solving.
The Payments Network SRE team is responsible for the runtime availability of some of Mastercard’s most critical core payment systems, which support national infrastructure and operate 24/7 year-round. As a result, this role will include periodic on-call responsibilities when required.
Corporate Security Responsibility
All activities involving access to Mastercard assets, information, and networks come with an inherent risk to the organization. Therefore, every person working for, or on behalf of, Mastercard is responsible for information security and must:
• Abide by Mastercard’s security policies and practices;
• Ensure the confidentiality and integrity of the information being accessed;
• Report any suspected information security violation or breach; and
• Complete all periodic mandatory security trainings in accordance with Mastercard’s guidelines.
Vacancy posted 6 days ago
Location: Dublin, CA


