Site Reliability Engineer II
Mastercard
Who is Mastercard?
At Mastercard technology, we work to connect and power an inclusive, digital economy that benefits everyone, everywhere, by making transactions safe, simple, smart, and accessible. Using secure data and networks, partnerships, and passion, our innovations and solutions help individuals, financial institutions, governments, and businesses realize their greatest potential. Our decency quotient, or DQ, drives our culture and everything we do inside and outside of our company. We cultivate a culture of inclusion for all employees that respects their individual strengths, views, and experiences. We believe that our differences enable us to be a better team – one that makes better decisions, drives innovation, and delivers better business results.
What we create today will define tomorrow. Revolutionary technologies that reshape the digital economy to be more connected and inclusive than ever before. Safer, faster, more sustainable.
And we need the best people to do it. Technologists who are energized by the challenges of a truly global network. With the talent and vision to create the critical systems and products that power global commerce and connect people everywhere to the vital goods and services they need every day.
Working at Mastercard means being part of a unique culture. Inclusive and diverse, a rich collaboration of ideas and perspectives. A place that celebrates your strengths, values your experiences, and offers you the flexibility to shape a career across disciplines and continents. And the opportunity to work alongside experts and leaders at every level of the business, improving what exists, and inventing what’s next.
About the Role
The Business Operations team is seeking a highly motivated and experienced Site Reliability Engineer II (SRE) to join our team. You will play a critical role in ensuring the reliability, scalability, and performance of our applications, supporting essential services that power Mastercard's global operations. As a thought leader in your field, you will bring technical expertise, a passion for automation, and the ability to mentor.
The role of the Business Operations Site Reliability Engineer is to be the production readiness steward for Mastercard products. As Business Operations SREs, we are responsible for ensuring that our platform is stable and healthy. We break down barriers to running our products by fostering developer-run ownership and empowering developers to build resilient products. We support our developers during the application build phase in applying software run principles, including operational design, automation, capacity planning, and monitoring, that lead to fault-tolerant, scalable products. We see the big picture and help create and enforce operations standards while facilitating an agile and learning culture. We support daily operations with a hyper focus on triage and root cause analysis, understanding the business impact of our products and subsequently performing blameless post-mortems.
The goal of every Business Operations team is to engage early in the development lifecycle and to proactively manage production and change activities to maximize customer experience and increase the overall value of supported applications. Business Operations teams also focus on risk management by tying all our activities together with an overarching responsibility for compliance and risk mitigation across all our environments. Ultimately, the role of Business Operations is to align product- and customer-focused priorities with operational needs by providing continuous feedback throughout the lifecycle.
As part of the Business Operations team, you will:
• Work independently on elements of projects/processes within the Site Reliability Engineering area by applying intermediate/practical knowledge and area best practices to meet organizational standards of quality and excellence.
• Support the implementation and maintenance of high-availability systems to ensure operational stability.
• Assist in evaluating operational needs and developing technical solutions under guidance.
• Contribute to automation and scripting projects to streamline routine operational tasks.
• Troubleshoot and resolve basic to moderate system issues, escalating more complex problems as needed.
• Document operational procedures and share knowledge with team members.
• Participate in quality checks and reviews to ensure system stability and reliability.
• Utilize experience and a comprehensive understanding of area processes and tools to make minor adjustments or enhancements to resolve identifiable issues. May manage smaller projects/initiatives as an experienced individual contributor with specialized knowledge within the Site Reliability Engineering area.
Role qualifications:
The ideal candidate will apply the following skills independently in routine and moderately complex situations, requiring occasional guidance typically only in unfamiliar or highly complex scenarios. They will demonstrate growing consistency and reliability in applying the skills.
• Observability - Ability to use scripting and tooling to implement observability solutions, enabling the collection, analysis, and visualization of metrics, logs, and traces to support incident detection, diagnosis, and continuous service improvement.
• Programming and Scripting - Ability to write and maintain code and scripts to automate tasks, build operational tools, and support monitoring, deployment, and incident response using languages such as Python, Go, Bash, or similar.
• Systems and Network Administration - Ability to configure, operate, and troubleshoot Linux/Unix systems and network components, applying knowledge of networking concepts, protocols, security, and system reliability.
• Cloud Computing and Infrastructure - Ability to design, deploy, and manage applications and infrastructure on cloud platforms (e.g., AWS, Azure, GCP), ensuring scalability, security, availability, and operational efficiency.
• Reliability and Scalability - Ability to design and operate systems for high availability, fault tolerance, and disaster recovery, while ensuring systems can scale to meet current and future demand.
• DevOps Practices - Ability to apply DevOps principles and practices, including CI/CD pipelines, containerization, and orchestration, to enable faster, more reliable software delivery and operations.
• Troubleshooting - Capability to systematically identify, diagnose, and resolve technical issues across systems, applications, and networks, using analytical methods and tools to restore functionality, minimize disruption, and ensure stable operations.
• Capacity Planning and Performance Optimization - Ability to monitor resource utilization, forecast future capacity needs, and optimize system performance to support growth, scalability, and efficient infrastructure usage.
• IT Service Management - Ability to apply IT service management principles to incident, problem, and change management, ensuring reliable service delivery, effective incident response, and continuous service improvement aligned to business needs.
• Proactive Monitoring and Improvement (SRE Applications) - The ability to use application reliability signals to anticipate issues, identify risks, and drive preventative improvements that enhance application performance and availability.
Corporate Security Responsibility
All activities involving access to Mastercard assets, information, and networks come with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:
- Abide by Mastercard’s security policies and practices;
- Ensure the confidentiality and integrity of the information being accessed;
- Report any suspected information security violation or breach; and
- Complete all periodic mandatory security trainings in accordance with Mastercard’s guidelines.
