Senior Site Reliability Engineer
$121.4k - $218.6kAkamai
Do you enjoy collaborating with teams to solve complex challenges?
Do you enjoy solving large scale distributed content delivery challenges?
Join our critical AI Hardware SRE Team!
The AI Hardware SRE team is responsible for overseeing, scaling, and optimizing our next-generation dedicated AI hardware infrastructure. You will be responsible for ensuring best-in-class uptime and reliability of our AI hardware infrastructure offerings.
Partner with the best
In this role, you'll play a part in pioneering the reliability an elite, high-density hardware and software infrastructure spanning the globe. You'll collaborate with product teams from the earliest stages of development to ensure the reliability, scalability, and performance of our systems. You'll define key performance indicators and defend them when they are breached.
As a Senior Site Reliability Engineer, you will be responsible for:
Developing and scaling robust programmatic tooling and infrastructure-as-code utilities in Python to eliminate operational toil and automate fleet-wide provisioning.
Integrating automated workflows across disconnected corporate ticketing systems to optimize time-to-mitigate metrics for hardware and network break-fix events.
Leveraging advanced AI utilities and LLM-assisted development paradigms where appropriate to accelerate technical execution, script authorship, and system analysis
Working on cutting-edge private cloud and compute technologies to improve the availability, latency, and overall systemic health of high-density hardware environments.
Designing and implementing telemetry pipelines, custom Prometheus/Grafana monitoring dashboards, and AI-based anomaly detection tailored for bare-metal and virtualized environments.
Participating in 24x7x365 on-call rotations, spearheading real-time incident management, and managing high-severity service disruption protocols via automated PagerDuty and Slack workflows.
Partnering directly with third-party infrastructure vendors and coordinating on-site field technicians to facilitate uptime activities.
Do what you love
To be successful in this role you will:
Have 5 years of relevant experience and a Bachelor's degree in Computer Engineering, Computer Science or equivalent
Possess tooling and coding ability in languages like Python to construct scalable operational tools, API integrations, and automation frameworks.
Show hands-on experience with modern observability stacks and timeseries engines, like Prometheus, Grafana, OpenTelemetry, and Loki.
Possess a working understanding of advanced networking topologies, high-bandwidth routing/switching infrastructure, BGP, and dual-stack IPv4/IPv6 networks.
Have experience acting as a key designer for new service rollouts, including establishing operational readiness criteria, telemetry baselines, and alerting thresholds.
Demonstrate extensive experience building technical runbooks, leading complex incident response bridges, and driving comprehensive, blameless post-mortems.
Display a proven ability to take absolute ownership of ambiguous technical problems, coordinate cross-functional teams, and drive for production-grade solutions.
About us
At Akamai, we make life better for billions of people, trillions of times a day.
Whether you're streaming live events, scrolling social media, watching your favorite series, or managing your savings, we're the engine behind the scenes. We provide the world's most distributed platform from Cloud to Edge to help the giants of the digital world work faster and stay more secure, making the internet a better experience for everyone.
Our focus is simple:
Cloud and Edge: Running apps closer to users for instant performance.
Security : Neutralizing threats before they ever reach your data.
Content Delivery : Scaling the world's biggest moments without a glitch.
AI : Enabling our customers to build, secure, and scale AI apps on the world's most distributed cloud platform.
At Akamai, we don't just support the internet; we power and protect it, because behind every great digital experience is a massive hidden challenge. And we're the ones who solve it. When millions of people hit play or pay, Akamai ensures it just works.
Benefits at Akamai: We support your health, well-being, finances, and life beyond work. See our benefits. (
FlexBase adapts to your job's needs
Akamai's FlexBase program is yet another way we show our commitment to providing employees with an exceptional workplace experience. It's not about telling employees where to work; it's about supporting employees to do their best work.
We trust our incredible employees to work in ways that suit them best: at home, in an office, or a combination of both.
Connect with us on social and see what life at Akamai is like!
Compensation
Akamai is committed to fair and equitable compensation practices. For US based candidates only - the base salary for this position ranges from $121,400 - $218,600/year; a candidate's salary is determined by various factors including, but not limited to, relevant work experience, skills, certifications and location. Compensation for candidates outside the US will vary. The compensation package may also include incentive compensation opportunities in the form of annual bonus or incentives, equity awards and an Employee Stock Purchase Plan (ESPP). Akamai provides industry-leading benefits including healthcare, 401K savings plan, company holidays, vacation (in the form of PTO), sick time, family friendly benefits including parental leave and an employee assistance program including a focus on mental and financial wellness; Eligibility requirements apply.
Equal Employment Opportunity Rights
Akamai Technologies is an Affirmative Action, Equal Opportunity Employer that values the strength that diversity brings to the workplace. All qualified applicants will receive consideration for employment and will not be discriminated against on the basis of gender, gender identity, sexual orientation, race/ethnicity, protected veteran status, disability, or other protected group status.
$152k - $195k
...Senior Site Reliability Engineer Austin, TX (Hybrid) SecurityScorecard is the global leader in cybersecurity ratings, with over 12 million companies continuously rated, operating in 64 countries. Founded in 2013 by security and risk experts Dr. Alex Yampolskiy and...Senior- ...Senior Site Reliability Engineer Austin, Texas, United States Who We Are At 2K, we create some of the most iconic and culture-shaping video games in entertainment, including NBA® 2K, one of the top-selling franchises in the world, and legendary titles like BioShock...Senior
- ...across the globe, and we're honored to support first responders. And this is where you come in. We're seeking a Senior Site Reliability Engineer who can own our data tier at high availability while also pulling weight across the broader platform. As Zello scales,...SeniorPermanent employmentLocal areaFlexible hours
- ...commercialization, and mass production to change the world for the better. JOB SUMMARY We are seeking an experienced Site Reliability Engineer to own and maintain the deployment of our cloud-based infrastructure to customer sites. In this role, you will work...SeniorFull timeLocal area
$110.7k - $171.8k
...components Participation in on-call rotation as a platform reliability escalation point Incident response, post-incident reviews,... ..., and internal control requirements. Collaborate with engineering teams across the organization to influence platform adoption,...SeniorWork experience placementWork at officeLocal area$111.6k - $186k
...Company Cox Automotive - USA Job Family Group Engineering / Product Development Job Profile Sr Software... ...may include an incentive program. Job Description Senior Site Reliability Engineer Department: Engineering / Platform...SeniorRemote workRelocationFlexible hoursShift work- ...of in-office collaboration and fully intend for the selected candidate for this role to work on site in the specified location(s). As a Senior Site Reliability Engineer within the CET SAvE organization, you will play a critical leadership role advancing the reliability...SeniorFull timeWork at office
$79.1k - $158.2k
...service according to terms for reliability and functionality. ~... ...~Gains basic knowledge of site reliability trends and shares... ...identify and escalate issues to senior team members. Collects and reviews... ...a skilled Site Reliability Engineer to design, build, operate, and...SeniorTemporary workImmediate startFlexible hoursShift work- ...exceptional interactions, smarter decision-making, and accelerated growth in the AI-driven world. We’re looking for a Senior Site Reliability Engineer to help build and scale a high-impact SRE function. You’ll be a technical leader on a team responsible for improving...Senior
- .... Our infra has to match. The role We’re looking for a Senior SRE to own the reliability, scalability, and operational posture of Satsuma’s multi... ...AI‑assisted development workflows Partner closely with engineering on reliability reviews and architecture decisions 5‑8 years...Senior
- ...infrastructure and/or service according to terms for reliability and functionality. Assists team members... ...deployments. Gains basic knowledge of site reliability trends and shares relevant... ...to identify and elevate issues to senior team members. Collects and reviews basic...SeniorImmediate startShift work
- Senior Site Reliability Engineer - Trustwise (Austin) About Trustwise: At Trustwise, we are deeply committed to building an AI Trust layer that helps companies unlock Generative AI’s full potential. Our software helps enterprises deploy AI systems that are safe, aligned...SeniorRemote work
- About the Role We are looking for a Senior SRE to serve as the operations owner for the... ...developer tooling ecosystem that shapes how engineers work day to day, including Python and .... ...them. What You’ll Work On Operations & Reliability: Serve as a primary escalation point for...Senior
- Zowta, LLC is seeking a Senior Site Reliability Engineer in Austin, TX. This full-time, hybrid role involves maintaining and improving cloud systems using AWS and Infrastructure as Code. Candidates should have over 5 years of relevant experience, strong skills in DevOps...SeniorFull time
- ...importance of in‑office collaboration and fully intend for the selected candidate for this role to work on site in the specified location. As a Senior Reliability Engineer, you’ll play a pivotal role in shaping the reliability and scalability of our mission‑critical...SeniorWork at office
- About the Role We are looking for a Senior SRE to join our Platform Engineering team as the operations owner of our observability platforms. You’ll be responsible for the reliability, scalability, and continued evolution of the tools that give our engineering organization...Senior
$152k - $241.5k
Senior Site Reliability Engineer - HPC page is loaded## Senior Site Reliability Engineer - HPClocations: US, CA, Santa Clara: US, TX, Austin: US, NC, Durhamtime type: Full timeposted on: Posted Todayjob requisition id: JR2013271NVIDIA has been transforming computer graphics...Senior- What You’ll Do Evangelize the Site Reliability Engineering (SRE) mindset and solve problems through systematization Identify opportunities to build innovative tools and address unique operational challenges for large-scale, enterprise applications Create scripts to...Senior
$127k - $249k
...remote from Eastern or Central time zones. The role supports the Atlas platform as part of the Senior SRE Atlas team. Role Overview Seeking a senior Site Reliability Engineer to design, build, and operate complex systems that support the Atlas platform. The role emphasizes...SeniorLocal areaRemote workFlexible hours$127k - $249k
Senior / Staff Engineer - SRE, InfraSec We are looking for an experienced Senior or Staff Engineer for our SRE, InfraSec team to guide the security of our cloud‑based infrastructure. You will be highly hands‑on technically while also mentoring a small team of SREs. The...SeniorLocal areaRemote work- Charles Schwab Corporation is seeking a Senior Site Reliability Engineer to lead efforts in enhancing the reliability, scalability, and performance of its mobile and digital platforms. This role entails collaborating across teams to embed best practices into the software...SeniorWork at office
$109.5k - $150.55k
...strive for the best, own our actions, and grow and evolve. Job Description Renaissance is looking for an experienced Sr Site Reliability Engineer to be part of the Engineering Enablement group's Site Reliability Team with a focus on Application and Infrastructure...SeniorFor contractorsLocal areaRemote workWorldwideWork visaFlexible hoursWeekend work$110.7k - $171.8k
...Product Reliability Engineer About Us Visa is a world leader in payments technology, facilitating transactions between consumers, merchants... ...duties and developing systems and software that help increase site reliability and performance. Site reliability engineering (SRE...SeniorPermanent employmentWork experience placementWork at officeLocal areaImmediate startFlexible hoursWeekend work- The Consulting Solutions is seeking an experienced Senior / Staff Engineer for our SRE, InfraSec team in Seattle. The role involves leading the security of cloud-based infrastructure, mentoring a team of SREs, and collaborating with other engineering teams to ensure high...SeniorRemote job
- Sr. Software Engineer - Site Reliability About ShipperHQ: ShipperHQ is a trusted leader in the e-commerce shipping space, with over 15 years of... ...e-commerce logistics. Position Overview: We’re seeking a Senior Site Reliability Engineer to join our fast-paced Engineering...SeniorFull timeWork at office
- Sr Site Reliability Engineer, Customer Systems Austin, Texas, United States Software and Services Imagine what you could do here. Apple is a place where extraordinary people gather to do their best work. Together we craft products and experiences people once couldn’t have...Senior
- Senior Site Reliability Engineer This role contributes to the reliability, observability, and operational excellence of our platform infrastructure serving millions of users. As a Senior SRE, you will be a strong technical contributor who implements best practices, solves...SeniorWork at officeLocal areaImmediate startFlexible hours
- Upstart is seeking a Senior Software Engineer focused on Site Reliability Tooling. This role involves enhancing the reliability and observability of our production systems while working closely with other engineers at Upstart. Qualifications include a minimum of 6 years...SeniorRemote job
- Programmers.io is looking for mid-senior level candidates for a full-time position in Austin, Texas. The ideal candidate will have strong experience in Linux administration and Site Reliability Engineering (SRE), along with proficiency in GitHub, Kubernetes, CI/CD methodologies...SeniorFull time
- ...Site Reliability Engineer Location: Austin, Texas Schedule: Full-Time Pay Range: Competitive pay, based on experience and qualifications. Make a Difference as a Site Reliability Engineer in Austin! Are you a problem-solving Site Reliability Engineer...Full timeFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Site Reliability Engineer. Be the first to apply!
- site reliability engineer remote Austin, TX
- site reliability engineer sre Austin, TX
- site reliability engineer Austin, TX
- senior cloud service delivery manager Austin, TX
- senior business analyst contract Austin, TX
- senior product design engineer Austin, TX
- senior game producer Austin, TX
- senior software manager Austin, TX
- senior manager business analytics Austin, TX
- senior marketing account manager Austin, TX

