Senior SRE - AI-Driven Reliability for SaaS Platforms
CERTIFID
Cybercrime is rising, reaching record highs in 2024. According to the FBI's IC3 report , total losses exceeded $16 billion. With investment fraud and BEC scams at the forefront, the message is clear: the real estate sector remains a lucrative target for cybercriminals. At CertifID, we take this threat seriously and provide a secure platform that verifies the identities of parties involved in transactions, authenticates wire transfer instructions, and detects potential fraud attempts. Our technology is designed to mitigate risks and ensure that every transaction is conducted with confidence and peace of mind. We know we couldn’t take on this challenge without our incredible team. We have been recognized as one of the Best Startups to Work for in Austin, made the Inc. 5000 list , and won Best Culture by Purpose Jobs three years in a row. We are guided by our core values and our vision of a world without wire fraud. We offer a dynamic work environment where you can contribute to meaningful impact and be part of a team dedicated to enhancing security and fighting fraud. We are seeking a Senior Site Reliability Engineer (Senior SRE) to drive reliability improvements across our production SaaS environment. You’ll play a critical role in building scalable infrastructure patterns, advancing observability, improving incident response, and partnering with engineering teams to embed reliability into system design and delivery. This role is ideal for an experienced Sr. SRE who enjoys solving complex operational problems, building automation, and mentoring others. What You’ll Do Reliability & Platform Operations: Own and improve the reliability, availability, and performance of production systems while defining and operationalizing SLIs/SLOs and error budgets. AI Agent Enablement : Design and implement autonomous and semi-autonomous AI agents for monitoring distributed systems and applications. Build agents capable of consuming multi-source observability data (metrics, logs, traces, etc.). Incident Response: Participate in and help lead an on-call rotation, serving as an escalation point for major incidents and facilitating blameless postmortems. Automation & Infrastructure: Build automated workflows to eliminate manual work and design/maintain Infrastructure-as-Code with Terraform. Observability: Improve metrics, logs, traces, and alerting using tools like Datadog or Prometheus to reduce noise and increase signal. Collaboration & Mentorship: Partner with application teams to implement reliability best practices and mentor junior engineers to foster a culture of knowledge sharing. Who You Are Strategic Architect: You look beyond the "what" to understand the "why," providing insights that influence our GTM and technical direction. Startup Veteran: You are comfortable moving fast and staying proactive in an environment where the playbook is still being written. Relatable & Adaptable: You can navigate different personalities across the organization, from high-energy sales teams to analytical engineering partners. Lifelong Learner: You have a thirst for learning, keeping up with emerging technologies and industry trends. What We're Looking For Experience: 5+ years in SRE, DevOps, Platform Engineering, or Infrastructure Engineering. Cloud Expertise: Proven experience supporting production SaaS systems in Azure (preferred), AWS, or GCP. Technical Stack: Strong Linux, networking, and distributed systems troubleshooting skills. Containers: Strong experience with containers and orchestration (Kubernetes/EKS/AKS). IaC & Tooling: Expertise with Infrastructure-as-Code (Terraform strongly preferred). Programming: Strong scripting/programming skills in Python, Go, Bash, or C#/.NET. Observability: Hands-on experience with Datadog, Prometheus/Grafana, or OpenTelemetry. What We Offer Flexible vacation 12 company-paid holidays 10 paid sick days No work on your birthday Health, dental, and vision Insurance (including a $0 option) 401(k) with matching, and no waiting period Equity Life insurance Generous parental paid leave Wellness reimbursement of $300/year Remote worker reimbursement of $300/year Professional development reimbursement Competitive pay An award-winning culture Not sure if you check all the boxes? Apply anyway! We know that great talent comes in many forms, and we value potential just as much as experience. If you're excited about this role and believe you can grow into it, we’d love to hear from you. We’re looking for people who are eager to learn, adapt, and solve challenges—so if that sounds like you, don’t let a checklist hold you back! Change doesn't happen overnight, and the same goes for us here at CertifID. We evolve collectively and individually as we grow by leaning into the core values that define us. As we grow, we embody GRIT —collectively and individually—to raise the bar and influence outcomes in everything we do. Guard the Customer - Raise the Bar - Influence Outcomes - Teamwork Wins #J-18808-Ljbffr CERTIFID
- ...tech company specializing in commerce infrastructure is seeking a Senior SRE in Austin, Texas. In this role, you will manage the reliability and scalability of their multi-cloud infrastructure, engage in AI-assisted development, and automate various workflows. Ideal...Senior
- ...Overview UJET leads the way in AI-powered contact center... ...a future-proof, cloud platform that redefines the... ...accelerated growth in the AI-driven world. We’re looking for a Senior Site Reliability Engineer to help build and scale a high-impact SRE function. You’ll be a...PlatformSenior
- Indeed, Inc. is seeking a Software Engineer III to design and maintain data infrastructure for our database platform team. You will enhance reliability and simplify adoption for engineers by collaborating with site reliability engineers and application teams. The ideal...PlatformSenior
- ...location(s). As a Senior Site Reliability Engineer within the CET... ...'s mobile and digital platforms. You will lead efforts... ...Implement and evolve SRE best practices (SLOs,... ...Explore the use of AI and automation to improve... ...reviews, and automation-driven improvements ~...PlatformSeniorFull timeWork at office
$152k - $241.5k
Senior Site Reliability Engineer - HPC page is loaded## Senior Site... ...looking for a Senior SRE to join our Compute Farm... ...of our global services platform. At NVIDIA, you’ll... ...harness the power of AI to deliver groundbreaking... ...observability or data-driven operations (AIOps/ML-driven...PlatformSenior- ...Senior Site Reliability Engineer Austin, Texas, United States Who We Are... ...world-class experiences across platforms. But what truly powers 2K is... ...work. The Team The 2K SRE team owns the infrastructure... ...scale. Experience with AI and Agentic Development....PlatformSenior
- Apple Inc. in Austin, Texas is seeking a Data Platform SRE to develop and operate large-scale big data... ...applications including analytics and AI/ML apps, optimizing performance, and automating operations for reliability. This position requires strong programming skills...Platform
$121.4k - $218.6k
...challenges?** **Join our critical AI Hardware SRE Team!** The AI Hardware SRE... ...best-in-class uptime and reliability of our AI hardware... ...when they are breached. As a Senior Site Reliability Engineer, you... ...the world's most distributed platform from Cloud to Edge to help the...PlatformSeniorWork experience placementWork at office$109.5k - $150.55k
...looking for an experienced Sr Site Reliability Engineer to be part of the... ...who has been involved in the SRE implementation journey at... ...highly available distributed SaaS platform used by millions of K-12 students... .... Explore and integrate AI tooling into the SRE...PlatformSeniorFor contractorsLocal areaRemote workWorldwideWork visaFlexible hoursWeekend work- ...a voice-first communication platform, powered by our industry-leading... ...come in. We're seeking a Senior Site Reliability Engineer who can own our... ...We're investing in AI to compress incident response... ...what that looks like for an SRE and excited to help shape it...PlatformSeniorPermanent employmentLocal areaFlexible hours
- A dynamic SaaS company in Austin is seeking an experienced Account Executive to drive sales of data-driven marketing platforms. In this role, you will develop strategic sales plans, build relationships with key decision-makers, and use a consultative sales approach. With...PlatformSenior
- Dimensional Fund Advisors is seeking a Senior SRE in Austin, Texas, to manage the developer tooling ecosystem. This hybrid position will... .... The ideal candidate will have extensive experience in site reliability engineering, knowledge of Python and .NET tooling, and strong...PlatformSenior
- ...is the #1 TV streaming platform in the U.S., Canada, and... ...Learning, AI, Control and Optimization... ...talented and experienced Senior Software Engineer, MLOps... ...strong background in DevOps/SRE practices, cloud infrastructure... ...to enable fast, reliable, and safe production releases...PlatformSeniorWork at officeLocal areaRemote workMonday to ThursdayFlexible hours
- A technology-driven home care company in Austin, Texas is seeking a Senior Product Manager to lead product outcomes in a fast-paced... ...management, particularly in B2B SaaS startups, and possess strong... ...to transforming home care through AI technologies. #J-18808-Ljbffr Sensi...Senior
$79.1k - $158.2k
...according to terms for reliability and functionality. ~... ...and escalate issues to senior team members. Collects... ...workflows. ~ Extend APIs and platform automation to drive... ...Skills: ~5+ years in SRE, Infrastructure, or... ...-saving care. And with AI embedded across our products...PlatformSeniorTemporary workImmediate startFlexible hoursShift work- ...Manager to lead the development of their Enterprise Generative AI platform. The ideal candidate will be responsible for ensuring security,... ...and optimal integration of AI technologies. Proven expertise in SaaS products and a strong commitment to user-centric design are...PlatformSenior
- ...recurring revenue SaaS business with more... ...focused, execution-driven, and committed to operational... ...is looking for a Senior Analytics Engineer... ...layer of our data platform, taking raw data... ...best practices. AI-Augmented Development... ...and operations into reliable data models. You...PlatformSeniorFull timeTemporary workWork at officeLocal areaFlexible hoursShift work
- ClosedLoop in Austin, Texas is looking for a Senior Platform Engineer to build and maintain the... ..., and ensuring high standards of reliability and security. The ideal candidate has 5... ...commitment to team collaboration and continuous improvement. #J-18808-Ljbffr ClosedLoop.aiPlatformSeniorFlexible hours
- ...delivering innovative, reliable, and scalable technology... ...of cloud and AI capabilities that power... ...support AI/ML and data platforms, enabling resilient, scalable... ...functionally with architects, SRE teams, and engineering... ...success. Our purpose-driven, supportive culture,...PlatformSeniorWork at office
- General Motors in Austin, Texas, is seeking a Senior Software Engineer to develop data-driven and AI-enabled software solutions for engineering teams. This role... ...in Java, Python, SQL, and experience with cloud platforms like Azure and tools such as Databricks and Kubernetes...PlatformSenior
- WRITER is seeking a Senior Support Engineer based in Austin to assist developer personas... ...technical users in achieving success with our AI platform. The ideal candidate will have over 5... ...experience in technical support for B2B SaaS, strong skills in cloud technology and Python...PlatformSenior
- ...the technical direction of cloud infrastructure and integrate AI-driven solutions that define the Apple brand, impacting millions of customers... ...over 12 years of experience, including expertise in cloud platforms and a strong grasp of DevOps practices. Additionally, a...PlatformSenior
$170k - $190k
UJET leads the way in AI‑powered contact center innovation... ...a future‑proof, cloud platform that redefines the... ...growth in the AI‑driven world. Lead UJET’s Core... ...upgrades. Building scalable, reliable infrastructure that... ...companies, preferably in SaaS or CCaaS. Comfort...PlatformSeniorLocal area- ...components of the platform. Over time, you will... .... Partner with senior engineering staff... ...explicit, action‑driven backend transitions... ...Support platform reliability by writing high‑quality... ...incorporating AI rules, context, and... ...with multi‑tenant SaaS environments or systems...PlatformSeniorWork at office
$125.9k - $148.1k
...Full-Stack Senior Software Engineer At Armanino... ...leverage generative AI in prototyping and product... ...maintain a secure, reliable, and scalable, and efficient platform spanning back-end... ..., event-driven systems, and microservices... ...building multi-tenant SaaS applications Experience...PlatformSeniorContract workLocal areaFlexible hours- ...BookedBy’s scheduling platform has more than 60... ...Summary: We are seeking a Senior DevOps Engineer to... ..., ArgoCD for GitOps-driven deployments).... ...documentation to improve reliability and response. Use AI-driven monitoring, anomaly... ...+ years of DevOps / SRE / Platform...PlatformSeniorWork at officeWorldwide3 days per week
- A leading AI logistics solutions provider based in Austin is seeking a Senior Account Executive to drive sales and revenue in the logistics sector. The ideal candidate... ...5 years of B2B sales experience, preferably in SaaS solutions, and a strong understanding of AI technology...Senior
- Senior Software Engineer — Observability Austin... ...that must be fast, reliable, and secure at massive... ...the observability platforms, reliability tooling, and AI‑powered automation that... ...pipelines, event‑driven architectures, API design... .... Familiarity with SRE practices including...PlatformSeniorWorldwide
- ...branded shopping experiences across every AI channel. We work with enterprise... ...to match. The role We’re looking for a Senior SRE to own the reliability, scalability, and operational posture... ...experience at a startup or high‑growth SaaS company Familiarity with API gateway infrastructure...Senior
- ...enables fast, policy-driven action from a... ...view. This is a **Senior Staff** role: you... ...data engineering at SaaS scale.* 6+ years owning... ...commercial data platforms or major data... ...latency and high reliability.* Deep understanding... ...Familiarity with AI.* Familiarity with...PlatformSeniorLocal area
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior SRE - AI-Driven Reliability for SaaS Platforms. Be the first to apply!
- senior cloud service delivery manager Austin, TX
- senior business analyst contract Austin, TX
- senior product design engineer Austin, TX
- senior game producer Austin, TX
- senior software manager Austin, TX
- senior manager business analytics Austin, TX
- senior marketing account manager Austin, TX
- senior marketing manager Austin, TX
- senior contracts analyst Austin, TX
- sr operations manager Austin, TX

