Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior / Staff Site Reliability, Platform Engineering

Saviynt

Job Description

Job Description

About Saviynt

 

Saviynt is a leader in identity security, delivering an AI-powered platform that governs and secures access to applications, data, and business processes for global enterprises and government institutions. Built for the AI era, Saviynt helps organizations move faster—securely and compliantly.

 

 

Why This Role Matters

 

Saviynt’s SaaS platform runs on complex, distributed, cloud-native systems. As a Staff Platform Engineer, you will play a critical role in ensuring these systems remain highly available, scalable, and secure as the company grows.

 

This is a hands-on engineering and technical leadership role. You will own reliability for major platform domains, design scalable solutions on Kubernetes and AWS, and drive automation and reliability improvements across multiple teams.

 

 

What You’ll Do

 

In this pivotal role, you will be instrumental in designing, building, and maintaining the shared infrastructure services and platforms that our product and application teams will depend on

 

You will focus on creating reusable, reliable, and scalable solutions that abstract away complexity, enabling other teams to focus on their core business logic and deliver features faster in a multi-cloud environment

 

Design and build core platform components and shared infrastructure services that other development teams will integrate with and leverage to deploy and operate their applications

 

Architect, implement, and manage highly available and scalable Kubernetes platforms as a service for internal consumers

 

Develop robust, internal-facing tools and automation for infrastructure provisioning and management primarily using Go (Golang)

 

Architect and optimize foundational solutions within Cloud environments (AWS, Azure, etc.), focusing on creating reusable patterns and modules for other teams

 

Design and implement shared Event-Driven Architecture components and messaging platforms using technologies like Kafka or Google Pub/Sub that product teams can easily utilize

 

Develop and maintain robust CI/CD pipelines (e.g., GitLab CI and ArgoCD) as a service, providing standardized and automated deployment workflows for various development teams

 

Design and build resilient Distributed Systems components that serve as building blocks for other applications, focusing on reliability, fault tolerance, and performance

 

Manage and optimize our shared infrastructure across Multi-Region Cloud Environments, ensuring that platform services are globally available and performant for all consumers

 

Establish and enhance centralized Observability and Monitoring platforms and tools that provide self-service insights for consuming teams

 

Define and implement clear, well-documented RESTful API designs for the infrastructure services you build, ensuring ease of integration for internal clients

 

Implement and manage Service Mesh (e.g., Envoy, Istio) capabilities, providing traffic management, security, and policy enforcement as a shared platform for services

 

Design, implement, and optimize highly available Relational Database services or shared data platforms for broad organizational use

 

Collaborate closely with product development teams to understand their infrastructure needs and pain points, providing technical guidance and support

 

Participate in on-call rotations to support the critical shared infrastructure you build

 

 

What We’re Looking For

 

6+ years of experience in an Infrastructure Development, Platform Engineering, or Site Reliability Engineering role, with a strong focus on building tools and services for other engineers

 

Deep expertise with Kubernetes in production environments, particularly in providing it as a platform(i.e single tenant and multi-tenant deployment architectures)

 

Strong programming skills in Go (Golang) and Python, with experience building robust, maintainable backend services and automation

 

Extensive hands-on experience with at least one major Cloud Provider (AWS, GCP, or Azure); multi-cloud experience is a strong plus, especially in building abstractions over them

 

Proven experience designing and implementing Event-Driven Architecture and message queuing systems (e.g., Kafka, RMQ, NATS) as shared services

 

Solid understanding and practical experience with CI/CD pipeline tools (especially GitLab CI) and experience establishing automated delivery processes for other teams

 

Demonstrable experience designing and operating Distributed Systems, with an understanding of patterns for creating reliable, shared components

 

Familiarity with Multi-Region Cloud Environments and strategies for building globally distributed and highly available platform

 

Proficiency in establishing and utilizing comprehensive Observability and Monitoring platforms (e.g., Prometheus, Grafana, ELK stack, Datadog) for shared infrastructure

 

Strong experience with RESTful API design principles and building well-documented, consumable APIs

 

Knowledge of Service Mesh concepts and practical experience with solutions like Istio in a platform context

 

Hands-on experience with Relational Databases (e.g., MySQL, PostgresSQL), ideally in managing them as a service

 

Excellent communication skills and the ability to clearly articulate complex technical concepts to both technical and non-technical audiences

 

A strong customer-centric mindset, treating internal development teams as your primary customers

 

Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience or equivalent military experience required

 

 

Why Join Saviynt

 

•        Work on a large-scale, cloud-native SaaS platform

•        Solve complex reliability challenges at scale

•        Influence platform architecture and engineering practices

•        Competitive compensation, benefits, and career growth

 

 

Security & Compliance

 

This role requires adherence to Saviynt’s information security and privacy policies, including annual security training.

 

 

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses and identifying potential inconsistencies or verification signals in application materials based on available information. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Vacancy posted 10 days ago
Similar jobs that could be interesting for youBased on the Senior / Staff Site Reliability, Platform Engineering in Milpitas, CA vacancy
  • $176k - $276k

    Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high...  ...aspects of large scale Observability & Telemetry collection platform with a focus on performance at scale, real time monitoring... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $180.5k - $270.7k

    Qualcomm is seeking an experienced Thermal Engineer to develop high-performance thermal solutions for data center applications in Santa Clara, California. The role involves hands-on lab work, thermal testing, and collaboration with cross-functional teams. The ideal candidate... 
    Senior

    Qualcomm

    Santa Clara, CA
    5 days ago
  •  ...AWS, South America, and Europe Role Overview Seeking a Senior Site Reliability Engineer / DevOps Engineer to design, scale, and operate highly available...  ...lessons that come from operating Kubernetes and cloud platforms at scale. The ideal candidate has deep hands‑on... 
    Senior

    Prophet Town

    Mountain View, CA
    4 days ago
  •  ...technology leader is looking for an experienced SRE software engineer in Cupertino, California, to build and enhance compute infrastructure...  .... Applicants should have at least 8 years of experience in site reliability engineering, a strong background in cloud infrastructure, and... 
    Senior

    Apple Inc.

    Cupertino, CA
    1 day ago
  •  ...love to talk. This is more than a job—it’s a journey. Site Reliability Engineers (SREs) are responsible for the overall performance and reliability...  ...customer problems and deliver new features, not reinvent platforms. What you'll do Work with product engineering teams... 
    Senior
    Remote work

    ASAPP

    Mountain View, CA
    7 days ago
  •  ...Role Overview You will be building an AI Data Center AIOps platform that turns raw, high‑volume telemetry into reliable, job‑centric insights and automation for GPU fleets. Join our team of innovative engineers who are building this platform and operating it (not the compute... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  •  ...technology company is looking for a Java SRE Engineer to support large-scale cloud migrations...  ...lead migrations, design robust AWS EKS platforms, and implement deployment strategies....  ...with various teams to ensure reliability. This position is onsite in the San Francisco... 
    Senior

    EITACIES Inc.

    Santa Clara, CA
    2 days ago
  • $152k - $241.5k

     ...Overview We’re looking for a Senior SRE to join our Compute Farm...  ...generation of our global services platform. At NVIDIA, you’ll keep...  ...lifecycle management, fleet reliability/auto‑healing, E2E observability...  ...Perl, or Ruby. Mentored other engineers and influenced technical direction... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $174k - $252k

    Senior Software Engineer, Site Reliability Engineering X Applicants in San Francisco: Qualified applications with arrest or conviction records will be...  ...data centers to building the next generation of Google platforms, we make Google’s product portfolio possible. We’re proud... 
    Senior
    Full time

    Google Inc.

    Sunnyvale, CA
    2 days ago
  • $127.63k - $191.2k

     ...Marvell Central Engineering Hardware Design Engineer Marvell's semiconductor solutions are the essential building blocks of the data...  ...Design and architecture for central engineering hardware, platform development for on-board subsystems to support chip test infrastructure... 
    Senior
    Permanent employment
    Internship
    Work from home

    Marvell

    Santa Clara, CA
    4 days ago
  • A leading tech recruiting firm is seeking a Site Reliability Engineer to manage and optimize cloud infrastructure primarily using GCP or AWS....  ...requires no customer interaction and focuses on improving platform architecture and reliability. #J-18808-Ljbffr Amiri Recruiting
    Senior

    Amiri Recruiting

    Mountain View, CA
    2 days ago
  • $200k - $322k

    Senior Manager, Site Reliability Engineering page is loaded## Senior Manager, Site Reliability Engineeringlocations: US, CA, Santa Claratime type: Full...  ...Lead the development of automation and orchestration platforms that reduce manual effort across the outage lifecycle... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $126k - $204.5k

     ...delivers the industry’s most advanced SecOps platform, consisting of XDR, XSIAM, XSOAR, and...  ...you will collaborate closely with our engineering teams to develop innovative solutions...  ...of the product and ensure the reliability and availability of our services. Qualifications... 
    Senior

    Palo Alto Networks, Inc.

    Santa Clara, CA
    5 days ago
  • $180k - $260k

     ...operations. About the role We are seeking an experienced Senior/Staff Site Reliability Engineer to support the operation, monitoring, and scaling of...  ...role, you will work closely with our infrastructure and platform teams to manage rollouts of both on‑premises and cloud... 
    Senior
    Odd job
    Work at office
    Remote work

    Booster

    Mountain View, CA
    2 days ago
  • $145k - $165k

    A technology solutions firm in Sunnyvale, CA is looking for a highly experienced Site Reliability Engineer (SRE). This role involves maintaining uptime and performance across systems. Exceptional Linux expertise and automation skills in Bash and Python are crucial. Key... 
    Senior

    Bolt Graphics, Inc.

    Sunnyvale, CA
    4 days ago
  • $175.8k - $264.2k

    Senior Site Reliability Engineer - Apple Services Engineering (ASE) / iCloud Cupertino, CA People at Apple don't just build products - they craft experiences our customers love and depend on. Apple Services Engineering (ASE) builds and supports the systems that make many... 
    Senior

    Hong Kong Study Skills Research Institute

    Cupertino, CA
    5 days ago
  • $120.3k - $194.53k

     .... Job Summary Palo Alto Networks runs a large hybrid infrastructure across multiple public clouds. As a Site Reliability Engineer on the Internet Security Platform team, you will be part of a team supporting Advanced DNS Security services. This includes automation, architecture... 
    Senior
    Full time
    Work at office
    Visa sponsorship
    Work visa

    Palo Alto Networks, Inc.

    Santa Clara, CA
    3 days ago
  • $235k - $250k

     ...experienced Infrastructure Development professional to design and maintain critical shared services for our mission-critical SaaS platform. You will be instrumental in enabling teams to deliver features faster in a multi-cloud environment. The ideal candidate has 9+ years... 
    Senior

    Medium

    Milpitas, CA
    2 days ago
  • $264.51k

     ...experience in network management product development and virtualization technologies. This position offers a competitive salary of $264,514 per year and involves collaboration with engineers to create solutions for complex technical challenges. #J-18808-Ljbffr AIRSPAN CAREERS
    Senior

    AIRSPAN CAREERS

    Milpitas, CA
    4 days ago
  • $170k - $225k

     ...Senior Manager / Director, Product Marketing – Server Memory About the Role We're seeking...  ...Partner with Product Management, Engineering, and Sales to align investments with market...  ..., enterprise servers, and CPU platform transitions (Intel Xeon, AMD EPYC, ARM)... 
    Senior

    Scuf

    Milpitas, CA
    2 days ago
  • $230k - $250k

    Cerebras Systems in Sunnyvale, CA, seeks a Sr. Member of Technical Staff to develop resilient software for their AI chip. Responsibilities include designing robust software features, maintaining deployment workflows using AWS, and debugging software issues. Candidates should... 
    Senior
    Remote job

    Cerebras

    Sunnyvale, CA
    1 day ago
  • $281k - $356k

     ...can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver...  ...from a diverse set of sensors, enabling engineers like you to (1) develop methods for...  ...this hybrid role you will report to a Sr Staff Technical Lead Manager. You will: *... 
    Senior
    Full time
    Temporary work
    Remote work

    Waymo

    Mountain View, CA
    8 days ago
  • A global data and AI company is seeking a Senior Staff Technical Program Manager to lead Reliability initiatives within product engineering teams. This role requires over 10 years of experience in managing cloud infrastructure programs and driving improvements in reliability... 
    Senior
    Local area

    Databricks

    Mountain View, CA
    4 days ago
  • $163k - $261k

     ...future. The successful candidate will also be eligible for an annual bonus, equity compensation, and benefits. #LI-JM3 #Mid-Senior Working at Aurora At Aurora, we bring together extraordinarily talented and experienced people united by the strength of our... 
    Senior
    Work at office
    Local area
    3 days per week

    Aurora Innovation

    Mountain View, CA
    3 days ago
  • $300k - $334k

    Google Inc. is seeking a Senior Staff Technical Program Manager for the GeminiApp team in Mountain View, CA. This role involves translating...  ...inception to delivery, and cultivating partnerships across engineering teams. A minimum of 10 years of experience in technical... 
    Senior

    Google Inc.

    Mountain View, CA
    5 days ago
  • Estuate, Inc. in Milpitas, California, is looking for a software engineer with extensive experience in designing and supporting database systems. The role involves analyzing, implementing, and automating testing processes using tools like Databricks, Apache Spark, and... 
    Senior

    Estuate, Inc.

    Milpitas, CA
    3 days ago
  • $200k - $322k

    NVIDIA Gruppe in Santa Clara is seeking a Senior Staff Software Engineer to lead engineering efforts in their enterprise systems. Responsibilities include designing AI-driven workflows, managing enterprise issues with an automation focus, and mentoring team members. The... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $184k - $287.5k

    We are seeking software engineers to work on next-generation high-speed interconnect technologies. Our charter is to develop the most...  ...collaborating closely with hardware architects, silicon designers, and platform software experts to support innovation in factories and data... 
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • NVIDIA Corporation is seeking a Senior Staff Engineer for Enterprise Messaging Platforms to manage and enhance their global email and messaging infrastructure. This role involves architecting solutions with Microsoft Exchange and Azure services, ensuring high availability... 
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    5 days ago
  • $262k - $365k

    Google is seeking an experienced Software Engineer specializing in AI/ML in Mountain View, CA. You will lead project teams and develop large-scale recommendation models to grow the YouTube Shorts ecosystem, working in a dynamic, innovative environment. The role necessitates... 
    Senior

    Google

    Mountain View, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior / Staff Site Reliability, Platform Engineering. Be the first to apply!