Principal Site Reliability Engineer

$232.9k - $335.81k

Uniphore Technologies North America Inc

Job Description

Uniphore is a leading B2B AI‑native company building multimodal architecture that combines Generative AI, Knowledge AI, Emotion AI, workflow automation, and co‑pilot guidance to create human‑centric processes and experiences for customers and employees.

About the Role

We are looking for a Principal Site Reliability Engineer to join our Platform Engineering team. The highest‑leverage work isn't a runbook; it is the service that enforces the runbook automatically. You will write production Go that runs across hundreds of services, build standards, frameworks, automations, agentic workflows, and self‑service capabilities that enable engineering teams while maintaining enterprise‑grade reliability and security. You will implement these in code, such as a Kubernetes Operator that enforces service readiness, a service that surfaces SLO health across the fleet, and an internal platform service that automates task execution.

You will collaborate with feature teams as an expert advisor, maintaining oversight of our single/multi‑tenant, multi‑cloud infrastructure. This senior individual‑contributor role focuses on large‑scale, resilient, automated infrastructure rather than daily firefighting and involves on‑call participation covering all production systems.

Responsibilities
  • Invention: Define and execute long‑term architectural strategy for our multi‑cloud platforms.
  • Own: Provide technical leadership through design reviews and code contributions, set technical direction, eliminate architectural barriers, and drive simplicity.
  • Act as a key technical advisor to Engineering Leadership and Product Management, influencing strategic direction.
  • Lead design reviews across Infrastructure focusing on automation, scalability, and reliability; align architectural roadmaps across teams.
  • Partner with Security to build secure‑by‑default systems and remediate weaknesses.
  • Own the reliability of the systems under your technical stewardship.
  • Create the technical clarity (vision, standards, and tooling) that lets feature teams build, own, and operate their services.
  • Participate in fleet‑wide on‑call, owning critical escalations across all production systems and converting recurring failure modes into permanent architectural fixes.
  • Teach: Establish and evangelize design principles for reliable, secure, scalable systems; grow other engineers through mentorship and design review.

Requirements
  • 10+ years in DevOps/SRE/Platform Engineering, with demonstrated Staff‑ or Principal‑scope impact and a track record of transforming operational models.
  • Production Go: you write Go regularly, understand its concurrency model, and can own Go services in production.
  • Kubernetes depth: operational expertise and ability to extend it: understand the controller‑runtime model and write or maintain a Kubernetes Operator.
  • Cloud & infrastructure: expert‑level AWS/GCP/Azure, Terraform, and multi‑cloud architecture, with strong cost‑optimization instincts.
  • Production excellence: deep incident management, RCA process, and on‑call system design experience.
  • Software engineering fundamentals: API design, testing, observability instrumentation, and service lifecycle ownership.
  • Standards & documentation: strong technical writing; create operational procedures teams can self‑execute.
  • Architecture & planning: RFC/PRD review experience; catch operational problems at design time.
  • Collaboration & coaching: build team capability through tooling and knowledge transfer rather than doing the work for them.

Nice to Haves
  • Building Kubernetes Operators, controllers, or admission webhooks (controller‑runtime, kubebuilder).
  • Contributions to open‑source infrastructure tooling.
  • AWS Solutions Architect Professional or equivalent GCP/Azure certifications.
  • Kubernetes certifications (CKA, CKAD, CKS).
  • Platform engineering, developer experience, or internal developer portals (Backstage, etc.).
  • GitOps patterns (ArgoCD, Flux) and policy‑as‑code tooling (OPA, Kyverno).

Why You’ll Love This Role

Your code is your leverage. Solutions you ship multiply across dozens of services and teams, preventing entire classes of problems rather than patching instances. You’ll shape the platform strategy, drive the transformation from reactive support to strategic platform partnership, and tackle the hardest problems such as multi‑tenant architecture scaling, cross‑service observability, and reliability challenges for our largest enterprise deployments. You will set the bar, define standards, incident‑management frameworks, and service‑ownership models that let teams graduate to full operational independence.

Hiring Range

$232,900 – $335,811 OTE, for Primary Location Palo Alto, CA

Benefits
  • Competitive base pay with annual incentive based on target achievement.
  • Pre‑IPO stock options.
  • Medical, dental, vision, 401(k) with match, and other health benefits.
  • Generous paid time off, paid holidays, paid birthday day off, and other leave policies.

Location

USA – CA – Palo Alto

Equal Opportunity Employer

Uniphore is an equal opportunity employer committed to diversity in the workplace. We evaluate qualified applicants without regard to race, color, religion, sex, sexual orientation, disability, veteran status, and other protected characteristics.

Vacancy posted 4 days ago
