Principal Site Reliability Engineer

$232.9k - $335.81k

Uniphore Technologies North America Inc

Job Description Uniphore is a leading B2B AI‑native company building multimodal architecture that combines Generative AI, Knowledge AI, Emotion AI, workflow automation, and co‑pilot guidance to create human‑centric processes and experiences for customers and employees. About the Role We are looking for a Principal Site Reliability Engineer to join our Platform Engineering team. The highest‑leverage work isn't a runbook; it is the service that enforces the runbook automatically. You will write production Go that runs across hundreds of services, build standards, frameworks, automations, agentic workflows, and self‑service capabilities that enable engineering teams while maintaining enterprise‑grade reliability and security. You will implement these in code, such as a Kubernetes Operator that enforces service readiness, a service that surfaces SLO health across the fleet, and an internal platform service that automates task execution. You will collaborate with feature teams as an expert advisor, maintaining oversight of our single/multi‑tenant, multi‑cloud infrastructure. This senior individual‑contributor role focuses on large‑scale, resilient, automated infrastructure rather than daily firefighting and involves on‑call participation covering all production systems. Responsibilities Invention: Define and execute long‑term architectural strategy for our multi‑cloud platforms. Own: Provide technical leadership through design reviews and code contributions, set technical direction, eliminate architectural barriers, and drive simplicity. Act as a key technical advisor to Engineering Leadership and Product Management, influencing strategic direction. Lead design reviews across Infrastructure focusing on automation, scalability, and reliability; align architectural roadmaps across teams. Partner with Security to build secure‑by‑default systems and remediate weaknesses. Own the reliability of the systems under your technical stewardship. Create the technical clarity—vision, standards, and tooling—that lets feature teams build, own, and operate their services. Participate in fleet‑wide on‑call, owning critical escalations across all production systems and converting recurring failure modes into permanent architectural fixes. Teach: Establish and evangelize design principles for reliable, secure, scalable systems; grow other engineers through mentorship and design review. Requirements 10+ years in DevOps/SRE/Platform Engineering, with demonstrated Staff‑ or Principal‑scope impact and a track record of transforming operational models. Production Go: you write Go regularly, understand its concurrency model, and can own Go services in production. Kubernetes depth: operational expertise and ability to extend it—understand the controller‑runtime model and write or maintain a Kubernetes Operator. Cloud & infrastructure: expert‑level AWS/GCP/Azure, Terraform, and multi‑cloud architecture, with strong cost‑optimization instincts. Production excellence: deep incident management, RCA process, and on‑call system design experience. Software engineering fundamentals: API design, testing, observability instrumentation, and service lifecycle ownership. Standards & documentation: strong technical writing; create operational procedures teams can self‑execute. Architecture & planning: RFC/PRD review experience; catch operational problems at design time. Collaboration & coaching: build team capability through tooling and knowledge transfer rather than doing the work for them. Nice to Haves Building Kubernetes Operators, controllers, or admission webhooks (controller‑runtime, kubebuilder). Contributions to open‑source infrastructure tooling. AWS Solutions Architect Professional or equivalent GCP/Azure certifications. Kubernetes certifications (CKA, CKAD, CKS). Platform engineering, developer experience, or internal developer portals (Backstage, etc.). GitOps patterns (ArgoCD, Flux) and policy‑as‑code tooling (OPA, Kyverno). Why You’ll Love This Role Your code is your leverage. Solutions you ship multiply across dozens of services and teams—preventing entire classes of problems rather than patching instances. You’ll shape the platform strategy, drive the transformation from reactive support to strategic platform partnership, and tackle the hardest problems such as multi‑tenant architecture scaling, cross‑service observability, and reliability challenges for our largest enterprise deployments. You will set the bar, define standards, incident‑management frameworks, and service‑ownership models that let teams graduate to full operational independence. Hiring Range $232,900 – $335,811 OTE — for Primary Location Palo Alto, CA Benefits Competitive base pay with annual incentive based on target achievement. Pre‑IPO stock options. Medical, dental, vision, 401(k) with match, and other health benefits. Generous paid time off, paid holidays, paid birthday day off, and other leave policies. Location USA – CA – Palo Alto Equal Opportunity Employer Uniphore is an equal opportunity employer committed to diversity in the workplace. We evaluate qualified applicants without regard to race, color, religion, sex, sexual orientation, disability, veteran status, and other protected characteristics. #J-18808-Ljbffr Uniphore Technologies North America Inc

Apply

Vacancy posted 4 days ago

Similar jobs that could be interesting for youBased on the Principal Site Reliability Engineer in Palo Alto, CA vacancy

Principal Site Reliability Engineer (CIPE)
Job Summary Note: This role requires US Citizenship. Your Career As a Principal Site Reliability Engineer, you will serve as the technical authority for our cloud-native infrastructure. You’re responsible for architecting the reliability, scalability, and security of a...
Principal
Visa sponsorship
Work visa
Shift work
Palo Alto Networks, Inc.
Santa Clara, CA
1 day ago
Principal Site Reliability Engineer (AIOps)
$151.6k - $245.3k
Job Summary Palo Alto Networks runs a large hybrid infrastructure and is one of the largest GCP customers. As a Site Reliability Engineer, you will be part of a team supporting the services running on this infrastructure. This includes automation, architecture, performance...
Principal
Palo Alto Networks, Inc.
Santa Clara, CA
4 days ago
Principal Site Reliability Engineer
$202k - $247k
...customers worldwide. Our team is growing, and we are looking for engineers with passion for automation. You will help support the... ...and automate our infrastructure. About this role: As a Principal Site Reliability Engineer at FortiCNAPP, you will lead the design,...
Principal
Full time
Worldwide
Zoomcar
Santa Clara, CA
14 hours ago
Principal Site Reliability Engineer ( U.S Citizenship required )
$151.6k - $245.3k
...outcomes. About the Role Palo Alto Networks runs a large infrastructure and is one of the largest GCP customers. As a Principal Site Reliability Engineer for the ADEM (Autonomous Digital Experience Management) team, you will be part of a team supporting the services that...
Principal
Full time
Work at office
Visa sponsorship
Work visa
Palo Alto Networks, Inc.
Santa Clara, CA
2 days ago
Senior Site Reliability Engineer - Remote & Scalable Impact
...join our small team focused on growth and productivity. The role involves scaling our platform and infrastructure while enhancing reliability and the overall developer experience. Ideal candidates will have strong expertise in distributed systems, cloud-native...
Suggested
Remote job
BuildBuddy
Palo Alto, CA
4 days ago
Principal Software Development Engineer
...situated within the S3 Organization, you will serve as a Principal Software Development Engineer dedicated to the development of a novel aircraft... ...coding standards, and requisite tools to maintain a highly reliable software development environment. You will provide technical...
Principal
Wisk Aero LLC
Mountain View, CA
3 days ago
Senior Site Reliability Engineer
The Role We're looking for a Senior Site Reliability Engineer to own the reliability, scalability, and operational excellence of the production systems that power Nectar's platform. We run high-volume data ingestion pipelines and real-time AI agents on top of a fast-growing...
Nectar
Palo Alto, CA
4 days ago
Senior Site Reliability Engineer
$140k - $220k
About the Job You’ll own reliability and operational excellence for Pylon’s production systems. This means designing and implementing... ...scale as we grow. You’ll build tooling that makes the entire engineering team more effective, establish on‑call rotations and runbooks...
Pylon
Palo Alto, CA
1 day ago
Senior Site Reliability Engineer (SRE)
$181k - $197k
Senior SRE Palo Alto, CA • Engineering • Hybrid • Full-time Founded by a team of ex-Apple engineers, Instrumental provides a collection... ...on, and measuring KPIs to ensure ongoing performance, reliability and efficiency. Network/application security and compliance...
Full time
Clutch Canada
Palo Alto, CA
3 days ago
Senior Principal Engineer, AI Systems & Platforms
ATX Venture Partners seeks a Principal Engineer to drive technology initiatives and create scalable solutions. You'll develop systems in a highly collaborative environment, utilizing both front-end and back-end technologies, particularly in AI domains. The ideal candidate...
Principal
ATX Venture Partners
Mountain View, CA
3 days ago
Senior Site Reliability Engineer / DevOps Engineer
...Infrastructure Footprint: Global production infrastructure across AWS, South America, and Europe Role Overview Seeking a Senior Site Reliability Engineer / DevOps Engineer to design, scale, and operate highly available global infrastructure supporting production systems...
Prophet Town
Mountain View, CA
1 day ago
Senior/Staff Site Reliability Engineer
$180k - $260k
...facilitating effortless integration into customers’ logistics operations. About the role We are seeking an experienced Senior/Staff Site Reliability Engineer to support the operation, monitoring, and scaling of our growing fleet of autonomous vehicles. In this role, you will work...
Odd job
Work at office
Remote work
Booster
Mountain View, CA
4 days ago
Principal Software Engineer
...Principal Software Engineer About the Team: This role is for the Cloud Engineering team within Cornerstone. The team is responsible for creating and managing Cornerstone’s cloud infrastructure and related DevOps tooling automation. As a Principal Software Engineer you...
Principal
Namely
Mountain View, CA
2 days ago
Principal Wi‑Fi Software Engineer — Global Mesh (Equity)
...SPACE EXPLORATION TECHNOLOGIES CORP seeks a Principal Wi‑Fi Software Engineer for their Starlink consumer product line. This role will drive Wi‑Fi software development and roadmap, collaborating with engineers to enhance customer experience. The ideal candidate will...
Principal
Jobleads-US
Palo Alto, CA
3 days ago
Site Reliability Engineer
...technologies. Our mission is to double America’s compute capacity without building new data centers. We are seeking a skilled Site Reliability Engineer to join our growing team. The ideal candidate will help ensure the reliability, scalability, and performance of our hybrid...
Work at office
Weekend work
FLUIX
Palo Alto, CA
4 days ago
Site Reliability Engineer - 2
$86.33k - $191.9k
...guardrails to make going fast also going safely. Identifying reliability anti-patterns and solving them systemically . You dive deep into... ...of AI‑assisted developer tools and platforms to increase engineering productivity, enforce code quality standards, and enable real...
Local area
Flexible hours
Traveltechessentialist
Palo Alto, CA
2 days ago
Site Reliability Engineer
Site Reliability Engineer Onsite- Bay Area, CA Skills Relevant Skills and Experience What You’ll Do (Day-to-Day) Own and manage our cloud infrastructure (GCP or AWS, on-prem). Build, maintain, and optimize Kubernetes clusters (including GPU-backed clusters). Implement...
Amiri Recruiting
Mountain View, CA
10 days ago
Senior Director, AI-Driven Site Reliability Engineering
JPMorgan Chase & Co. is seeking a Director of Site Reliability Engineering to partner with the Infrastructure Platforms and Foundational Services team in Palo Alto. This role involves guiding stakeholders through complex projects, leading the application of AI capabilities...
JPMorgan Chase & Co.
Palo Alto, CA
14 hours ago
Site Reliability Engineer - Cybersecurity
$180k - $360k
...knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who... ...Cybersecurity / SRE team is focused on ensuring the security and reliability of X Money. This role will primarily focus on the X Money platform...
Temporary work
Relocation
Pantera Capital
Palo Alto, CA
2 days ago
Senior Site Reliability Engineer: Cloud, Kubernetes & CI/CD
A leading tech recruiting firm is seeking a Site Reliability Engineer to manage and optimize cloud infrastructure primarily using GCP or AWS. The role involves maintaining high availability through Kubernetes clusters and improving CI/CD pipelines with Terraform. Ideal...
Amiri Recruiting
Mountain View, CA
4 days ago
Cloud Payments Platform Architect, Senior Principal
TwinThread in Palo Alto, CA is looking for a Senior Principal Architect to spearhead the modernization of payments platforms. The ideal candidate will have over 10 years of Java development experience and strong expertise in cloud-native applications. You will work closely...
Principal
TwinThread LLC
Palo Alto, CA
4 days ago
Senior Site Reliability Engineer - Cloud AI Infrastructure
Cerebras is looking for a Senior Site Reliability Engineer to join their Infrastructure team in Palo Alto, California. This role involves designing and optimizing infrastructure for distributed AI applications, contributing to the open-source Ray project, and ensuring high...
Cerebras
Palo Alto, CA
14 hours ago
Principal Platform Engineer, AI Infra & Cloud
$280k - $350k
Join careers.bitkraft.vc as a Staff / Principal Platform Engineer in Mountain View, California. In this role, you will take ownership of building and scaling AI products, manage cloud infrastructure, and enhance engineering workflows. With a hybrid work model, we seek an...
Principal
Relocation
careers.bitkraft.vc - Jobboard
Mountain View, CA
3 days ago
Principal Software Engineer - AI, End-to-End & Scale
Intuit Inc. in Mountain View, California is seeking a Principal Software Engineer to lead significant technology initiatives. The role involves driving customer-oriented solutions, developing complex distributed systems, and leveraging AI technologies. The ideal candidate...
Principal
Intuit
Mountain View, CA
14 hours ago
Principal Cloud Platform Software Engineer
$210k - $247k
...Mainspring over traditional options like engines, turbines, and fuel cells to quickly and reliably deliver local power for EV... ...delivering resilient, on-site power for commercial, industrial... ...a high-impact opportunity for a Principal Cloud Platform Software Engineer...
Principal
Local area
Remote work
Flexible hours
Ring
Menlo Park, CA
4 days ago
Principal Software Engineer - Fintech Risk Platform
$261.5k - $353.5k
Intuit Inc. is looking for a Principal Software Engineer in Mountain View, California, to lead the technology vision and architecture for its Fintech Risk Platform. This critical role requires expertise in engineering leadership, building high-impact distributed systems...
Principal
Intuit
Mountain View, CA
1 day ago
Senior Site Reliability Engineer, Platform Infrastructure (Foundations)
...Andreessen Horowitz, NEA, and Addition with $250+ million raised to date. About the role Anyscale is looking for a Senior Site Reliability Engineer to join the Infrastructure team. Anyscale aims to provide the next generation of tools and infrastructure to make...
Cerebras
Palo Alto, CA
1 day ago
Principal Security Software Engineer, AI-Driven Defense
$220.5k - $300k
A leading aerospace company in California seeks a Principal Security Software Engineer for its Starshield project. The role involves developing security-critical software and leveraging AI to enhance security measures. Candidates must have a bachelor's degree and at least...
Principal
Jobleads-US
Palo Alto, CA
5 hours ago
Director of Site Reliability Engineering
...make a meaningful impact. Partner with an organization committed to defining the future of site reliability in the financial sector. As a Director of Site Reliability Engineering at JPMorgan Chase within the Infrastructure Platforms and Foundational Services (IPFS)...
JPMorgan Chase & Co.
Palo Alto, CA
14 hours ago
Staff Site Reliability Engineer
$150k - $180k
...financial, environmental, and innovation outcomes. Role Verrus is looking for candidates to serve as software-focused Senior Site Reliability Engineer at Verrus. This is a full‑time position based out of the Mountain View, CA office. Verrus takes a very technology‑forward...
Full time
Work at office
Local area
Flexible hours
Verrus, LLC
Mountain View, CA
3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Principal Site Reliability Engineer. Be the first to apply!