Principal Site Reliability Engineer
$232.9k - $335.81kUniphore Technologies North America Inc
Job Description Uniphore is a leading B2B AI‑native company building multimodal architecture that combines Generative AI, Knowledge AI, Emotion AI, workflow automation, and co‑pilot guidance to create human‑centric processes and experiences for customers and employees. About the Role We are looking for a Principal Site Reliability Engineer to join our Platform Engineering team. The highest‑leverage work isn't a runbook; it is the service that enforces the runbook automatically. You will write production Go that runs across hundreds of services, build standards, frameworks, automations, agentic workflows, and self‑service capabilities that enable engineering teams while maintaining enterprise‑grade reliability and security. You will implement these in code, such as a Kubernetes Operator that enforces service readiness, a service that surfaces SLO health across the fleet, and an internal platform service that automates task execution. You will collaborate with feature teams as an expert advisor, maintaining oversight of our single/multi‑tenant, multi‑cloud infrastructure. This senior individual‑contributor role focuses on large‑scale, resilient, automated infrastructure rather than daily firefighting and involves on‑call participation covering all production systems. Responsibilities Invention: Define and execute long‑term architectural strategy for our multi‑cloud platforms. Own: Provide technical leadership through design reviews and code contributions, set technical direction, eliminate architectural barriers, and drive simplicity. Act as a key technical advisor to Engineering Leadership and Product Management, influencing strategic direction. Lead design reviews across Infrastructure focusing on automation, scalability, and reliability; align architectural roadmaps across teams. Partner with Security to build secure‑by‑default systems and remediate weaknesses. Own the reliability of the systems under your technical stewardship. Create the technical clarity—vision, standards, and tooling—that lets feature teams build, own, and operate their services. Participate in fleet‑wide on‑call, owning critical escalations across all production systems and converting recurring failure modes into permanent architectural fixes. Teach: Establish and evangelize design principles for reliable, secure, scalable systems; grow other engineers through mentorship and design review. Requirements 10+ years in DevOps/SRE/Platform Engineering, with demonstrated Staff‑ or Principal‑scope impact and a track record of transforming operational models. Production Go: you write Go regularly, understand its concurrency model, and can own Go services in production. Kubernetes depth: operational expertise and ability to extend it—understand the controller‑runtime model and write or maintain a Kubernetes Operator. Cloud & infrastructure: expert‑level AWS/GCP/Azure, Terraform, and multi‑cloud architecture, with strong cost‑optimization instincts. Production excellence: deep incident management, RCA process, and on‑call system design experience. Software engineering fundamentals: API design, testing, observability instrumentation, and service lifecycle ownership. Standards & documentation: strong technical writing; create operational procedures teams can self‑execute. Architecture & planning: RFC/PRD review experience; catch operational problems at design time. Collaboration & coaching: build team capability through tooling and knowledge transfer rather than doing the work for them. Nice to Haves Building Kubernetes Operators, controllers, or admission webhooks (controller‑runtime, kubebuilder). Contributions to open‑source infrastructure tooling. AWS Solutions Architect Professional or equivalent GCP/Azure certifications. Kubernetes certifications (CKA, CKAD, CKS). Platform engineering, developer experience, or internal developer portals (Backstage, etc.). GitOps patterns (ArgoCD, Flux) and policy‑as‑code tooling (OPA, Kyverno). Why You’ll Love This Role Your code is your leverage. Solutions you ship multiply across dozens of services and teams—preventing entire classes of problems rather than patching instances. You’ll shape the platform strategy, drive the transformation from reactive support to strategic platform partnership, and tackle the hardest problems such as multi‑tenant architecture scaling, cross‑service observability, and reliability challenges for our largest enterprise deployments. You will set the bar, define standards, incident‑management frameworks, and service‑ownership models that let teams graduate to full operational independence. Hiring Range $232,900 – $335,811 OTE — for Primary Location Palo Alto, CA Benefits Competitive base pay with annual incentive based on target achievement. Pre‑IPO stock options. Medical, dental, vision, 401(k) with match, and other health benefits. Generous paid time off, paid holidays, paid birthday day off, and other leave policies. Location USA – CA – Palo Alto Equal Opportunity Employer Uniphore is an equal opportunity employer committed to diversity in the workplace. We evaluate qualified applicants without regard to race, color, religion, sex, sexual orientation, disability, veteran status, and other protected characteristics. #J-18808-Ljbffr Uniphore Technologies North America Inc
- Job Summary Note: This role requires US Citizenship. Your Career As a Principal Site Reliability Engineer, you will serve as the technical authority for our cloud-native infrastructure. You’re responsible for architecting the reliability, scalability, and security of a...PrincipalVisa sponsorshipWork visaShift work
$151.6k - $245.3k
Job Summary Palo Alto Networks runs a large hybrid infrastructure and is one of the largest GCP customers. As a Site Reliability Engineer, you will be part of a team supporting the services running on this infrastructure. This includes automation, architecture, performance...Principal$202k - $247k
...customers worldwide. Our team is growing, and we are looking for engineers with passion for automation. You will help support the... ...and automate our infrastructure. About this role: As a Principal Site Reliability Engineer at FortiCNAPP, you will lead the design,...PrincipalFull timeWorldwide$151.6k - $245.3k
...outcomes. About the Role Palo Alto Networks runs a large infrastructure and is one of the largest GCP customers. As a Principal Site Reliability Engineer for the ADEM (Autonomous Digital Experience Management) team, you will be part of a team supporting the services that...PrincipalFull timeWork at officeVisa sponsorshipWork visa- ...join our small team focused on growth and productivity. The role involves scaling our platform and infrastructure while enhancing reliability and the overall developer experience. Ideal candidates will have strong expertise in distributed systems, cloud-native...SuggestedRemote job
- ...situated within the S3 Organization, you will serve as a Principal Software Development Engineer dedicated to the development of a novel aircraft... ...coding standards, and requisite tools to maintain a highly reliable software development environment. You will provide technical...Principal
- The Role We're looking for a Senior Site Reliability Engineer to own the reliability, scalability, and operational excellence of the production systems that power Nectar's platform. We run high-volume data ingestion pipelines and real-time AI agents on top of a fast-growing...
$140k - $220k
About the Job You’ll own reliability and operational excellence for Pylon’s production systems. This means designing and implementing... ...scale as we grow. You’ll build tooling that makes the entire engineering team more effective, establish on‑call rotations and runbooks...$181k - $197k
Senior SRE Palo Alto, CA • Engineering • Hybrid • Full-time Founded by a team of ex-Apple engineers, Instrumental provides a collection... ...on, and measuring KPIs to ensure ongoing performance, reliability and efficiency. Network/application security and compliance...Full time- ATX Venture Partners seeks a Principal Engineer to drive technology initiatives and create scalable solutions. You'll develop systems in a highly collaborative environment, utilizing both front-end and back-end technologies, particularly in AI domains. The ideal candidate...Principal
- ...Infrastructure Footprint: Global production infrastructure across AWS, South America, and Europe Role Overview Seeking a Senior Site Reliability Engineer / DevOps Engineer to design, scale, and operate highly available global infrastructure supporting production systems...
$180k - $260k
...facilitating effortless integration into customers’ logistics operations. About the role We are seeking an experienced Senior/Staff Site Reliability Engineer to support the operation, monitoring, and scaling of our growing fleet of autonomous vehicles. In this role, you will work...Odd jobWork at officeRemote work- ...Principal Software Engineer About the Team: This role is for the Cloud Engineering team within Cornerstone. The team is responsible for creating and managing Cornerstone’s cloud infrastructure and related DevOps tooling automation. As a Principal Software Engineer you...Principal
- ...SPACE EXPLORATION TECHNOLOGIES CORP seeks a Principal Wi‑Fi Software Engineer for their Starlink consumer product line. This role will drive Wi‑Fi software development and roadmap, collaborating with engineers to enhance customer experience. The ideal candidate will...Principal
- ...technologies. Our mission is to double America’s compute capacity without building new data centers. We are seeking a skilled Site Reliability Engineer to join our growing team. The ideal candidate will help ensure the reliability, scalability, and performance of our hybrid...Work at officeWeekend work
$86.33k - $191.9k
...guardrails to make going fast also going safely. Identifying reliability anti-patterns and solving them systemically . You dive deep into... ...of AI‑assisted developer tools and platforms to increase engineering productivity, enforce code quality standards, and enable real...Local areaFlexible hours- Site Reliability Engineer Onsite- Bay Area, CA Skills Relevant Skills and Experience What You’ll Do (Day-to-Day) Own and manage our cloud infrastructure (GCP or AWS, on-prem). Build, maintain, and optimize Kubernetes clusters (including GPU-backed clusters). Implement...
- JPMorgan Chase & Co. is seeking a Director of Site Reliability Engineering to partner with the Infrastructure Platforms and Foundational Services team in Palo Alto. This role involves guiding stakeholders through complex projects, leading the application of AI capabilities...
$180k - $360k
...knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who... ...Cybersecurity / SRE team is focused on ensuring the security and reliability of X Money. This role will primarily focus on the X Money platform...Temporary workRelocation- A leading tech recruiting firm is seeking a Site Reliability Engineer to manage and optimize cloud infrastructure primarily using GCP or AWS. The role involves maintaining high availability through Kubernetes clusters and improving CI/CD pipelines with Terraform. Ideal...
- TwinThread in Palo Alto, CA is looking for a Senior Principal Architect to spearhead the modernization of payments platforms. The ideal candidate will have over 10 years of Java development experience and strong expertise in cloud-native applications. You will work closely...Principal
- Cerebras is looking for a Senior Site Reliability Engineer to join their Infrastructure team in Palo Alto, California. This role involves designing and optimizing infrastructure for distributed AI applications, contributing to the open-source Ray project, and ensuring high...
$280k - $350k
Join careers.bitkraft.vc as a Staff / Principal Platform Engineer in Mountain View, California. In this role, you will take ownership of building and scaling AI products, manage cloud infrastructure, and enhance engineering workflows. With a hybrid work model, we seek an...PrincipalRelocation- Intuit Inc. in Mountain View, California is seeking a Principal Software Engineer to lead significant technology initiatives. The role involves driving customer-oriented solutions, developing complex distributed systems, and leveraging AI technologies. The ideal candidate...Principal
$210k - $247k
...Mainspring over traditional options like engines, turbines, and fuel cells to quickly and reliably deliver local power for EV... ...delivering resilient, on-site power for commercial, industrial... ...a high-impact opportunity for a Principal Cloud Platform Software Engineer...PrincipalLocal areaRemote workFlexible hours$261.5k - $353.5k
Intuit Inc. is looking for a Principal Software Engineer in Mountain View, California, to lead the technology vision and architecture for its Fintech Risk Platform. This critical role requires expertise in engineering leadership, building high-impact distributed systems...Principal- ...Andreessen Horowitz, NEA, and Addition with $250+ million raised to date. About the role Anyscale is looking for a Senior Site Reliability Engineer to join the Infrastructure team. Anyscale aims to provide the next generation of tools and infrastructure to make...
$220.5k - $300k
A leading aerospace company in California seeks a Principal Security Software Engineer for its Starshield project. The role involves developing security-critical software and leveraging AI to enhance security measures. Candidates must have a bachelor's degree and at least...Principal- ...make a meaningful impact. Partner with an organization committed to defining the future of site reliability in the financial sector. As a Director of Site Reliability Engineering at JPMorgan Chase within the Infrastructure Platforms and Foundational Services (IPFS)...
$150k - $180k
...financial, environmental, and innovation outcomes. Role Verrus is looking for candidates to serve as software-focused Senior Site Reliability Engineer at Verrus. This is a full‑time position based out of the Mountain View, CA office. Verrus takes a very technology‑forward...Full timeWork at officeLocal areaFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Principal Site Reliability Engineer. Be the first to apply!
- principal network engineer Palo Alto, CA
- senior director engineering Palo Alto, CA
- engineering director Palo Alto, CA
- principal engineer Palo Alto, CA
- director software engineering Palo Alto, CA
- general engineer Palo Alto, CA
- senior chief engineer Palo Alto, CA
- principal developer Palo Alto, CA
- principal cloud engineer Palo Alto, CA
- senior principal engineer Palo Alto, CA

