Software Engineering Manager 1 - Streaming & Cloud Platform Reliability

$155.5k - $315k

HPE

This role has been designed as ‘Onsite’ with an expectation that you will primarily work from an HPE office.

Who We Are:

Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today’s complex world. Our culture thrives on finding new and better ways to accelerate what’s next. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. If you are looking to stretch and grow your career, our culture will embrace you. Open up opportunities with HPE.

Job Description:

We’re looking for a hands‑on Software Engineering Manager to lead a small team (2–4 developers) focused on improving the reliability of Mist’s cloud platform by driving concrete postmortem action items from our incident management process.

This team owns follow‑ups from production incidents—especially those involving our streaming data pipelines (Kafka / Flink / Storm) and core APIs. You’ll work closely with senior engineers to turn incident learnings into durable engineering improvements.

This is a hybrid role requiring on‑site collaboration multiple days per week in Cupertino, California. Due to the requirements of this position, this role requires a US Citizen or Green Card holder.

What You’ll Do

Own and drive post‑incident follow‑ups from our Incident Management process, turning incident reports into design and implementation work.
Lead, mentor, and grow a 2–4 person engineering team, while contributing hands‑on code in production services.
Design, implement, and harden streaming topologies using Kafka, Storm, and/or Flink (e.g., stats, telemetry, alerts, pcaps).
Improve reliability of core APIs (REST API, WebSocket, Webhooks, etc.), including auth, rate limiting, and DR‑sensitive flows.
Enhance observability and runbooks: add metrics/alerts, define SLOs, and codify playbooks for recurring incident patterns.
Collaborate with SRE, Platform, and Data teams on DR, multi‑region, and multi‑cloud behavior (AWS, GCP, DR regions).
Ensure robust testing and deployment practices (unit/integration tests, regression tests for past incidents, safe rollout/rollback).

Experience Required for this Role

7+ years total professional software engineering experience.
2+ years in a team lead role (mentoring, performance feedback, prioritization), while remaining hands‑on technically.
5+ years building backend or distributed systems in Python, Go, or Java, with enough proficiency in at least one of these languages to lead design reviews and contribute production code.
3+ years designing, implementing, and operating distributed, event‑driven systems using:
- Kafka and at least one of Flink or Storm, or a comparable streaming framework.
3+ years building and operating RESTful APIs (service design, auth, rate limiting, client IP handling, versioning).
3+ years working with cloud‑native infrastructure:
- Kubernetes, containerized microservices, CI/CD pipelines.
3+ years with production datastores such as:
- Redis, Postgres, Cassandra/Datastax, S3/GCS, or similar distributed storage systems.
2+ years directly involved in production incident response:
- On‑call participation, postmortems, and driving remediation work through to completion.
Proven ability to debug latency, throughput, data correctness, and availability issues in streaming pipelines and/or APIs.
Experience adding or improving metrics, logging, tracing, and alerts for production services.

Preferred Qualifications

2+ years working with big‑data / analytics or ETL systems (e.g., Apache Spark, Airflow, Snowflake, or similar).
Experience with webhook or event‑delivery systems (idempotency, retries, ordering, DLQs).
Exposure to multi‑region / DR design: cross‑cloud migrations, DNS and certificate management, environment‑driven configuration.
Familiarity with DevOps practices, CI/CD automation, and service ownership.
Experience with observability stacks such as Prometheus, Grafana, Kibana/Elasticsearch.

Why This Role

Direct, visible impact on the stability and reliability of Mist’s cloud platform and AI‑driven networking products.
A focused charter with real, concrete backlogs driven by incidents—not vague “platform work.”
Close collaboration with strong senior engineers and SREs, with room to shape both technical direction and team culture.

Additional Skills:

Accountability, Action Planning, Active Learning, Active Listening, Agile Methodology, Agile Scrum Development, Analytical Thinking, Bias, Coaching, Creativity, Critical Thinking, Cross-Functional Teamwork, Data Analysis Management, Data Collection Management, Data Controls, Design, Design Thinking, Empathy, Follow-Through, Group Problem Solving, Growth Mindset, Intellectual Curiosity, Long Term Planning, Managing Ambiguity

What We Can Offer You:

Health & Wellbeing

We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial and emotional wellbeing.

Personal & Professional Development

We also invest in your career because the better you are, the better we all are. We have specific programs catered to helping you reach any career goals you have — whether you want to become a knowledge expert in your field or apply your skills to another division.

Unconditional Inclusion

We are unconditionally inclusive in the way we work and celebrate individual uniqueness. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good.

Let's Stay Connected:

Follow @HPECareers on Instagram to see the latest on people, culture and tech at HPE.

#unitedstates

Job:

Engineering

Job Level:

Manager_1

The expected salary/wage range for this position is provided below. Actual offer may vary from this range based upon geographic location, work experience, education/training, and/or skill level.
– United States of America: Annual Salary USD 155,500 - 315,000 in California
The listed salary range reflects base salary. Variable incentives may also be offered.

Information about employee benefits offered in the US can be found at

HPE is an Equal Employment Opportunity/Veterans/Disabled/LGBT employer. We do not discriminate on the basis of race, gender, or any other protected category, and all decisions we make are made on the basis of qualifications, merit, and business need. Our goal is to be one global team that is representative of our customers, in an inclusive environment where we can continue to innovate and grow together. Please click here: Equal Employment Opportunity.

Hewlett Packard Enterprise is EEO Protected Veteran/Individual with Disabilities.

HPE will comply with all applicable laws related to employer use of arrest and conviction records, including laws requiring employers to consider for employment qualified applicants with criminal histories.

No Fees Notice & Recruitment Fraud Disclaimer

It has come to HPE’s attention that there has been an increase in recruitment fraud whereby scammers impersonate HPE or HPE-authorized recruiting agencies and offer fake employment opportunities to candidates. These scammers often seek to obtain personal information or money from candidates.

Please note that Hewlett Packard Enterprise (HPE), its direct and indirect subsidiaries and affiliated companies, and its authorized recruitment agencies/vendors will never charge any candidate a registration fee, hiring fee, or any other fee in connection with its recruitment and hiring process. The credentials of any hiring agency that claims to be working with HPE for recruitment of talent should be verified by candidates and candidates shall be solely responsible to conduct such verification. Any candidate/individual who relies on the erroneous representations made by fraudulent employment agencies does so at their own risk, and HPE disclaims liability for any damages or claims that may result from any such communication.

Vacancy posted 3 days ago
