Senior Software Engineer - Application Reliability, Hybrid
$199.7k - $254.6kCisco Systems, Inc.
Job posting may be removed earlier if the position is filled or if a sufficient number of applications are received .
This position is based in San Jose, CA or North Carolina and operates under a hybrid work model.
Meet the Team
Join Cisco's Enterprise AI team, the core group enabling Generative AI powered experiences across Cisco. Our mission is to build secure, scalable AI platforms that empower teams to safely develop, deploy, and operationalize AI-powered solutions. We operate at the intersection of applied AI, cloud infrastructure and security - partnering across engineering, security, compliance, and product teams to bring trusted AI to life at an enterprise scale.
We are a fast-growing, highly collaborative team of platform engineers, AI engineers, and data scientists who value technical depth, ownership, and pragmatic execution. What makes this team exciting is the opportunity to define how secure Generative AI is built and governed inside a global technology leader.
As a Senior Software Engineer in Application Reliability, you will own the reliability of our AI-powered applications and features from the user's perspective.
While our infrastructure SRE team ensures the platform is healthy, your focus will be on feature uptime, usage trends, automated issue identification, and self-healing remediation at the application layer. You will build LangGraph-based agents for automated diagnostics, Looker dashboards for observability, and evaluation harnesses for agent quality - all powered by BigQuery, BigTable, and Python. You will partner closely with application developers, data engineers, and infrastructure SREs to ensure our APIs, RAG systems, agents, and user-facing features are reliable, observable, and continuously improving.
Your Impact
Define, implement, and enforce feature-level SLIs, SLOs, and error budgets for APIs, RAG systems, AI agents, and user-facing applications.
Build and maintain application observability systems using Looker dashboards on BigQuery and BigTable - providing real-time visibility into feature health, error patterns, and usage trends for developers, PMs, and leadership.
Design and build LangGraph-based agents for automated issue identification and remediation: anomaly detection on BQ logs, root cause diagnosis, auto-rollback, feature flag kill switches, and self-healing workflows.
Develop agent evaluation harnesses to benchmark agent performance, test multi-step workflows, handle non-deterministic outputs, and run regression testing as agents evolve.
Write complex SQL (BigQuery) for usage trend analysis, anomaly detection, and operational analytics; design BQ table schemas optimized for observability and debugging.
Analyze application usage trends and adoption metrics to proactively identify reliability risks, capacity needs, and degraded user experiences before they become incidents.
Partner with application development teams to embed reliability practices into the development lifecycle: deployment safety (canary, progressive rollout), structured logging standards, and distributed tracing.
Lead application-level incident response, root cause analysis, and blameless postmortems focused on feature impact rather than infrastructure symptoms.
Build Python-based tooling and automation to reduce mean time to detect (MTTD) and mean time to resolve (MTTR) for application-layer issues.
Stay current with the rapidly evolving AI landscape (new frameworks, tools, and paradigms) and apply emerging techniques to improve platform reliability and developer productivity.
Minimum Qualifications
10+ years of experience in software engineering with significant focus on reliability, observability, or production operations; Bachelor's or Master's Degree in Computer Science, Engineering, or a related technical discipline.
Strong Python development skills, with experience building production tooling, automation, and agent-based systems.
Production GCP experience - deploying and managing applications on GKE (Kubernetes), deep SQL expertise with BigQuery (complex queries, window functions, schema design, cost optimization), and hands-on experience with BigTable (or equivalent) for high-throughput operational data.
Proven experience designing and operating application-level SLI/SLO frameworks, burn-rate alerting, and error budget policies.
Strong debugging skills at the application layer - distributed tracing, profiling, structured log analysis, and dependency mapping.
Preferred Qualifications
Experience building agent evaluation harnesses (benchmarking, regression testing, guardrail validation for AI agents).
Familiarity with A2A protocols, streaming architectures, and event-driven systems.
Experience with deployment safety patterns: feature flags, canary deployments, progressive rollouts, and automated rollback.
Experience with GCP observability services (Cloud Logging, Cloud Trace, Cloud Monitoring).
Exposure to AIOps concepts: ML-driven anomaly detection, automated root cause analysis, intelligent alerting.
Experience driving reliability culture across engineering teams - SLO adoption, postmortem processes, and reliability reviews.
Active engagement with the evolving AI ecosystem; awareness of emerging tools and frameworks.
Hands-on experience with GenAI application development: LangGraph, agent engineering, prompt design, and agentic workflows.
Experience building Looker dashboards and Look ML models for operational observability.
Why Cisco?
At Cisco, we're revolutionizing how data and infrastructure connect and protect organizations in the AI era - and beyond. We've been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the physical and digital worlds. These solutions provide customers with unparalleled security, visibility, and insights across the entire digital footprint.
Fueled by the depth and breadth of our technology, we experiment and create meaningful solutions. Add to that our worldwide network of doers and experts, and you'll see that the opportunities to grow and build are limitless. We work as a team, collaborating with empathy to make really big things happen on a global scale. Because our solutions are everywhere, our impact is everywhere.
We are Cisco, and our power starts with you.
Message to applicants applying to work in the U.S. and/or Canada:
The starting salary range posted for this position is $199,700.00 to $254,600.00 and reflects the projected salary range for new hires in this position in U.S. and/or Canada locations, not including incentive compensation*, equity, or benefits.Individual pay is determined by the candidate's hiring location, market conditions, job-related skillset, experience, qualifications, education, certifications, and/or training. The full salary range for certain locations is listed below. For locations not listed below, the recruiter can share more details about compensation for the role in your location during the hiring process.
U.S. employees are offered benefits, subject to Cisco's plan eligibility rules, which include medical, dental and vision insurance, a 401(k) plan with a Cisco matching contribution, paid parental leave, short and long-term disability coverage, and basic life insurance. Please see the Cisco careers site to discover more benefits and perks. Employees may be eligible to receive grants of Cisco restricted stock units, which vest following continued employment with Cisco for defined periods of time.
U.S. employees are eligible for paid time away as described below, subject to Cisco's policies:
10 paid holidays per full calendar year, plus 1 floating holiday for non-exempt employees
1 paid day off for employee's birthday, paid year-end holiday shutdown, and 4 paid days off for personal wellness determined by Cisco
Non-exempt employees** receive 16 days of paid vacation time per full calendar year, accrued at rate of 4.92 hours per pay period for full-time employees
Exempt employees participate in Cisco's flexible vacation time off program, which has no defined limit on how much vacation time eligible employees may use (subject to availability and some business limitations)
80 hours of sick time off provided on hire date and each January 1st thereafter, and up to 80 hours ofunused sick timecarried forwardfrom one calendar yearto the next
Additional paid time away may be requested to deal with critical or emergency issues for family members
Optional 10 paid days per full calendar year to volunteer
For non-sales roles, employees are also eligible to earn annual bonuses subject to Cisco's policies.
Employees on sales plans earn performance-based incentive pay on top of their base salary, which is split between quota and non-quota components, subject to the applicable Cisco plan. For quota-based incentive pay, Cisco typically pays as follows:
.75% of incentive target for each 1% of revenue attainment up to 50% of quota;
1.5% of incentive target for each 1% of attainment between 50% and 75%;
1% of incentive target for each 1% of attainment between 75% and 100%; and
Once performance exceeds 100% attainment, incentive rates are at or above 1% for each 1% of attainment with no cap on incentive compensation.
For non-quota-based sales performance elements such as strategic sales objectives, Cisco may pay 0% up to 125% of target. Cisco sales plans do not have a minimum threshold of performance for sales incentive compensation to be paid.
The applicable full salary ranges for this position, by specific state, are listed below:
New York City Metro Area:
$199,700.00 - $292,800.00Non-Metro New York state & Washington state:
$174,500.00 - $260,500.00* For quota-based sales roles on Cisco's sales plan, the ranges provided in this posting include base pay and sales target incentive compensation combined.
** Employees in Illinois, whether exempt or non-exempt, will participate in a unique time off program to meet local requirements.
- ...Cisco Systems, Inc. is looking for a Senior Software Engineer focused on Application Reliability in San Jose, CA. You will define and enforce SLIs and build observability... ...with GCP and Kubernetes, contributing to a fast-paced, hybrid work environment. #J-18808-Ljbffr...ApplicationSenior
$185k
...Senior Software Security Engineer Engineering · US, San Jose · Hybrid Who We Are Spectro Cloud lets organizations around the world... ...across platform, cloud, and application layers Strengthen security... ...hardening, incident response, and reliability improvements Clearly...ApplicationSeniorWork at officeFlexible hoursShift work3 days per week$90k - $215k
...Senior Software Engineer- Observability and Reliability Platform Engineering (REMOTE) Senior Software Engineer- Observability... ...week ago Be among the first 25 applicants At GEICO, we offer a... ...experience with AWS, GCP, Azure, or hybrid data center Education ~...ApplicationSeniorHourly payFull timeWork experience placementLocal areaRemote workFlexible hours$165k - $241.4k
The application window is expected to close on: 08/04/2026 Job posting... ...received . This is a hybrid position in the Milpitas, CA... ...be part of a best-in-class Software Development team that works... ...experience in Software Development Engineering in software engineering, or...ApplicationSeniorTemporary workWork at officeLocal areaFlexible hours- ...Senior Software Engineer Java AWS Hybrid Senior Software Engineer Payments Java AWS Location: Hybrid (Multiple... ...systems Ensure performance, reliability, and low-latency transaction processing... ...building high-availability, scalable applications Familiarity with containers and...ApplicationSenior
- ...Index Engines is seeking mid to senior level Software Engineers for their San Jose, CA office. The ideal candidate... ...and maintain software for Linux applications and will work with the Support Organization... ..., 401(k), unlimited PTO, and a hybrid work schedule. #J-18808-Ljbffr...ApplicationSeniorWork at office
$140k - $215k
## Sr. Software Engineer - Falcon Fusion Product (Hybrid)Applylocations: USA - Sunnyvale, CAtime... ...Our Fusion is seeking a Senior-to-Principal (Level 7)... ...attention to performance, reliability and scalability will be... ...is 7-12) of overall applicable experience in a technical...ApplicationSeniorWork experience placementWork at officeLocal area- ...Back-End Engineer As a global leader in... ...the cloud product software engineering team,... ...Sensor. This is a hybrid role and will... ...organization and up to senior leadership. Our... ...design patterns, reliability and scaling) of... ...building cloud-deployed applications ~ BS/BE in CS...ApplicationSeniorWork at office2 days per week
$201.6k - $302k
Job Description The Role: As the Senior Engineering Manager for Hybrid Services & Reliability (HSR) within AV Core Infrastructure (ACI) at GM, you are the architect... ...estimate only. It is based on what a successful applicant might be paid in accordance with applicable state...ApplicationSeniorLocal areaRemote workWork from homeRelocationRelocation packageFlexible hours$140k - $215k
...We're seeking a highly skilled Senior Engineer to join our Falcon Risk Platform... ...our Engineers - This role is hybrid, requiring 2-3 days per week on-... ...internal and customer-facing web applications, with a focus on performance, reliability, and security. Develop and...ApplicationSeniorWork experience placementWork at officeLocal area2 days per week3 days per week$174k - $252k
Senior Software Engineer, Site Reliability Engineering X Applicants in San Francisco: Qualified applications with arrest or conviction records will be considered for employment in accordance with the San Francisco Fair Chance Ordinance for Employers and the California Fair...ApplicationSeniorFull time$140k - $215k
...the Role: This is a Software Development Engineer role on the Cloud Runtime... ...operations. This role is hybrid, requiring 2-3 days per week... ...systems and components reliability and performance through monitoring... ...for all employees and applicants for employment. The...ApplicationSeniorWork experience placementWork at officeLocal area2 days per week3 days per week$165k - $241.4k
The application window is expected to close on: 06/29/2026 Job posting... ...received . This is a hybrid position. Meet the Team... ...security - partnering across engineering, security, compliance, and product... ...standard for automation and reliability that enables our AI models...ApplicationTemporary workLocal areaFlexible hours- ...Monolithic Power Systems Inc. seeks a self-motivated senior-level engineer in San Jose, CA to drive system-level architecture and product definition for high-reliability power management solutions. This role involves collaboration with internal teams and customers, supporting...ApplicationSenior
$175k - $260k
...Australia-Employment is seeking an Applications Engineer to join their team in Santa Clara, CA. The ideal candidate will have a Master’s degree... ...260,000 per year, along with a generous bonus structure and hybrid work flexibility. You will be instrumental in recommending...ApplicationSenior- ...for Rainfall Health's new Senior Software Engineer role. Senior Software... ...Digital Health) Location: Hybrid-San Francisco Bay Area... ...role in designing secure, reliable systems that integrate with... ...What You’ll Do Platform & Application Development Design, build...ApplicationSenior
- ...firm in California is seeking a Senior/Staff Java Developer to... ...maintain large-scale cloud-based applications. The ideal candidate will... ...technologies like AWS or Azure. This hybrid role requires onsite work... ...teams to ensure high-quality software delivery. Competitive salary...ApplicationSenior
$153.2k - $234.1k
...General Motors in Sunnyvale, CA, is seeking a Senior Mobile Engineer to develop high-performance mobile applications for fleet management. This hybrid role involves collaborating with cross-functional teams to influence mobile architecture and design solutions. The ideal...ApplicationSeniorRemote work- ...Applications are still being accepted. Apply now! Job Type Full-Time Workspace Hybrid/Remote Job Description As a Senior Software Optimization Engineer, you’ll lead the design and implementation of software... ...system efficiency and reliability. Collaborate with cross-functional...ApplicationSeniorPermanent employmentFull timeContract workRemote work
- ...Cisco Systems, Inc. is looking for a skilled Software Development Engineer for a hybrid role in Milpitas, CA. You'll design and implement high-quality applications for Cisco's SDWAN management. Candidates should have a Bachelor's degree and significant experience in software...ApplicationSenior
- ...The Role Index Engines has an outstanding career... ...opportunity for mid to senior level Software Engineers for our San... ...Index Engines’ Linux application and will work closely... ...systems that are scalable, reliable, and secure Guide... ...Unlimited PTO Hybrid work schedule with WFH...ApplicationSeniorWork at officeWork from homeMonday to Friday
$152k - $241.5k
...world. We are looking for a Senior Software Engineer to join our mission to... ...business critical services and AI applications. You will be working with a... ..., crafting and building reliable distributed systems, and has... ...in a globally distributed, hybrid multi‑cloud environment (...ApplicationSenior$143k - $191k
...decisions. We’re looking for a Software Engineer to design and build highly... ...intuitive, scalable, and reliable product features. This is a... ...backend or full-stack applications ~ Proficiency in Python and... ...this role is categorized as hybrid in Santa Clara, CA The base...ApplicationSeniorWork at officeRemote workFlexible hours- ...THE ROLE: As a senior member of the LLM... ...scalability, and reliability, enabling tensor parallelism... ...of inference engines, distributed... ...TP / PP / EP (MoE) hybrid execution, including... ...Software Engineering ~ Expertise... ...will consider all applicants without regard to...ApplicationSenior
$120k - $180k
## Software Engineer, Cloud/Backend - Policy (Hybrid)Applylocations: USA - Sunnyvale, CA: USA - Redmond... ..., mentor junior and senior developers and... ...that scale cleanly and reliably, and then implementing those... ...opportunity for all employees and applicants for employment. The...ApplicationWork experience placementWork at officeLocal areaFlexible hours2 days per week3 days per week$142k - $200k
...set of hardware, software and mobile solutions... ...To Software Engineering Manager What You... ...We are seeking a Senior Software Engineer... ...operations to deliver reliable, secure, and... ...collaboration skills. Hybrid in office 3X a... ...variable pay where applicable. Actual base...ApplicationSeniorWork at office$154.42k - $235.9k
...experience that make complex systems reliable, observable, and fast. As a Senior Software Engineer, you will design and deliver... ...blocks used by AV/Robotics applications on vehicles, on benches, and in... ...eligible for relocation benefits. Hybrid Work This role is categorized...ApplicationSeniorPermanent employmentLocal areaRelocationRelocation packageFlexible hours- ...effortlessly run large‑scale ML applications, without the hassle of... ...Role We are looking for a Software Engineer to join the ML Integration... ...large‑scale ML workloads run reliably and efficiently across our... ...Location This role follows a hybrid schedule, requiring in-...ApplicationSeniorWork at officeRemote work
$179.06k - $198.95k
...Clara 2 days per week (Hybrid) Expertise coding... ...skilled and motivated engineer to design, develop, and... ...designing for scale, reliability, and operational excellence... ...to run efficiently as Software-as-a-Service (SaaS) on... ...Pursuant to Applicable State Equal Pay Transparency...ApplicationSeniorHourly payFull timeWork at office2 days per week3 days per week$154.42k - $235.9k
...experience that make complex systems reliable, observable, and fast. As a Senior Software Engineer, you will design and deliver... ...blocks used by AV/Robotics applications on vehicles, on benches, and... ...eligible for relocation benefits. Hybrid: This role is categorized...ApplicationSeniorPermanent employmentLocal areaWork from homeRelocationRelocation packageFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Software Engineer - Application Reliability, Hybrid. Be the first to apply!
- software developer internship no experience San Jose, CA
- federal - software developer San Jose, CA
- software engineer contract San Jose, CA
- part time software developer San Jose, CA
- software engineer healthcare San Jose, CA
- network software engineer San Jose, CA
- ngo software engineer San Jose, CA
- software development engineer aws San Jose, CA
- software developer internship San Jose, CA
- software developer intern San Jose, CA



