
Senior Software Engineer - Application Reliability, Hybrid

$199.7k - $254.6k

Cisco Systems, Inc.

The application window is expected to close on: 06/20/2026

Job posting may be removed earlier if the position is filled or if a sufficient number of applications are received.

This position is based in San Jose, CA or North Carolina and operates under a hybrid work model.

Meet the Team


Join Cisco's Enterprise AI team, the core group enabling Generative AI powered experiences across Cisco. Our mission is to build secure, scalable AI platforms that empower teams to safely develop, deploy, and operationalize AI-powered solutions. We operate at the intersection of applied AI, cloud infrastructure and security - partnering across engineering, security, compliance, and product teams to bring trusted AI to life at enterprise scale.


We are a fast-growing, highly collaborative team of platform engineers, AI engineers, and data scientists who value technical depth, ownership, and pragmatic execution. What makes this team exciting is the opportunity to define how secure Generative AI is built and governed inside a global technology leader.

As a Senior Software Engineer in Application Reliability, you will own the reliability of our AI-powered applications and features from the user's perspective.

While our infrastructure SRE team ensures the platform is healthy, your focus will be on feature uptime, usage trends, automated issue identification, and self-healing remediation at the application layer. You will build LangGraph-based agents for automated diagnostics, Looker dashboards for observability, and evaluation harnesses for agent quality - all powered by BigQuery, BigTable, and Python. You will partner closely with application developers, data engineers, and infrastructure SREs to ensure our APIs, RAG systems, agents, and user-facing features are reliable, observable, and continuously improving.

Your Impact

  • Define, implement, and enforce feature-level SLIs, SLOs, and error budgets for APIs, RAG systems, AI agents, and user-facing applications.

  • Build and maintain application observability systems using Looker dashboards on BigQuery and BigTable - providing real-time visibility into feature health, error patterns, and usage trends for developers, PMs, and leadership.

  • Design and build LangGraph-based agents for automated issue identification and remediation: anomaly detection on BQ logs, root cause diagnosis, auto-rollback, feature flag kill switches, and self-healing workflows.

  • Develop agent evaluation harnesses to benchmark agent performance, test multi-step workflows, handle non-deterministic outputs, and run regression testing as agents evolve.

  • Write complex SQL (BigQuery) for usage trend analysis, anomaly detection, and operational analytics; design BQ table schemas optimized for observability and debugging.

  • Analyze application usage trends and adoption metrics to proactively identify reliability risks, capacity needs, and degraded user experiences before they become incidents.

  • Partner with application development teams to embed reliability practices into the development lifecycle: deployment safety (canary, progressive rollout), structured logging standards, and distributed tracing.

  • Lead application-level incident response, root cause analysis, and blameless postmortems focused on feature impact rather than infrastructure symptoms.

  • Build Python-based tooling and automation to reduce mean time to detect (MTTD) and mean time to resolve (MTTR) for application-layer issues.

  • Stay current with the rapidly evolving AI landscape (new frameworks, tools, and paradigms) and apply emerging techniques to improve platform reliability and developer productivity.

Minimum Qualifications

  • 10+ years of experience in software engineering with significant focus on reliability, observability, or production operations; Bachelor's or Master's Degree in Computer Science, Engineering, or a related technical discipline.

  • Strong Python development skills, with experience building production tooling, automation, and agent-based systems.

  • Production GCP experience - deploying and managing applications on GKE (Kubernetes), deep SQL expertise with BigQuery (complex queries, window functions, schema design, cost optimization), and hands-on experience with BigTable (or equivalent) for high-throughput operational data.

  • Proven experience designing and operating application-level SLI/SLO frameworks, burn-rate alerting, and error budget policies.

  • Strong debugging skills at the application layer - distributed tracing, profiling, structured log analysis, and dependency mapping.

Preferred Qualifications

  • Experience building agent evaluation harnesses (benchmarking, regression testing, guardrail validation for AI agents).

  • Familiarity with A2A protocols, streaming architectures, and event-driven systems.

  • Experience with deployment safety patterns: feature flags, canary deployments, progressive rollouts, and automated rollback.

  • Experience with GCP observability services (Cloud Logging, Cloud Trace, Cloud Monitoring).

  • Exposure to AIOps concepts: ML-driven anomaly detection, automated root cause analysis, intelligent alerting.

  • Experience driving reliability culture across engineering teams - SLO adoption, postmortem processes, and reliability reviews.

  • Active engagement with the evolving AI ecosystem; awareness of emerging tools and frameworks.

  • Hands-on experience with GenAI application development: LangGraph, agent engineering, prompt design, and agentic workflows.

  • Experience building Looker dashboards and LookML models for operational observability.

Why Cisco?

At Cisco, we're revolutionizing how data and infrastructure connect and protect organizations in the AI era - and beyond. We've been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the physical and digital worlds. These solutions provide customers with unparalleled security, visibility, and insights across the entire digital footprint.

Fueled by the depth and breadth of our technology, we experiment and create meaningful solutions. Add to that our worldwide network of doers and experts, and you'll see that the opportunities to grow and build are limitless. We work as a team, collaborating with empathy to make really big things happen on a global scale. Because our solutions are everywhere, our impact is everywhere.

We are Cisco, and our power starts with you.

Message to applicants applying to work in the U.S. and/or Canada:

The starting salary range posted for this position is $199,700.00 to $254,600.00 and reflects the projected salary range for new hires in this position in U.S. and/or Canada locations, not including incentive compensation*, equity, or benefits.

Individual pay is determined by the candidate's hiring location, market conditions, job-related skillset, experience, qualifications, education, certifications, and/or training. The full salary range for certain locations is listed below. For locations not listed below, the recruiter can share more details about compensation for the role in your location during the hiring process.

U.S. employees are offered benefits, subject to Cisco's plan eligibility rules, which include medical, dental and vision insurance, a 401(k) plan with a Cisco matching contribution, paid parental leave, short and long-term disability coverage, and basic life insurance. Please see the Cisco careers site to discover more benefits and perks. Employees may be eligible to receive grants of Cisco restricted stock units, which vest following continued employment with Cisco for defined periods of time.

U.S. employees are eligible for paid time away as described below, subject to Cisco's policies:

  • 10 paid holidays per full calendar year, plus 1 floating holiday for non-exempt employees

  • 1 paid day off for employee's birthday, paid year-end holiday shutdown, and 4 paid days off for personal wellness determined by Cisco

  • Non-exempt employees** receive 16 days of paid vacation time per full calendar year, accrued at a rate of 4.92 hours per pay period for full-time employees

  • Exempt employees participate in Cisco's flexible vacation time off program, which has no defined limit on how much vacation time eligible employees may use (subject to availability and some business limitations)

  • 80 hours of sick time off provided on hire date and each January 1st thereafter, and up to 80 hours of unused sick time carried forward from one calendar year to the next

  • Additional paid time away may be requested to deal with critical or emergency issues for family members

  • Optional 10 paid days per full calendar year to volunteer

For non-sales roles, employees are also eligible to earn annual bonuses subject to Cisco's policies.

Employees on sales plans earn performance-based incentive pay on top of their base salary, which is split between quota and non-quota components, subject to the applicable Cisco plan. For quota-based incentive pay, Cisco typically pays as follows:

  • 0.75% of incentive target for each 1% of revenue attainment up to 50% of quota;

  • 1.5% of incentive target for each 1% of attainment between 50% and 75%;

  • 1% of incentive target for each 1% of attainment between 75% and 100%; and

  • Once performance exceeds 100% attainment, incentive rates are at or above 1% for each 1% of attainment with no cap on incentive compensation.
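As a rough illustration of how the tiers above combine, the sketch below computes the quota-based payout as a percentage of incentive target. It assumes the tiers apply marginally (each rate covers only the attainment that falls inside its band), and it treats the rate above 100% as a parameter defaulting to the stated floor of 1%. The function name and the default are illustrative only and are not taken from any Cisco plan document.

```python
def quota_incentive_pct(attainment_pct: float, above_quota_rate: float = 1.0) -> float:
    """Payout as a percentage of incentive target (illustrative sketch).

    Assumes marginal tiers: each rate applies only to the attainment
    points that fall inside its band.
    """
    if attainment_pct < 0:
        raise ValueError("attainment_pct must be non-negative")

    # (upper bound of band in % attainment, % of target paid per 1% attained)
    tiers = [(50.0, 0.75), (75.0, 1.5), (100.0, 1.0)]

    payout = 0.0
    lower = 0.0
    for upper, rate in tiers:
        if attainment_pct <= lower:
            break
        payout += (min(attainment_pct, upper) - lower) * rate
        lower = upper

    # Above 100% the posting only says "at or above 1%"; the actual rate
    # is plan-specific, so it is left as an assumption here.
    if attainment_pct > 100.0:
        payout += (attainment_pct - 100.0) * above_quota_rate

    return payout


if __name__ == "__main__":
    for attainment in (50, 75, 100):
        print(attainment, quota_incentive_pct(attainment))
```

Under these assumptions the three bands contribute 37.5%, 37.5%, and 25% of target, so 100% attainment pays exactly 100% of the incentive target.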

For non-quota-based sales performance elements such as strategic sales objectives, Cisco may pay 0% up to 125% of target. Cisco sales plans do not have a minimum threshold of performance for sales incentive compensation to be paid.

The applicable full salary ranges for this position, by specific state, are listed below:

New York City Metro Area:

$199,700.00 - $292,800.00

Non-Metro New York state & Washington state:

$174,500.00 - $260,500.00

* For quota-based sales roles on Cisco's sales plan, the ranges provided in this posting include base pay and sales target incentive compensation combined.

** Employees in Illinois, whether exempt or non-exempt, will participate in a unique time off program to meet local requirements.

Vacancy posted 10 days ago