Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Senior / Staff Software Engineer (Observability / SRE)

$148k - $249k

GrabJobs

Waabi, founded by AI visionary Raquel Urtasun, is the leader in Physical AI. With a world-class team, we're unlocking the next era of autonomous transportation with technology that's powering commercial autonomous trucks and robotaxis. Waabi is backed by and partners with world leaders in AI, automotive, logistics, and deep tech. With offices in Toronto, San Francisco, Dallas, and Pittsburgh, Waabi is growing quickly and looking for diverse, innovative and collaborative candidates who want to impact the world in a positive way. To learn more visit: We are constantly expanding our compute footprint in the cloud, and need to expand our observability and monitoring capabilities alongside. We currently use the built in AWS monitoring tools, but this doesn’t work with our on-premise stuff and aren’t user friendly. There are a number of options out there we could deploy, but all of them require some attention and work. Even if we go a vendored route, we still need at least one person to own this area. You Will.. - Design and lead the architecture and development of Waabi’s monitoring and observability stack, used to monitor the health and performance of cloud and on-prem environments. - Develop and extend workloads and benchmarks (compute, storage, network, ML/AI) and integrate stress, chaos, and regression tests to validate hardware and platform choices. - Analyze and optimize end-to-end performance across hardware, firmware, Linux kernel, runtimes, and distributed services using advanced profiling tools (perf, eBPF, flamegraphs, tracing frameworks). - Build automation and observability tooling (Go/Python/Java, Kubernetes/Docker) for CI/CD-based performance regression detection, telemetry, alerting, and anomaly detection. - Work with client teams to support their applications’ observability requirements. - Influence system architecture and tooling decisions that improve how Waabi builds, monitors, and scales its infrastructure. - Drive execution and quality, writing design docs, setting milestones, mentoring ICs, and communicating insights and results to stakeholders and leadership. Qualifications: - 5+ years software engineering or systems/performance engineering experience (BS in CS/EE or related), with demonstrated end-to-end ownership of complex projects. - Proficient in at least one of: Python, Rust, C/C++; strong CS fundamentals and system design skills. - Hands-on with Linux internals (CPU scheduling, memory, I/O, networking) and perf tooling (perf, eBPF, flamegraphs, tracing frameworks). - Experience with Kubernetes, microservices, and distributed systems; comfort building production services and pipelines. - Proven track record of clear communication, writing design docs, and leading cross-functional efforts. Bonus: - Experience deploying and managing observability platforms (OpenTelemetry, Grafana OSS). - Performance tuning for databases/streaming/batch/ML platforms; GPU/xPU or Arm performance exposure. - Experience tuning stream processing, batch or ML platforms (e.g. Argo Workflows, PyTorch). - Familiarity with microservices debugging and distributed tracing (OpenTelemetry, Prometheus). The US yearly salary range for this role is: $148,000 - $249,000 USD in addition to competitive perks & benefits. Waabi (US) Inc.’s yearly salary ranges are determined based on several factors in accordance with the Company’s compensation practices. The salary base range is reflective of the minimum and maximum target for new hire salaries for the position across all US locations. Note: The Company provides additional compensation for employees in this role, including equity incentive awards and an annual performance bonus. Perks/Benefits: - Competitive compensation and equity awards. - Health and Wellness benefits encompassing Medical, Dental and Vision coverage (for full-time employees only). - Unlimited Vacation. - Flexible hours and Work from Home support. - Daily drinks, snacks and catered meals (when in office). - Regularly scheduled team building activities and social events both on-site, off-site & virtually. - As we grow, this list continues to evolve! Waabi is a technology start-up building technologies to transform the way the world moves. Join our talented team to be a part of the future and to make an impact! Waabi is an equal opportunity employer. We celebrate diversity and are committed to creating a supportive, inclusive, and accessible workplace for all our employees. We seek applicants of all backgrounds and identities, across race, color, ethnicity, national origin or ancestry, age, citizenship, religion, sex, sexual orientation, gender identity or expression, military or veteran status, marital status, pregnancy or parental status, caregiver status, disability, or any other characteristic protected by law. We make workplace accommodations for qualified individuals with disabilities as required by applicable law. If reasonable accommodation is needed to participate in the job application or interview process please let our recruiting team know. We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the Senior / Staff Software Engineer (Observability / SRE) in San Francisco, CA vacancy
  • $238k - $288k

     ...operability - and we're hiring a founding engineer to lead our BMC firmware work. You'll set...  ...Build out BMC-driven telemetry and observability - sensor, power, thermal, and RAS data flowing into Crusoe Cloud's ops and SRE stack - so the BMC layer is a first-class... 
    Senior
    Temporary work

    Crusoe

    San Francisco, CA
    3 days ago
  • $238k - $288k

     ...Location Type On-site Department Cloud Engineering Crusoe builds and operates AI-first cloud...  ...operations Build out BMC-driven telemetry and observability — sensor, power, thermal, and RAS data flowing into Crusoe Cloud's ops and SRE stack — so the BMC layer is a first-class... 
    Senior
    Full time
    Temporary work

    ProducePay

    San Francisco, CA
    4 days ago
  •  ...leading language learning platform is seeking an experienced SRE Engineer to ensure the reliability and resilience of their...  ...Responsibilities include leading incident response, improving observability, and collaborating with various teams to enhance platform reliability... 
    Senior

    Speak

    San Francisco, CA
    2 days ago
  • Fieldguide is seeking a Senior Site Reliability Engineer to ensure the reliability and scalability of our production...  ...standards and build robust observability practices. Candidates should have at least 5 years of experience in SRE or related fields, proficiency in operating... 
    Senior
    Remote job
    Flexible hours

    Fieldguide

    San Francisco, CA
    2 days ago
  • Airwallex Pty Ltd. is seeking a Senior Site Reliability Engineer in San Francisco. In this role, you’ll work closely with product teams to deliver scalable...  .... Candidates need a Bachelor's degree and 6+ years in SRE or DevOps roles. The position offers the opportunity to... 
    Senior

    Airwallex Pty Ltd.

    San Francisco, CA
    3 days ago
  • Lambda, a leader in AI cloud infrastructure, is seeking a Software Engineer specializing in observability platforms. The ideal candidate has over 8 years of experience, including 3+ years in Go and 5+ years practicing Site Reliability Engineering. Responsibilities include... 
    Senior
    Work at office

    Lambda

    San Francisco, CA
    3 days ago
  •  ..., based in San Francisco, is looking for a Staff Site Reliability Engineer to lead the reliability, scalability, and observability strategies across their platform. The ideal...  ...candidate will have over 10 years of experience in software engineering with a strong focus on... 
    Senior
    Flexible hours

    Fieldguide

    San Francisco, CA
    2 days ago
  • $279.2k - $390.9k

     ...generation ML Indexing & Retrieval engine, integrating capabilities...  .... Define best practices for observability, reliability, and operational...  ...adopting robust DevOps and SRE principles. Collaborate with...  ...10+ years of experience in software engineering, specializing in... 
    Senior
    For contractors
    Work experience placement
    Remote work
    Flexible hours

    Tensec

    San Francisco, CA
    3 days ago
  • $200k - $230k

     ...as the technology evolves. AI experience requirements vary by role and will be assessed during the interview process. Staff Observability Engineer Gusto's Reliability Engineering team enables our product teams to build impactful products by building secure,... 
    Full time
    Work at office
    Local area
    Remote work
    2 days per week
    3 days per week

    Gusto

    San Francisco, CA
    2 days ago
  •  ...About the Role We're looking for a Senior Software Engineer to help build and evolve our...  ...from CI/CD and release automation to observability standards , platform tooling , and...  ...devtools / infrastructure (or adjacent SRE/release engineering). ~ Strong coding... 
    Senior
    Local area

    Dilectus Workforce Solutions

    San Francisco, CA
    29 days ago
  •  ...seeking a Sr. Site Reliability Engineer to join our team and run...  ...They enjoy building testing and observability capabilities that will accelerate...  ...with a solid foundation in software engineering, particularly in...  ...processes. DevOps Engineer/SRE Transitioning to Blockchain An... 
    Senior
    Remote job

    Blockchain Works

    San Francisco, CA
    1 day ago
  • CloudDevs: Senior Web site Reliability Engineer (SRE) CloudDevs works with fast-moving, venture-backed startups...  ...and bettering how groups ship software program, you’ll match proper in....  ...system reliability, efficiency, and observability. Outline and monitor SLIs, SLOs,... 
    Senior

    The10minutecareersolution

    San Francisco, CA
    1 day ago
  • $190k - $230k

     ...Senior Software Engineer (Full Stack / Product Engineering) Location: San Francisco, NYC, Austin, or Remote (North America) Company Stage...  ...APIs, and frontend surfaces Ensure systems are reliable, observable, and scalable under real-world operational demands... 
    Senior
    Work at office
    Remote work
    Flexible hours

    Recruiting from Scratch

    San Francisco, CA
    1 day ago
  • $281k - $356k

     ...across 15+ U.S. states. This Senior Tech Lead role will lead...  ...temporary mitigations and permanent software fixes, as well as preventing...  ...with Data Science, Systems Engineering and operations teams to...  ...significantly contributing to SRE or operations-focused engineering... 
    Senior
    Permanent employment
    Full time
    Temporary work
    Remote work

    Waymo

    San Francisco, CA
    3 days ago
  • $230k - $250k

     ...business. Why We Need You: We are looking for a seasoned Senior Staff Software Engineer to architect, lead, and drive strategic initiatives for...  ...understanding of Agile methodologies, CI/CD pipelines, observability, and monitoring tools. Outstanding leadership,... 
    Senior
    Work experience placement
    Home office

    Findem

    San Francisco, CA
    1 day ago
  • $143.3k - $266.13k

     ...Senior Staff Software Engineer Job Locations US-CA-San Francisco - Remote Job ID 2026-5810 Name Linked Remote: San...  ...turn it into robust, documented system architecture. Observability & Hardening: Experience using Prometheus and OpenTelemetry... 
    Senior
    Full time
    Local area
    Remote work

    DataDirect Networks Inc

    San Francisco, CA
    4 days ago
  •  ...Senior Staff Software Engineer San Francisco, CA | New York City, NY | Seattle, WA About Anthropic Anthropic's mission is to create reliable...  ...systems in place (performance budgets, quality gates, observability) that let a team move fast without breaking things.... 
    Senior
    Work at office
    Visa sponsorship
    Flexible hours

    Anthropic

    San Francisco, CA
    1 day ago
  •  ...world. The Role: We are looking for an experienced Senior Staff Software Engineer to join our Builder Tools engineering organization with a...  ...the SDLC including plan, code, test, build, deploy, observe and remediate. Innovate - Collaborate with cross-functional... 
    Senior
    Remote work

    SoFi

    San Francisco, CA
    3 days ago
  • $189k - $236k

     ...Senior Staff Software Engineer - Pricing and Packaging San Francisco, CA At Gusto, we're on a mission to grow the small business economy...  ...for AI-augmented development. Own and improve platform observability, reliability, and performance: define and track SLOs,... 
    Senior
    Full time
    Work at office
    Local area
    Remote work
    2 days per week
    3 days per week

    Gusto

    San Francisco, CA
    2 days ago
  • $231k - $300k

     ...potential. About the Team: Quizlet's Engineering organization builds the core...  ...About the Role: We're looking for a Senior Staff Engineer to lead the technical design,...  ...that are scalable, reliable, secure, and observable-ensuring high availability, low latency... 
    Senior
    Work at office
    3 days per week

    Quizlet

    San Francisco, CA
    1 day ago
  • $150k - $260k

     ...Staff/Senior/Principal Software Engineer (Elixir/AI Focus) Location: Remote (with occasional travel) Base Location: Salt Lake City, UT, US Employment...  ...Wallaby for end-to-end user flows. Utilize advanced observability tools (GCP logging, AppSignal, Prometheus, Grafana)... 
    Senior
    Full time
    Contract work
    Remote work

    GrabJobs

    San Francisco, CA
    1 day ago
  • $180k - $220k

     ...About the Role We're looking for a Senior/Staff Backend Engineer to architect and build large scale...  ...and explicit guarantees - embedding observability and fault tolerance into their...  ...into clear, maintainable, and reliable software. Strong backend engineer. Deep Python... 
    Senior
    Full time
    Work at office
    Shift work

    Actively AI

    San Francisco, CA
    1 day ago
  • $160k - $210k

     ...ownership of Zip’s Kubernetes platform, observability and deployment tooling. You will...  ...lead in SF for the broader Foundations Engineering Group. Your Role You will join a super...  ...end to end Qualifications 6+ years of software engineering experience in infrastructures... 
    Senior

    ZIP

    San Francisco, CA
    11 hours ago
  •  ...us and our community. Trust Engineering is responsible for the technology...  ...You Will Make As a senior technical individual contributor...  ...contributors at Airbnb are Software Engineers which means we expect...  ...building and improving observability, SLOs, incident response, and... 
    Senior
    Work experience placement

    airbnb, Inc.

    San Francisco, CA
    1 day ago
  •  ...We’re looking for an experienced senior/staff engineer (5+ years of experience) who is product...  ...scalability, and performance. Develop observability and metrics to enable smooth...  ...you are ~5+ years of experience in software development, with a strong background... 
    Senior
    Work from home

    LIGHTFIELD INC

    San Francisco, CA
    1 day ago
  • $190k - $290k

     ...Adyen, everything we do is engineered for ambition. For our...  .... Customer Developer Observability Team We believe that our...  ...We are looking for a Software Engineer to join the team, and...  ...Currently working as a Senior Software Engineer or at a similar... 
    Senior
    H1b
    Work at office
    Visa sponsorship
    Flexible hours
    Shift work

    Adyen

    San Francisco, CA
    3 days ago
  • $169k - $225k

     ...Senior Staff Software Engineer As a Senior Staff Software Engineer, you will play a key role in designing and developing scalable, high-throughput...  ..., and DataOps teams to embed data governance, observability, and automation into the core of our data ingestion process... 
    Senior
    Full time
    Remote work

    Intellipro Group

    San Francisco, CA
    4 days ago
  • $150k - $200k

     ...projects. Amperesand is building hardware and software that rewrites this broken power...  ...pipelines including data ingestion, feature engineering, model training, inference, deployment,...  ...with strong focus on reliability, observability, retraining, and explainability.... 
    Senior
    Temporary work
    Work experience placement
    Local area
    Shift work

    Amperesand

    San Francisco, CA
    15 hours ago
  • $300 per month

     ...Staff Software Engineer Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated...  ...Engineer to lead the architecture and evolution of Crusoe's observability platform at scale. In this role, you will define and drive... 
    Temporary work

    Crusoe

    San Francisco, CA
    2 days ago
  • $250k - $400k

     ...Senior/Staff Software Engineer, Developer Platform Title of Role: Senior/Staff Software Engineer, Developer Platform Location: San...  ..., ensuring high reliability and scalability. Develop observability frameworks to monitor system performance through metrics... 
    Senior
    Work at office

    Recruiting from Scratch

    San Francisco, CA
    2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Senior / Staff Software Engineer (Observability / SRE). Be the first to apply!