Senior / Staff Software Engineer (Observability / SRE)
$148k - $249kGrabJobs
Waabi, founded by AI visionary Raquel Urtasun, is the leader in Physical AI. With a world-class team, we're unlocking the next era of autonomous transportation with technology that's powering commercial autonomous trucks and robotaxis. Waabi is backed by and partners with world leaders in AI, automotive, logistics, and deep tech. With offices in Toronto, San Francisco, Dallas, and Pittsburgh, Waabi is growing quickly and looking for diverse, innovative and collaborative candidates who want to impact the world in a positive way. To learn more visit: We are constantly expanding our compute footprint in the cloud, and need to expand our observability and monitoring capabilities alongside. We currently use the built in AWS monitoring tools, but this doesn’t work with our on-premise stuff and aren’t user friendly. There are a number of options out there we could deploy, but all of them require some attention and work. Even if we go a vendored route, we still need at least one person to own this area. You Will.. - Design and lead the architecture and development of Waabi’s monitoring and observability stack, used to monitor the health and performance of cloud and on-prem environments. - Develop and extend workloads and benchmarks (compute, storage, network, ML/AI) and integrate stress, chaos, and regression tests to validate hardware and platform choices. - Analyze and optimize end-to-end performance across hardware, firmware, Linux kernel, runtimes, and distributed services using advanced profiling tools (perf, eBPF, flamegraphs, tracing frameworks). - Build automation and observability tooling (Go/Python/Java, Kubernetes/Docker) for CI/CD-based performance regression detection, telemetry, alerting, and anomaly detection. - Work with client teams to support their applications’ observability requirements. - Influence system architecture and tooling decisions that improve how Waabi builds, monitors, and scales its infrastructure. - Drive execution and quality, writing design docs, setting milestones, mentoring ICs, and communicating insights and results to stakeholders and leadership. Qualifications: - 5+ years software engineering or systems/performance engineering experience (BS in CS/EE or related), with demonstrated end-to-end ownership of complex projects. - Proficient in at least one of: Python, Rust, C/C++; strong CS fundamentals and system design skills. - Hands-on with Linux internals (CPU scheduling, memory, I/O, networking) and perf tooling (perf, eBPF, flamegraphs, tracing frameworks). - Experience with Kubernetes, microservices, and distributed systems; comfort building production services and pipelines. - Proven track record of clear communication, writing design docs, and leading cross-functional efforts. Bonus: - Experience deploying and managing observability platforms (OpenTelemetry, Grafana OSS). - Performance tuning for databases/streaming/batch/ML platforms; GPU/xPU or Arm performance exposure. - Experience tuning stream processing, batch or ML platforms (e.g. Argo Workflows, PyTorch). - Familiarity with microservices debugging and distributed tracing (OpenTelemetry, Prometheus). The US yearly salary range for this role is: $148,000 - $249,000 USD in addition to competitive perks & benefits. Waabi (US) Inc.’s yearly salary ranges are determined based on several factors in accordance with the Company’s compensation practices. The salary base range is reflective of the minimum and maximum target for new hire salaries for the position across all US locations. Note: The Company provides additional compensation for employees in this role, including equity incentive awards and an annual performance bonus. Perks/Benefits: - Competitive compensation and equity awards. - Health and Wellness benefits encompassing Medical, Dental and Vision coverage (for full-time employees only). - Unlimited Vacation. - Flexible hours and Work from Home support. - Daily drinks, snacks and catered meals (when in office). - Regularly scheduled team building activities and social events both on-site, off-site & virtually. - As we grow, this list continues to evolve! Waabi is a technology start-up building technologies to transform the way the world moves. Join our talented team to be a part of the future and to make an impact! Waabi is an equal opportunity employer. We celebrate diversity and are committed to creating a supportive, inclusive, and accessible workplace for all our employees. We seek applicants of all backgrounds and identities, across race, color, ethnicity, national origin or ancestry, age, citizenship, religion, sex, sexual orientation, gender identity or expression, military or veteran status, marital status, pregnancy or parental status, caregiver status, disability, or any other characteristic protected by law. We make workplace accommodations for qualified individuals with disabilities as required by applicable law. If reasonable accommodation is needed to participate in the job application or interview process please let our recruiting team know. We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
$238k - $288k
...operability - and we're hiring a founding engineer to lead our BMC firmware work. You'll set... ...Build out BMC-driven telemetry and observability - sensor, power, thermal, and RAS data flowing into Crusoe Cloud's ops and SRE stack - so the BMC layer is a first-class...SeniorTemporary work$238k - $288k
...Location Type On-site Department Cloud Engineering Crusoe builds and operates AI-first cloud... ...operations Build out BMC-driven telemetry and observability — sensor, power, thermal, and RAS data flowing into Crusoe Cloud's ops and SRE stack — so the BMC layer is a first-class...SeniorFull timeTemporary work- ...leading language learning platform is seeking an experienced SRE Engineer to ensure the reliability and resilience of their... ...Responsibilities include leading incident response, improving observability, and collaborating with various teams to enhance platform reliability...Senior
- Fieldguide is seeking a Senior Site Reliability Engineer to ensure the reliability and scalability of our production... ...standards and build robust observability practices. Candidates should have at least 5 years of experience in SRE or related fields, proficiency in operating...SeniorRemote jobFlexible hours
- Airwallex Pty Ltd. is seeking a Senior Site Reliability Engineer in San Francisco. In this role, you’ll work closely with product teams to deliver scalable... .... Candidates need a Bachelor's degree and 6+ years in SRE or DevOps roles. The position offers the opportunity to...Senior
- Lambda, a leader in AI cloud infrastructure, is seeking a Software Engineer specializing in observability platforms. The ideal candidate has over 8 years of experience, including 3+ years in Go and 5+ years practicing Site Reliability Engineering. Responsibilities include...SeniorWork at office
- ..., based in San Francisco, is looking for a Staff Site Reliability Engineer to lead the reliability, scalability, and observability strategies across their platform. The ideal... ...candidate will have over 10 years of experience in software engineering with a strong focus on...SeniorFlexible hours
$279.2k - $390.9k
...generation ML Indexing & Retrieval engine, integrating capabilities... .... Define best practices for observability, reliability, and operational... ...adopting robust DevOps and SRE principles. Collaborate with... ...10+ years of experience in software engineering, specializing in...SeniorFor contractorsWork experience placementRemote workFlexible hours$200k - $230k
...as the technology evolves. AI experience requirements vary by role and will be assessed during the interview process. Staff Observability Engineer Gusto's Reliability Engineering team enables our product teams to build impactful products by building secure,...Full timeWork at officeLocal areaRemote work2 days per week3 days per week- ...About the Role We're looking for a Senior Software Engineer to help build and evolve our... ...from CI/CD and release automation to observability standards , platform tooling , and... ...devtools / infrastructure (or adjacent SRE/release engineering). ~ Strong coding...SeniorLocal area
- ...seeking a Sr. Site Reliability Engineer to join our team and run... ...They enjoy building testing and observability capabilities that will accelerate... ...with a solid foundation in software engineering, particularly in... ...processes. DevOps Engineer/SRE Transitioning to Blockchain An...SeniorRemote job
- CloudDevs: Senior Web site Reliability Engineer (SRE) CloudDevs works with fast-moving, venture-backed startups... ...and bettering how groups ship software program, you’ll match proper in.... ...system reliability, efficiency, and observability. Outline and monitor SLIs, SLOs,...Senior
$190k - $230k
...Senior Software Engineer (Full Stack / Product Engineering) Location: San Francisco, NYC, Austin, or Remote (North America) Company Stage... ...APIs, and frontend surfaces Ensure systems are reliable, observable, and scalable under real-world operational demands...SeniorWork at officeRemote workFlexible hours$281k - $356k
...across 15+ U.S. states. This Senior Tech Lead role will lead... ...temporary mitigations and permanent software fixes, as well as preventing... ...with Data Science, Systems Engineering and operations teams to... ...significantly contributing to SRE or operations-focused engineering...SeniorPermanent employmentFull timeTemporary workRemote work$230k - $250k
...business. Why We Need You: We are looking for a seasoned Senior Staff Software Engineer to architect, lead, and drive strategic initiatives for... ...understanding of Agile methodologies, CI/CD pipelines, observability, and monitoring tools. Outstanding leadership,...SeniorWork experience placementHome office$143.3k - $266.13k
...Senior Staff Software Engineer Job Locations US-CA-San Francisco - Remote Job ID 2026-5810 Name Linked Remote: San... ...turn it into robust, documented system architecture. Observability & Hardening: Experience using Prometheus and OpenTelemetry...SeniorFull timeLocal areaRemote work- ...Senior Staff Software Engineer San Francisco, CA | New York City, NY | Seattle, WA About Anthropic Anthropic's mission is to create reliable... ...systems in place (performance budgets, quality gates, observability) that let a team move fast without breaking things....SeniorWork at officeVisa sponsorshipFlexible hours
- ...world. The Role: We are looking for an experienced Senior Staff Software Engineer to join our Builder Tools engineering organization with a... ...the SDLC including plan, code, test, build, deploy, observe and remediate. Innovate - Collaborate with cross-functional...SeniorRemote work
$189k - $236k
...Senior Staff Software Engineer - Pricing and Packaging San Francisco, CA At Gusto, we're on a mission to grow the small business economy... ...for AI-augmented development. Own and improve platform observability, reliability, and performance: define and track SLOs,...SeniorFull timeWork at officeLocal areaRemote work2 days per week3 days per week$231k - $300k
...potential. About the Team: Quizlet's Engineering organization builds the core... ...About the Role: We're looking for a Senior Staff Engineer to lead the technical design,... ...that are scalable, reliable, secure, and observable-ensuring high availability, low latency...SeniorWork at office3 days per week$150k - $260k
...Staff/Senior/Principal Software Engineer (Elixir/AI Focus) Location: Remote (with occasional travel) Base Location: Salt Lake City, UT, US Employment... ...Wallaby for end-to-end user flows. Utilize advanced observability tools (GCP logging, AppSignal, Prometheus, Grafana)...SeniorFull timeContract workRemote work$180k - $220k
...About the Role We're looking for a Senior/Staff Backend Engineer to architect and build large scale... ...and explicit guarantees - embedding observability and fault tolerance into their... ...into clear, maintainable, and reliable software. Strong backend engineer. Deep Python...SeniorFull timeWork at officeShift work$160k - $210k
...ownership of Zip’s Kubernetes platform, observability and deployment tooling. You will... ...lead in SF for the broader Foundations Engineering Group. Your Role You will join a super... ...end to end Qualifications 6+ years of software engineering experience in infrastructures...Senior- ...us and our community. Trust Engineering is responsible for the technology... ...You Will Make As a senior technical individual contributor... ...contributors at Airbnb are Software Engineers which means we expect... ...building and improving observability, SLOs, incident response, and...SeniorWork experience placement
- ...We’re looking for an experienced senior/staff engineer (5+ years of experience) who is product... ...scalability, and performance. Develop observability and metrics to enable smooth... ...you are ~5+ years of experience in software development, with a strong background...SeniorWork from home
$190k - $290k
...Adyen, everything we do is engineered for ambition. For our... .... Customer Developer Observability Team We believe that our... ...We are looking for a Software Engineer to join the team, and... ...Currently working as a Senior Software Engineer or at a similar...SeniorH1bWork at officeVisa sponsorshipFlexible hoursShift work$169k - $225k
...Senior Staff Software Engineer As a Senior Staff Software Engineer, you will play a key role in designing and developing scalable, high-throughput... ..., and DataOps teams to embed data governance, observability, and automation into the core of our data ingestion process...SeniorFull timeRemote work$150k - $200k
...projects. Amperesand is building hardware and software that rewrites this broken power... ...pipelines including data ingestion, feature engineering, model training, inference, deployment,... ...with strong focus on reliability, observability, retraining, and explainability....SeniorTemporary workWork experience placementLocal areaShift work$300 per month
...Staff Software Engineer Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated... ...Engineer to lead the architecture and evolution of Crusoe's observability platform at scale. In this role, you will define and drive...Temporary work$250k - $400k
...Senior/Staff Software Engineer, Developer Platform Title of Role: Senior/Staff Software Engineer, Developer Platform Location: San... ..., ensuring high reliability and scalability. Develop observability frameworks to monitor system performance through metrics...SeniorWork at office
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior / Staff Software Engineer (Observability / SRE). Be the first to apply!
- senior development executive San Francisco, CA
- senior technical manager San Francisco, CA
- senior procurement specialist San Francisco, CA
- senior software development engineer in test San Francisco, CA
- senior manager data science San Francisco, CA
- senior platform engineer San Francisco, CA
- senior procurement San Francisco, CA
- senior director product management San Francisco, CA
- senior cost manager San Francisco, CA
- senior compliance officer San Francisco, CA


