Site Reliability Engineer

$150k - $250k

3B Staffing LLC

Site Reliability Engineer role

Only U.S. citizens (USC) or green card (GC) holders are considered at this time.

San Francisco - Local to the Bay Area only; the role is remote, but occasional in-person meetings are required.

Latest update, 03/31/2026:

The Site Reliability Engineer role is critical for us right now - we have enterprise customers with urgent reliability issues that need immediate attention. We're excited to see candidates who can jump in and own this piece of our infrastructure!

What our team says about this role

Client is looking for 2 SREs. From a financial perspective, the business has hit $7M+ in ARR, is meaningfully profitable, and is growing exponentially.

They're looking for 2 SREs with strong programming expertise and experience with large-scale systems to own reliability and performance for enterprise customers including Nvidia, Samsara, Zapier and PwC.

Avoid candidates who are too focused on CI/CD or DevOps - they need people with genuine debugging experience in production environments.

The role has a base salary of $150K - $250K + equity. They prefer on-site in San Francisco, but the search is also open to remote for strong candidates who are not based in the SF area.

"We are looking for a Site Reliability Engineer with strong programming expertise and experience with large-scale systems to own the reliability and performance for our enterprise customers including Nvidia, Samsara, Zapier and PwC.

You will work closely with our Co-founders and have a massive impact on our product and customer satisfaction."

Tech stack

Python, C, Rust, Kubernetes, FastAPI, Redis, Postgres, Prisma

Seniority

4-8+ years of experience in production or reliability engineering, with a focus on debugging and fixing system-level issues.

Work experience

Experience debugging memory leaks in a production environment.
Experience as a production engineer or reliability engineer with direct experience fixing issues rather than only reporting them.
Experience working with large-scale systems (at least 1k RPS).

Hard skills

Strong programming ability in C and Rust
Experience with PostgreSQL, Redis, Kubernetes or Prometheus/Grafana

Soft skills

Excited to work at an early-stage startup and willing to work ~60 hours / week.

What you will do:

Work directly with enterprise customers to debug and resolve production issues.
Own reliability and performance, with a focus on debugging memory leaks, connection pool issues, and other critical bugs.
Proactively improve the overall reliability of the system to prevent future issues.
Profile systems, run benchmarks, and work to improve latency and throughput.
Collaborate in a fast-paced startup environment.

Role Details

Title: Site Reliability Engineer
Core responsibilities include owning product reliability and performance, debugging memory leaks, and working closely with enterprise customers.
Reports to the co-founder and collaborates with the entire 5-person company.

Candidate Requirements

Must have experience with large-scale systems, ideally from big tech companies like Meta, Amazon, Microsoft, etc.
Strong debugging skills, particularly with memory leaks, are essential.
Programming proficiency in C, Rust, and Python is required.
Looking for candidates with 4+ years of experience, but open to more senior candidates if they fit the role.

Company Context

Client is a profitable AI company with $7 million in ARR, used by companies like Netflix, NASA, and Nvidia.
The company is a small, dynamic team focusing on open-source AI gateways.

Compensation and Logistics

Salary range set at $200,000 to $250,000.
Remote work is possible, but Bay Area candidates preferred for occasional on-site work.

Timeline and Urgency

Hiring is urgent due to customer issues and potential churn.
Interview process includes recruiter screen, 30-minute call, technical round, and on-site session.

Pain Points

Current customer issues need immediate attention to prevent churn.
Lack of dedicated personnel focusing solely on product reliability.

Ideal Candidate Profile

Preferred from big tech companies with experience in high-traffic environments.
Hands-on, eager to work in a startup environment, willing to handle long hours and high-pressure situations.

Vacancy posted 2 days ago
