Site Reliability Engineer

$150k

VantageScore

Job Description

About The Role

We are seeking an experienced Site Reliability Engineer (SRE) with a strong focus on DevSecOps to join our growing engineering team. In this role, you will oversee and maintain the reliability, security posture, and operational hygiene of our cloud infrastructure, APIs, and software supply chain. You will drive patch management programs, harden our Cloud infrastructure, and maintain our code repositories to ensure all systems remain compliant, secure, and scalable.

This role is ideal for an engineer who thrives at the intersection of operations and security, is passionate about automation, and takes pride in keeping complex environments clean, auditable, and resilient.

Key Responsibilities

Own and execute end-to-end patch management across AWS compute resources (EC2, ECS, Lambda runtimes, EKS nodes), third-party dependencies, and OS-level packages.

Monitor, triage, and remediate vulnerabilities identified by security scanning tools (e.g., AWS Inspector, Dependabot, Security Hub, or equivalent), prioritizing by CVSS severity and business impact.

Maintain and enforce branch protection rules, secret scanning policies, and dependency update workflows across all code repositories.

Design and implement automated pipelines for continuous compliance checking, security testing (SAST/DAST/SCA), and infrastructure drift detection.

Collaborate with IT & Info-Sec SMEs on AWS IAM roles and policies, VPC configurations, Security Groups, CloudTrail, Config, and GuardDuty to ensure least-privilege access and auditability.

Collaborate with development teams to embed security controls into CI/CD pipelines (GitHub Actions, CodePipeline, or equivalent) without impeding developer velocity.

Support the reliability and availability of production APIs — including uptime monitoring, incident response, runbook creation, and post-incident reviews.

Partner with Legal and Data Governance SMEs on API access procedures and monitoring.

Define and track SLOs/SLAs for internal and external APIs; implement alerting and dashboards using observability tooling (e.g., CloudWatch, Datadog, Grafana).

Lead periodic infrastructure and dependency audits; produce clear reports on patch compliance status and open risk items for engineering and security leadership.

Maintain thorough documentation of patching schedules, runbooks, access policies, and environment configurations.

Participate in on-call rotation and contribute to a culture of continuous improvement.

Required Qualifications

Bachelor's Degree in Computer Science, Information Systems, or a related field (or equivalent practical experience).

5+ years of professional experience in a Site Reliability Engineering, Software Engineering, DevOps, or DevSecOps role.

Demonstrated expertise managing AWS environments — including EC2, Lambda, ECS/EKS, S3, RDS, IAM, VPC, CloudTrail, Config, and GuardDuty.

Experience with various cloud environments: AWS, Azure, GPC

Strong experience with GitHub administration: branch protection, Actions workflows, secret scanning, Dependabot, and code owners.

Hands-on experience with patch management and vulnerability remediation at scale, including OS-level patching (Amazon Linux, Ubuntu) and dependency lifecycle management.

Proficiency with infrastructure-as-code tools (Terraform, CloudFormation, or AWS CDK).

Experience integrating security tooling (SAST, DAST, SCA, container scanning) into CI/CD pipelines.

Solid understanding of API reliability patterns: health checks, rate limiting, circuit breakers, and observability.

Familiarity with compliance frameworks relevant to cloud environments (SOC 2, CIS Benchmarks, NIST CSF).

Strong scripting skills in Python, Bash, or similar for automation and tooling.

Excellent communication skills and ability to translate technical risk for non-technical stakeholders.

Build observation (logging, metrics, alerting) systems to make sure system works well, and develop response plans.

Preferred Qualifications

AWS certifications (e.g., AWS Certified Security – Specialty, AWS Certified DevOps Engineer – Professional).

Experience with container security and Kubernetes (EKS) hardening.

Familiarity with CSPM tools (e.g., Wiz, Prisma Cloud, AWS Security Hub) for continuous cloud posture management.

Experience managing API gateways (AWS API Gateway, Kong, or similar) including security policy enforcement.

Exposure to secrets management solutions (AWS Secrets Manager, HashiCorp Vault).

Knowledge of SBOM (Software Bill of Materials) generation and management.

Experience with incident response playbooks and tabletop exercises.
Familiarity with Agile/Scrum methodologies and cross-functional engineering teams.

Compensation

The anticipated base salary range for this position is $150,000 annually, plus eligibility for a 15% annual performance bonus. Actual compensation will be determined based on several factors, including skills, experience, education, certifications, and geographic location.

In addition to base salary and bonus eligibility, we offer a competitive benefits package, including medical, dental, vision, 401(k), paid time off, and other employee benefits.

Apply

Vacancy posted 27 days ago

Similar jobs that could be interesting for youBased on the Site Reliability Engineer in San Francisco, CA vacancy

Sr. Site Reliability Engineer
...Sr. Site Reliability Engineer Job type: Full Time · Department: Platform · Work type: On-Site San Francisco, California, United States (Remote) Optura is healthcare’s AI orchestration platform. We help healthcare organizations transform disconnected AI pilots into a unified...
Suggested
Full time
Remote work
Neara
San Francisco, CA
4 days ago
Senior Site Reliability Engineer
...US Corp. is seeking a Lead Site Reliability Engineer to spearhead our mission of delivering highly available and performant systems. With an average of over 12 years of industry experience, the successful candidate will bridge the gap between software development and systems...
Suggested
Axiom Pursuits
San Francisco, CA
1 day ago
Site Reliability Engineer
...the company | Site Reliability Engineer | San Francisco, CA (Hybrid) | Full-time the company is a no-code data workflow automation tool that helps operations teams move, transform, and automate their data without writing code. LLMs are a core part of our product — we use...
Suggested
Full time
United States Digital Space LLC
San Francisco, CA
10 hours ago
Site Reliability Engineer III
$151.5k - $252.5k
...and making a real impact for some of the world’s biggest brands. About The Role We are looking for an experienced Senior Site Reliability Engineer to join the Veeam Data Cloud (VDC) engineering team. You will be working with a global team to build the world’s next modern...
Suggested
Base plus commission
Local area
Worldwide
Veeam
San Francisco, CA
10 hours ago
Site Reliability Engineer
...advanced algorithms that significantly outperforms individual engineers. We combine language models with human ingenuity to push the... ...and quality. The Role We are seeking an experienced Site Reliability Engineer to join our Platform Engineering team in the Bay Area...
Suggested
Dormont Manufacturing Company
San Francisco, CA
2 days ago
Site Reliability Engineer - Scale & Observability
A dynamic tech firm located in San Francisco is seeking a Site Reliability Engineer to enhance operational health across their production systems. This high-impact role demands expertise in AWS and strong programming skills. You will manage production systems' reliability...
gamma.app
San Francisco, CA
2 days ago
Site Reliability Engineer
...shape the future of healthcare, we’d love to meet you. About the role We’re hiring an SRE to join our engineering team at Plenful and take ownership of the reliability and performance of the systems that power our product. You’ll work across our distributed workflow...
Work at office
Remote work
Flexible hours
2 days per week
Plenful
San Francisco, CA
15 hours ago
Senior Site Reliability Engineer
...customer acquisition, and Connor was a machine learning research engineer at Scale AI. The rest of our team comes from companies like... ...of-the-art AI. As a Senior SRE, you'll tackle the scaling and reliability challenges that come with adding terabytes of data monthly and...
Unify
San Francisco, CA
4 days ago
Site Reliability Engineer
...millions of daily users while enabling our engineering teams to ship fast. You'll own the... ...building automation and tooling that improves reliability and partnering with engineering to... ...services What you'll bring 5+ years in Site Reliability Engineering, DevOps, or systems...
Work at office
Work from home
gamma.app
San Francisco, CA
2 days ago
Senior/Staff Site Reliability Engineer
$50 per hour
...years of professional SRE experience 5+ years of experience contributing to architecture and design (architecture, design patterns, reliability and scaling) of new and current systems Bachelor’s Degree in Computer Science or related field, or 8+ years relevant work...
Temporary work
Work experience placement
Dormont Manufacturing Company
San Francisco, CA
4 days ago
Director of Site Reliability Engineering
$210k - $310k
...the SDF team is expanding to support the rapidly growing and changing Stellar ecosystem. SDF is looking for a Director of Site Reliability Engineering to lead a small, high-leverage SRE team and help shape how engineering teams own, operate, and improve production...
Temporary work
Work at office
Local area
Worldwide
Flexible hours
Stellar
San Francisco, CA
1 day ago
Principal Site Reliability Engineer
$300 per month
...About This Role As a Principal Site Reliability Engineer, you will play a critical role in designing and operating a next-generation NeoCloud built for AI, GPU, and high-performance workloads. This role sits at the intersection of infrastructure architecture, reliability...
Temporary work
Dormont Manufacturing Company
San Francisco, CA
2 days ago
Remote Senior Site Reliability Engineer (SRE) - Zetachain
We are seeking a Sr. Site Reliability Engineer to join our team and run critical infrastructure for our blockchain and web applications. You’ll learn to deploy and maintain a fleet of RPC and validator nodes for multiple blockchain networks. You’ll also provide guidance...
Remote job
Blockchain Works
San Francisco, CA
a month ago
CloudDevs: Senior Web site Reliability Engineer (SRE)
CloudDevs: Senior Web site Reliability Engineer (SRE) CloudDevs works with fast-moving, venture-backed startups throughout the US. We’re constructing a pool of world-class Web site Reliability Engineers for present roles and for upcoming alternatives. You’ll both be positioned...
The10minutecareersolution
San Francisco, CA
1 day ago
Site Reliability Engineer (Senior or Staff), Infrastructure Security
$127k - $249k
Senior / Staff Engineer - SRE, InfraSec We are looking for an experienced Senior or Staff Engineer for our SRE, InfraSec team to guide the security of our cloud‑based infrastructure. You will be highly hands‑on technically while also mentoring a small team of SREs. The...
Local area
Remote work
The Consulting Solutions
San Francisco, CA
1 day ago
Senior Site Reliability Engineer
$166.59k - $199.91k
...About the Role The company is looking for a high-performance engineer to be a part of a team of Site Reliability Engineers. You will be working closely with engineering teams, product managers, as well as support and sales engineers to build the future of the company’s...
Work experience placement
United States Digital Space LLC
Oakland, CA
3 days ago
Staff Site Reliability Engineer, Tech Lead
...customer acquisition, and Connor was a machine learning research engineer at Scale AI . The rest of our team comes from companies like... ...-of-the-art AI. As our Staff SRE Tech Lead, you'll own the reliability and scalability of our platform as we add terabytes of data monthly...
Unify
San Francisco, CA
15 hours ago
Site Reliability Engineer
$160k - $230k
...As a Site Reliability Engineer (SRE) at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a software engineer that applies sound engineering principles, operational discipline...
Remote job
Full time
Work experience placement
Together AI
San Francisco, CA
more than 2 months ago
Site Reliability Engineer
$100k - $200k
...Instacart, NFI, Ramp, and Zscaler. We’re building the most reliable and secure identity platform in the world. To do that, we... ..., automates, and recovers without skipping a beat. As a Site Reliability Engineer, you’ll help us design, run, and improve the systems that...
Remote work
Flexible hours
ConductorOne
San Francisco, CA
more than 2 months ago
Site Reliability Engineer
$130k - $175k
...scale energy storage and producing battery materials in the U.S. for the first time, all from batteries we already have. Site Reliability Engineer Essential Duties: We are seeking a highly skilled and motivated Site Reliability Engineer to collect requirements,...
Full time
Casual work
Work at office
Local area
Night shift
Redwood Materials
San Francisco, CA
more than 2 months ago
Site Reliability Engineer - Supercomputing
$180k
...knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who... ...their teammates. About the Role We are seeking a talented Site Reliability Engineer (SRE) to join our SuperComputing team. In this role...
Temporary work
Relocation
xAI
San Francisco, CA
more than 2 months ago
Site Reliability Engineer - Hosting
...design of information and operational support systems. Required Skills/Qualifications: BS/MS degree in Computer Science, Engineering, or a related subject. Equivalent experience accepted. Proven working experience in installing, configuring, and troubleshooting...
Permanent employment
Work experience placement
Start working today
Remote work
Flexible hours
San Francisco, CA
more than 2 months ago
Site Reliability Engineer II, tvScientific
$114.3k - $235.32k
...verification who have now purpose-built a CTV performance platform advertisers can trust to grow their business. We are seeking a Site Reliability Engineer to help operate, scale, and continuously improve a cloud-native platform built on AWS, Kubernetes/EKS, and ArgoCD-driven...
Work at office
Relocation
Relocation package
Pinterest
San Francisco, CA
1 day ago
Senior Site Reliability Engineer (GPU Clusters) - Hosting
$250k
...Europe, while now significantly expanding its footprint in the United States. The company is looking for a Senior / Staff Site Reliability Engineer to support and scale large-scale HPC and cloud environments powering GPU-intensive workloads. The role involves working...
Permanent employment
Remote work
San Francisco, CA
a month ago
Senior Site Reliability Engineer- San Francisco, CA, the US
...Job Description Job Description Senior Site Reliability Engineer (Payments Infrastructure) Kody is seeking a Senior Site Reliability Engineer to ensure the reliability, availability, scalability, and operational excellence of our global payment platform. You will...
Kody
San Francisco, CA
21 days ago
Senior Site Reliability Engineer
$181.69k - $213.75k
...more people in more places. We believe that the problems we solve today unlock the opportunities of tomorrow. As a Senior Site Reliability Engineer, you’ll work to: Build and scale our internal platform offerings (compute, storage and networking services) to ensure the...
Full time
Work at office
Carta
San Francisco, CA
1 day ago
Senior SRE Engineer: Scale & Reliability (Kubernetes/GCP)
A leading language learning platform is seeking an experienced SRE Engineer to ensure the reliability and resilience of their infrastructure. Responsibilities include leading incident response, improving observability, and collaborating with various teams to enhance platform...
Speak
San Francisco, CA
2 days ago
Senior SRE & InfraSec Engineer — Remote
The Consulting Solutions is seeking an experienced Senior / Staff Engineer for our SRE, InfraSec team in Seattle. The role involves leading the security of cloud-based infrastructure, mentoring a team of SREs, and collaborating with other engineering teams to ensure high...
Remote job
The Consulting Solutions
San Francisco, CA
1 day ago
Site Reliability Engineer - Storage
$180k
...knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who... ...share knowledge with their teammates. About the role As a Site Reliability Storage Engineer, you will play a pivotal role in designing,...
Remote job
Temporary work
xAI
San Francisco, CA
more than 2 months ago
Site Reliability Engineering
...Job Description Job Description Forhyre is looking for engineers who can bring unique perspectives and innovative ideas to all areas... ...evangelize cloud best practices while building a culture of reliability and observability Engage in and improve the end to end lifecycle...
Forhyre
San Francisco, CA
23 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Site Reliability Engineer. Be the first to apply!