Senior Site Reliability Engineer

$175k - $250k

The Recruiting Guy

1 day ago

This range is provided by The Recruiting Guy. Your actual pay will be based on your skills and experience; talk with your recruiter to learn more.

Base pay range: $175,000.00/yr - $250,000.00/yr

Job Title: Senior Cloud Infrastructure Engineer
Location: San Francisco, CA. Remote unavailable.
Modality: On-Site only. Must live within commuting distance of San Francisco or be willing to relocate.
Relocation Assistance: No
Employment Type: Salaried W2 Full-Time.
Salary Range: $175,000 - $250,000

About The Company

We represent a pioneering open source technology company in San Francisco that is transforming the way creators interact with generative AI. They are the team behind a powerful, node based visual interface that gives artists, developers, and innovators the ability to design, control, and customize AI workflows with complete flexibility. Their platform allows users to connect modular components, build complex pipelines, and run everything locally with impressive speed and precision.

Their mission is to make generative AI open, transparent, and accessible to everyone. Built around community collaboration and creative empowerment, their tools help users experiment freely and bring their ideas to life. Whether it is visual storytelling, image generation, or advanced machine learning, their technology gives creators the freedom to explore without limitations.

About The Role

In this role, you will take the lead on designing, deploying, and maintaining large-scale distributed systems that power AI workloads. The ideal candidate is deeply technical, self-sufficient, and motivated by solving complex infrastructure challenges. You will work closely with core engineers to shape the company’s long-term infrastructure vision while ensuring scalability, performance, and reliability across environments.

What You’ll Do

Design, build, and maintain the core infrastructure that powers AI workloads at scale
Manage and automate GPU compute clusters using tools such as Python, Kubernetes, Terraform, and Ansible
Architect and operate systems for orchestration, observability, distributed storage, and networking
Ensure reliability, scalability, and performance across production environments
Collaborate closely with core engineers to design infrastructure for new features and systems
Contribute to technical strategy and long-term infrastructure vision
Drive best practices for infrastructure automation, deployment, and monitoring

Requirements

5+ years experience as an Infrastructure Engineer or Site Reliability Engineer building and operating large-scale distributed systems
Skilled in Python and comfortable working with infrastructure-as-code tools such as Terraform and Ansible
Familiar with container orchestration systems such as Kubernetes and related tooling like FluxCD, Prometheus, and Grafana
Capable of managing high-performance GPU environments across cloud and bare metal setups
Highly adaptable, resourceful, and motivated by building things from the ground up
Excited to work in a small, fast-growing team where autonomy and accountability are key
Comfortable working on-site in a startup setting where collaboration and speed matter most

Bonus Points

Experience contributing to or maintaining open-source projects
Background working with AI infrastructure, ML pipelines, or GPU orchestration
Strong computer science fundamentals and ability to work across different programming languages or frameworks

Skills: prometheus, fluxcd, kubernetes, python, ansible, terraform, infrastructure, grafana


Vacancy posted 9 hours ago

Similar jobs that could be interesting for you
Based on the Senior Site Reliability Engineer in San Francisco, CA vacancy

Senior Site Reliability Engineer
...founders with PhDs in AI, Math, and Computer Science - is poised to redefine computing. About the Role We're seeking a Site Reliability Engineer to ensure Hyperbolic's GPU marketplace and AI infrastructure operate with exceptional reliability, performance, and...
Senior
Hyperbolic Labs
San Francisco, CA
4 days ago
Senior Site Reliability Engineer
$210k - $240k
...Join to apply for the Senior Site Reliability Engineer role at Alembic Technologies This range is provided by Alembic Technologies. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more. Base pay range $210,000.00/yr - $2...
Senior
Full time
Alembic Technologies
San Francisco, CA
9 hours ago
Senior Site Reliability Engineer
...About the job Senior Site Reliability Engineer About the Company Stellar is a decentralized, public blockchain that gives developers the tools to create experiences that are more like cash than crypto. The network is faster, cheaper, and far more energy-efficient...
Senior
TechChain Talent
San Francisco, CA
4 days ago
Senior Site Reliability Engineer
$160k - $250k
...public clouds when the right fit. As we continue to commercialize our machine learning models, we also need to grow our DevOps and Site Reliability team to maintain the reliability of our enterprise SaaS offering for our customers. Our ideal candidate is someone who is able...
Senior
Hive
San Francisco, CA
4 days ago
Senior Site Reliability Engineer
...Engineering Hiring Sprint We're growing our engineering team and are accelerating hiring through a focused Engineering Hiring Sprint... ...: Platform Engineers Database Engineers Site Reliability Engineers Extensibility API Engineers AI Agents Engineers...
Senior
Work at office
Local area
Flexible hours
Airbyte
San Francisco, CA
5 days ago
Senior Software Engineer - Site Reliability Engineering
...Udaip Cloud-Based Data And Ai Platform Engineer At U.S. Bank, we're on a journey to do our best. Helping the customers and businesses we serve to make better and smarter financial decisions and enabling the communities we support to grow and succeed. We believe it...
Senior
Temporary work
Work experience placement
Phenom People
San Francisco, CA
4 days ago
Senior Software Engineer, Site Reliability Engineering
$210.8k - $272.8k
About Thumbtack Thumbtack helps millions of people confidently care for their homes. About the Site Reliability Engineering Team The Site Reliability Engineering team focuses on creating and maintaining a reliable, secure, and scalable platform vital for a seamless user...
Senior
Local area
Thumbtack
San Francisco, CA
5 days ago
Senior Manager, Site Reliability Engineering - Infrastructure Platform
$232k - $319k
...to help us continue to scale the service with great people and reliable, cost-effective, and efficient infrastructure, processes, and... ...with self-service Accelerate the velocity of SRE and product engineering by developing robust platforms, powerful tooling, and...
Senior
Permanent employment
Local area
Worldwide
Flexible hours
Okta, Inc.
San Francisco, CA
5 days ago
Senior Site Reliability Engineer
$174.92k - $209.91k
...same: to make access to data as simple and reliable as electricity. With Fivetran, customer... ..., canonical and ready to query, with no engineering or maintenance required. We're proud... ...integrate our teams, systems, and career sites. About the Role Fivetran is building...
Senior
Full time
Work at office
Remote work
Fivetran
Oakland, CA
2 days ago
Senior Site Reliability Engineer- San Francisco, CA, the US
...Job Description Senior Site Reliability Engineer (Payments Infrastructure) Kody is seeking a Senior Site Reliability Engineer to ensure the reliability, availability, scalability, and operational excellence of our global payment platform. You will...
Senior
Kody
San Francisco, CA
7 days ago
Senior Site Reliability Engineer (SRE) - AI Infrastructure
$300k
...thousands of H100s, H200s, and B200s, ready for experimentation, full-scale model training, or inference. As a Platform Engineer/Senior Site Reliability Engineer, you’ll own the reliability, performance, and automation of this GPU-powered infrastructure, ensuring...
Senior
Permanent employment
San Francisco, CA
more than 2 months ago
Senior Site Reliability Engineer (GPU Clusters) - Hosting
$250k
...across Europe, while now significantly expanding its footprint in the United States. The company is looking for a Senior / Staff Site Reliability Engineer to support and scale large-scale HPC and cloud environments powering GPU-intensive workloads. The role involves...
Senior
Permanent employment
Remote work
San Francisco, CA
more than 2 months ago
Senior Software Engineer
...and 7Wire Ventures. About the Role We are looking for a Senior Software Engineer — an individual contributor who owns a product surface end-... ...governed by HIPAA, PCI-DSS, and SOC 2. Responsibilities Deliver Reliable Craft on Your Surface Deliver reliable, high-quality craft...
Senior
Full time
PayZen
San Francisco, CA
3 days ago
Senior Software Engineer
...Flexible health coverage and 401K matching. The Role This is a senior individual contributor role for someone who can take a... ...a single feature, and help raise the bar for the rest of the engineering team. We’re looking for someone who can see how the pieces fit...
Senior
Full time
Flexible hours
Circleback
San Francisco, CA
2 days ago
Senior Software Engineer
...gets stuff built. You'll work closely with engineers, designers, and end-users to accelerate our product development. As a senior-level software engineer at Pulley, you will... ...regulations, and jurisdiction workflows into fast, reliable product experiences, with AI at the core of...
Senior
Full time
Pulley
San Francisco, CA
1 day ago
Senior Software Engineer
$140k - $240k
...radar systems. Working alongside software, hardware, and systems engineers, you’ll design and operate the backend services and cloud... ...-critical sensor data at scale. The systems you build must be reliable, resilient, secure, and performant—enabling operators to depend...
Senior
Permanent employment
Work experience placement
Casual work
Relocation package
CHAOS Industries
San Francisco, CA
2 days ago
Senior Software Engineer
$130k - $196.5k
...from the ground up and migrate existing complex use cases into that system. Work with a team of supportive and passionate software engineers. Architect and implement systems that materialize our platform vision. Provide operational support for our production systems...
Senior
Full time
Work from home
Flexible hours
Night shift
LiveRamp
San Francisco, CA
2 days ago
Senior Software Engineer
$170k - $230k
...history of writing scalable, performant and maintainable code. We strongly believe languages can be learned and care more about your engineering skill over frameworks * Excitement about shipping customer centric software * Experience with Java >= 11, JPA ORM mapping,...
Senior
Full time
Work at office
Local area
Home office
Flexible hours
Highnote
San Francisco, CA
1 hour ago
Senior Software Engineering - Telecommute
$200k - $250k
Senior Software Engineer Engineering Prolific Prolific is not just another player in the AI space - we are the architects of the human data... ...decisions that balance scrappy startup execution with scalable, reliable engineering, as Prolific revolutionizes research for the AI...
Senior
Full time
Work at office
Remote work
2 days per week
1 day per week
Prolific
San Francisco, CA
2 days ago
Senior Software Engineer
$165k - $247k
...identity resolution, data import and export connectors, privacy and compliance, and the operational reliability of everything in between. As a Senior Software Engineer, you'll take on complex infrastructure challenges: designing for extreme throughput, optimizing for...
Senior
Full time
Home office
Flexible hours
Amplitude
San Francisco, CA
4 days ago
Senior Software Engineer, Infrastructure
$180k - $230k
...Sift Sift is the data infrastructure platform for hardware engineering teams. Sift turns high-frequency telemetry into engineering insights... ...software engineers to optimize application performance and reliability. Implement monitoring, alerting, and logging systems to...
Senior
Full time
Work at office
Relocation
Sift Stack, Inc.
San Francisco, CA
1 hour ago
Senior / Staff Site Reliability, Platform Engineering
...complex, distributed, cloud-native systems. As a Staff Platform Engineer, you will play a critical role in ensuring these systems... ...hands-on engineering and technical leadership role. You will own reliability for major platform domains, design scalable solutions on Kubernetes...
Senior
Saviynt
San Francisco, CA
12 days ago
Senior Principal Software Engineer, AI Onboarding
$260k - $275k
...SENIOR PRINCIPAL SOFTWARE ENGINEER Saviynt is an identity platform built to power and protect the world at work. With the rise of AI and Agents... ...in engineering processes, tooling, and operational reliability. Collaborate with internal teams to produce software...
Senior
Medium
San Francisco, CA
4 days ago
Senior Python Engineer
$126k - $248k
...Our mission is to increase developer adoption, satisfaction and retention by providing a reliable, enjoyable interface for developers and other end-users. Our senior engineers are typically specialists in a particular programming language, but are capable of contributing...
Senior
Full time
Local area
Worldwide
Flexible hours
MongoDB
San Francisco, CA
2 days ago
Senior Staff Software Engineer
$257k - $302k
...Forbes Cloud 100 2025 List and is a Y Combinator 2024 Breakthrough Company. Checkr is looking for an experienced Senior Staff Software Engineer to facilitate the long-term design of Checkr’s core systems and to lead critical cross-organizational initiatives,...
Senior
Full time
Work at office
Local area
Remote work
Relocation
Flexible hours
3 days per week
Checkr
San Francisco, CA
1 day ago
Senior Software Engineer, Networking
$200k - $250k
...and internationally. ROLE We are looking to add a Software Engineer, Networking to expand our high-impact Platform team. You’ll... ...Our team builds and operates the systems that enable secure, reliable connectivity between our platform and customer environments. As...
Senior
Full time
Work at office
Local area
Peregrine Technologies
San Francisco, CA
1 hour ago
Senior / Principal Software Engineer (Compiler & AI Tooling)
...mission-critical industries, helping partners move more quickly and reliably from algorithm to silicon. Our platform accelerates deployment... .... The Roles We are looking for an experienced software engineer to help us build a new generation of transpilation tools...
Senior
Full time
Remote work
Relocation package
Flexible hours
Code Metal
San Francisco, CA
8 hours ago
Site Reliability Engineer
$150k - $250k
...Site Reliability Engineer role USC or GC only are considered at this time. San Francisco - Local to Bay area only but role... ..., Rust, Kubernetes, FastAPI, Redis, Postgres, Prisma Seniority 4-8+ years of experience in production or reliability...
Work experience placement
Casual work
Local area
Immediate start
Remote work
3B Staffing LLC
San Francisco, CA
4 days ago
Site Reliability Engineer
...Site Reliability Engineer We are looking for a dynamic engineer to join our rapidly growing SRE team. As an SRE, you will report to our VP of Technical Operations and be responsible for operating an extremely high performance and scalable, low latency platform built...
Relocation package
1872 Consulting
San Francisco, CA
4 days ago
Site Reliability Engineer
$152.5k - $219.2k
...global cloud platform. As a team of six engineers distributed across the US, Canada, and the... ...with a strong focus on automation, reliability, and operational excellence. We are one... ...Qualifications ~2+ years of experience in Site Reliability Engineering, DevOps,...
Permanent employment
Full time
Temporary work
Local area
Worldwide
Flexible hours
Cisco
Daly City, CA
3 days ago
