Site Reliability Engineer
$145k - $165k
Bolt Graphics
Bolt Graphics is a semiconductor startup based in Sunnyvale, CA building the fastest and most efficient graphics processors. We pride ourselves on our first-principles approach to solving problems. We are energized by our mission to reduce the barrier to entry for content creation and consumption. Our goal is to enable everyone to easily create, simulate, and consume immersive experiences as vividly as they can imagine them.

Our Values

- Be Fearless: Unmute yourself. Test boundaries and get proven right.
- Remain Adaptable: Stay comfortable in a continuously changing world. If you're wrong, concede and move on.
- Educate Your Ego: Selflessly collaborate towards our shared purpose.

About the role

Bolt Graphics is seeking a highly experienced Site Reliability Engineer (SRE) to design, build, and operate highly reliable developer and production systems. This role is mission-critical to maintaining uptime, performance, and operational excellence across compute, storage, and networking environments. Exceptional Linux expertise and advanced automation capabilities are mandatory for success in this role.

What you'll do

- Design, implement, and operate highly available, fault-tolerant infrastructure and services.
- Install, maintain, and upgrade server, storage, and networking hardware in office and colocation facilities.
- Continuously monitor developer and production environments and proactively remediate reliability risks.
- Participate in an on-call rotation and lead incident response efforts, including rapid triage, mitigation, and post-incident root cause analysis.
- Respond effectively under pressure to outages and degradation events to restore service availability.
- Develop, maintain, and continuously improve automation and operational tooling using Bash and Python.
- Partner closely with engineering teams to support development, testing, and production workloads at scale.

Qualifications (required)

- Expert-level Linux systems administration across complex, production environments (this is a core requirement).
- Exceptional proficiency in Bash and Python; advanced scripting and automation skills are mandatory, not optional.
- Proven ability to write maintainable automation and diagnostic tooling for large-scale systems.
- Deep understanding of server hardware, storage subsystems, and datacenter operations.
- Hands-on experience with virtualization platforms including Proxmox (current), VMware vSphere, and/or OpenShift.
- Strong experience with containerization technologies (Docker, containerd) and orchestration platforms (Kubernetes).
- Experience operating workloads in AWS and/or Microsoft Azure environments.
- Experience implementing observability, monitoring, and alerting using tools such as Prometheus and Grafana.

Additional Qualifications

- Familiarity with systems programming languages such as C, C++, Rust, Go, and/or Julia.
- Relevant certifications such as CompTIA A+, Azure Engineer, or similar are preferred.
- Active government clearance or the ability to obtain one is preferred.

On-Call & Incident Response Expectations

This role includes participation in an on-call rotation supporting developer and production systems. The SRE is expected to respond to incidents outside of normal business hours as required, lead technical incident response efforts, communicate effectively with stakeholders during outages, and produce clear post-incident documentation and corrective action plans.

Compensation Range

$145,000–$165,000 per year (California). This range represents the anticipated base pay for this role; the final offer may vary based on qualifications, experience, and location.

- Medical, Dental, & Vision - 100% covered premiums
- Equity - Stock Options
- 401(k) match

Equal Opportunity Statement

Bolt is committed to building a diverse and inclusive environment in which we recognize and value each other's differences as well as fostering a culture that promotes its core values: Professionalism, Integrity, and Respect. As an equal opportunity employer, all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, genetic information, national origin, age, disability, or status as a protected veteran.

Location & Sponsorship

Please note that Bolt Graphics does not currently sponsor candidates for this role. This role is strictly based in Sunnyvale, CA and will require someone to be locally based, preferably in the Bay Area.



