Site Reliability Engineer

$145k - $165k

Bolt Graphics

Bolt Graphics is a semiconductor startup based in Sunnyvale, CA building the fastest and most efficient graphics processors. We pride ourselves on our first principles approach to solving problems. We are energized by our mission to reduce the barrier of entry for content creation and consumption. Our goal is to enable everyone to easily create, simulate and consume immersive experiences as vividly as they can imagine them.

Our Values
  • Be Fearless: Unmute yourself. Test boundaries and get proven right.
  • Remain Adaptable: Stay comfortable in a continuously changing world. If you’re wrong, concede and move on.
  • Educate Your Ego: Selflessly collaborate towards our shared purpose.

About the role
Bolt Graphics is seeking a highly experienced Site Reliability Engineer (SRE) to design, build, and operate highly reliable developer and production systems. This role is mission-critical to maintaining uptime, performance, and operational excellence across compute, storage, and networking environments. Exceptional Linux expertise and advanced automation capabilities are mandatory for success in this role.

What you'll do
  • Design, implement, and operate highly available, fault-tolerant infrastructure and services.
  • Install, maintain, and upgrade server, storage, and networking hardware in office and colocation facilities.
  • Continuously monitor developer and production environments and proactively remediate reliability risks.
  • Participate in an on-call rotation and lead incident response efforts, including rapid triage, mitigation, and post-incident root cause analysis.
  • Respond effectively under pressure to outages and degradation events to restore service availability.
  • Develop, maintain, and continuously improve automation and operational tooling using Bash and Python.
  • Partner closely with engineering teams to support development, testing, and production workloads at scale.

Qualifications (required)
  • Expert-level Linux systems administration across complex, production environments (this is a core requirement).
  • Exceptional proficiency in Bash and Python; advanced scripting and automation skills are mandatory, not optional.
  • Proven ability to write maintainable automation and diagnostic tooling for large-scale systems.
  • Deep understanding of server hardware, storage subsystems, and datacenter operations.
  • Hands‑on experience with virtualization platforms including Proxmox (current), VMware vSphere, and/or OpenShift.
  • Strong experience with containerization technologies (Docker, containerd) and orchestration platforms (Kubernetes).
  • Experience operating workloads in AWS and/or Microsoft Azure environments.
  • Experience implementing observability, monitoring, and alerting using tools such as Prometheus and Grafana.

Additional Qualifications
  • Familiarity with systems programming languages such as C, C++, Rust, Go, and/or Julia.
  • Relevant certifications such as CompTIA A+, Azure Engineer, or similar are preferred.
  • Active government clearance or the ability to obtain one is preferred.

On-Call & Incident Response Expectations
This role includes participation in an on-call rotation supporting developer and production systems. The SRE is expected to respond to incidents outside of normal business hours as required, lead technical incident response efforts, communicate effectively with stakeholders during outages, and produce clear post-incident documentation and corrective action plans.

Compensation Range
$145,000–$165,000 per year (California). This range represents the anticipated base pay for this role; the final offer may vary based on qualifications, experience, and location.
  • Medical, Dental, & Vision - 100% covered premiums
  • Equity - Stock Options
  • 401(k) match

Equal Opportunity Statement
Bolt is committed to building a diverse and inclusive environment in which we recognize and value each other’s differences as well as fostering a culture that promotes its core values: Professionalism, Integrity, and Respect. As an equal opportunity employer, all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, genetic information, national origin, age, disability, or status as a protected veteran.

Location & Sponsorship
Please note that Bolt Graphics does not currently sponsor candidates for this role. This role is strictly based in Sunnyvale, CA and will require someone to be locally based, preferably in the Bay Area.

Vacancy posted 3 days ago
Similar jobs that could be interesting for you
Based on the Site Reliability Engineer in Sunnyvale, CA vacancy
  •  ...of Huobi globe spanning infrastructure. •       Work with engineering teams to make sure new features and changes are deployed quickly...  .... •       Constantly improve our system performance and reliability through better tools, process and monitoring system. •... 
    Suggested
    Worldwide

    Cryptoware Technologies Inc

    Santa Clara, CA
    4 days ago
  •  ...Job Description Site Reliability Engineer Onsite- Bay Area, CA Skills Relevant Skills and Experience What You’ll Do (Day-to-Day) Own and manage our cloud infrastructure (GCP or AWS, on-prem). Build, maintain, and optimize Kubernetes... 
    Suggested

    Amiri Recruiting

    Mountain View, CA
    4 days ago
  • $180k - $200k

     ...Holmdel, NJ. Join us and be part of a team that's shaping the future of payments—one experience at a time. As our Site Reliability Engineer, you will design, build, and maintain the systems and infrastructure that power our applications, ensuring their... 
    Suggested
    For contractors
    Work at office
    Work from home
    Flexible hours

    PayNearMe

    Santa Clara, CA
    4 days ago
  •  ..., and the challenges of building in a high-growth startup, we’d love to talk. This is more than a job—it’s a journey. Site Reliability Engineers (SREs) are responsible for the overall performance and reliability of ASAPP's infrastructure and products. The team owns... 
    Suggested
    Remote work

    ASAPP

    Mountain View, CA
    22 days ago
  • $152k - $241.5k

     ...infrastructure platforms for automated host lifecycle management, fleet reliability/auto‑healing, E2E observability or data‑driven operations (...  ...languages such as Python, Go, Perl, or Ruby. Mentored other engineers and influenced technical direction through design reviews,... 
    Suggested

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  •  ...Job Description Forhyre is looking for engineers who can bring unique perspectives and innovative ideas to all areas...  ...evangelize cloud best practices while building a culture of reliability and observability Engage in and improve the end to end lifecycle... 

    forhyre.com

    Sunnyvale, CA
    4 days ago
  • $148k - $235.75k

     ...see how you can make a lasting impact on the world. Join our team of innovative engineers who are building an AI Data Center AIOps platform that turns raw, high-volume telemetry into reliable, job-centric insights and automation for GPU fleets. We’re hiring a DevOps Engineer... 

    NVIDIA Corporation

    Santa Clara, CA
    5 days ago
  • Education Requirements, Ideal Experience: Associate’s degree in Industrial Engineering or IT related field Minimum of 0-3 years’ relevant experience Knowledge of the application of tools/techniques Experience in one coding language (Preferred) Experience in Database (Preferred... 

    FII

    Sunnyvale, CA
    5 days ago
  •  ...that keep the world running. Location: 5 on-site days a week in Sunnyvale, CA Headquarters. Our Team's Vision: Our Engineering team is shaping the future of cybersecurity...  ...are looking for an experienced Senior Site Reliability Engineer (SRE) with a strong background in... 
    Work experience placement

    Illumio

    Sunnyvale, CA
    1 day ago
  • $210k - $300k

     ...Site Reliability Engineer (SRE) / DevOps Engineer Location: Onsite in NYC or San Francisco Compensation: $210,000–$300,000 Base Salary About the Role We are seeking an experienced Site Reliability Engineer (SRE) / DevOps Engineer to help build, scale, and operate... 

    TechLine Consulting

    Sunnyvale, CA
    2 days ago
  •  ...design by customizing MES tool per business needs Education Requirements, Ideal Experience: Associate’s degree in Industrial Engineering or IT related field Minimum of 0-3 years’ relevant experience Experience in C#, Delphi desired Knowledge of the... 
    Work at office

    Foxconn Industrial Internet - FII

    Sunnyvale, CA
    4 days ago
  • $217.57k - $260k

     ...Identity Left Behind" to enable all people to have a secure digital identity. To learn more, visit Role Overview The Staff Site Reliability Engineer, Infrastructure role is building a high-scale infrastructure team responsible for owning environments with thousands of... 
    Full time
    Temporary work
    Work at office
    Remote work
    Flexible hours
    Shift work

    ID.me

    Mountain View, CA
    4 days ago
  • $200k - $322k

     ...environment, where NVIDIANs are inspired to excel and make a profound global impact. NVIDIA is seeking a Senior Manager of Site Reliability Engineering to lead and reshape how IT operations function at scale. This role goes beyond traditional service management to build... 

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $168.93k - $192.5k

     ...Left Behind" to enable all people to have a secure digital identity. To learn more, visit Role Overview We are seeking a Site Reliability Engineer to join our Core Platform Engineering organization. The SRE team builds the automation, observability, and operational... 
    Full time
    Temporary work
    Work at office
    Remote work
    Flexible hours

    ID.me

    Mountain View, CA
    4 days ago
  • $147.4k - $220.9k

    Site Reliability Engineer, Customer Systems Sunnyvale, California, United States Software and Services Imagine what you could do here. Apple is a place where extraordinary people gather to do their best work. Together we craft products and experiences people once couldn... 
    Relocation

    Apple Inc.

    Sunnyvale, CA
    5 days ago
  • $174k - $252k

    Senior Software Engineer, Site Reliability Engineering X Applicants in San Francisco: Qualified applications with arrest or conviction records will be considered for employment in accordance with the San Francisco Fair Chance Ordinance for Employers and the California... 
    Full time

    Google Inc.

    Sunnyvale, CA
    2 days ago
  • At NVIDIA, Site Reliability Engineering provides a rare chance to define, develop, and support large-scale production systems with high efficiency and availability. This demanding position merges software and systems engineering efforts to guarantee flawless service operation... 

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  •  ...Infrastructure Footprint: Global production infrastructure across AWS, South America, and Europe Role Overview Seeking a Senior Site Reliability Engineer / DevOps Engineer to design, scale, and operate highly available global infrastructure supporting production systems... 

    Prophet Town

    Mountain View, CA
    3 days ago
  • $151.6k - $245.3k

     ...outcomes. Job Summary Palo Alto Networks runs a large hybrid infrastructure and is one of the largest GCP customers. As a Site Reliability Engineer, you will be part of a team supporting the services running on this infrastructure. This includes automation, architecture... 
    Full time
    Work at office
    Visa sponsorship
    Work visa

    Palo Alto Networks, Inc.

    Santa Clara, CA
    5 days ago
  • $180k - $260k

     ...facilitating effortless integration into customers’ logistics operations. About the role We are seeking an experienced Senior/Staff Site Reliability Engineer to support the operation, monitoring, and scaling of our growing fleet of autonomous vehicles. In this role, you will work... 
    Odd job
    Work at office
    Remote work

    Booster

    Mountain View, CA
    2 days ago
  • Job Summary Note: This role requires US Citizenship. Your Career As a Principal Site Reliability Engineer, you will serve as the technical authority for our cloud-native infrastructure. You’re responsible for architecting the reliability, scalability, and security of a... 
    Visa sponsorship
    Work visa
    Shift work

    Palo Alto Networks, Inc.

    Santa Clara, CA
    4 days ago
  • $202k - $247k

    Job Category Site Reliability Engineering Posting Date 11/18/2025, 12:24 AM Locations Santa Clara, CA, United States Job Schedule Full time Job Description At Fortinet, we strive to provide a supportive, collaborative environment where people are empowered to do the best... 
    Full time
    Worldwide

    Fortinet, Inc.

    Santa Clara, CA
    4 days ago
  • $207k - $300k

    Staff Software Engineer, Site Reliability Engineering, Traffic Virtnet Google Sunnyvale, CA, USA Bachelor’s degree in Computer Science, a related field, or equivalent practical experience. 8 years of experience with software development in one or... 
    Full time

    Google Inc.

    Sunnyvale, CA
    2 days ago
  • $147.4k - $272.1k

    Site Reliability Engineer (Edge Services), Infrastructure Services Sunnyvale, California, United States Software and Services We are seeking a proactive Site Reliability Engineer to champion the evolution of our production ecosystems. In this role, you will help drive... 
    Relocation
    Shift work

    Apple Inc.

    Sunnyvale, CA
    5 days ago
  • $126k - $204.5k

     ...As part of this role, you will collaborate closely with our engineering teams to develop innovative solutions that provide clear and...  ...team to influence the operability of the product and ensure the reliability and availability of our services. Qualifications Required... 

    Palo Alto Networks, Inc.

    Santa Clara, CA
    5 days ago
  • $168k - $270.25k

    NVIDIA is looking for a Senior Site Reliability Engineer (SRE) to join its GeForce Now (GFN) team. SRE at NVIDIA ensures that our internal and external-facing GPU cloud gaming services have reliability and uptime as promised to the users and at the same time enables developers... 
    Full time

    NVIDIA

    Santa Clara, CA
    2 days ago
  • $176k - $276k

    Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and availability using the combination of software and systems engineering practices. This is a highly specialized... 

    NVIDIA Corporation

    Santa Clara, CA
    3 days ago
  • $180k - $320k

     ...Job Description About the role Own the infrastructure that engineering depends on: Kubernetes clusters, CI/CD pipelines, on-prem ↔ cloud sync, observability, and high-availability platforms for chip-design and ML workloads. Work with chip-design and... 
    H1b
    Visa sponsorship
    Work visa

    DensityAI

    Mountain View, CA
    3 days ago
  • $175.8k - $264.2k

    Senior Site Reliability Engineer - Apple Services Engineering (ASE) / iCloud Cupertino, CA People at Apple don't just build products - they craft experiences our customers love and depend on. Apple Services Engineering (ASE) builds and supports the systems that make many... 

    Hong Kong Study Skills Research Institute

    Cupertino, CA
    5 days ago
  • $120.3k - $194.53k

     ...drives great outcomes. Job Summary Palo Alto Networks runs a large hybrid infrastructure across multiple public clouds. As a Site Reliability Engineer on the Internet Security Platform team, you will be part of a team supporting Advanced DNS Security services. This... 
    Full time
    Work at office
    Visa sponsorship
    Work visa

    Palo Alto Networks, Inc.

    Santa Clara, CA
    3 days ago
