Site Reliability Engineer

$145k - $165k

Bolt Graphics

Bolt Graphics is a semiconductor startup based in Sunnyvale, CA building the fastest and most efficient graphics processors. We pride ourselves on our first principles approach to solving problems. We are energized by our mission to reduce the barrier of entry for content creation and consumption. Our goal is to enable everyone to easily create, simulate and consume immersive experiences as vividly as they can imagine them.

Our Values
  • Be Fearless: Unmute yourself. Test boundaries and get proven right.
  • Remain Adaptable: Stay comfortable in a continuously changing world. If you’re wrong, concede and move on.
  • Educate Your Ego: Selflessly collaborate towards our shared purpose.

About the role

Bolt Graphics is seeking a highly experienced Site Reliability Engineer (SRE) to design, build, and operate highly reliable developer and production systems. This role is mission-critical to maintaining uptime, performance, and operational excellence across compute, storage, and networking environments. Exceptional Linux expertise and advanced automation capabilities are mandatory for success in this role.

What you'll do
  • Design, implement, and operate highly available, fault-tolerant infrastructure and services.
  • Install, maintain, and upgrade server, storage, and networking hardware in office and colocation facilities.
  • Continuously monitor developer and production environments and proactively remediate reliability risks.
  • Participate in an on-call rotation and lead incident response efforts, including rapid triage, mitigation, and post-incident root cause analysis.
  • Respond effectively under pressure to outages and degradation events to restore service availability.
  • Develop, maintain, and continuously improve automation and operational tooling using Bash and Python.
  • Partner closely with engineering teams to support development, testing, and production workloads at scale.

Qualifications (required)
  • Expert-level Linux systems administration across complex, production environments (this is a core requirement).
  • Exceptional proficiency in Bash and Python; advanced scripting and automation skills are mandatory, not optional.
  • Proven ability to write maintainable automation and diagnostic tooling for large-scale systems.
  • Deep understanding of server hardware, storage subsystems, and datacenter operations.
  • Hands‑on experience with virtualization platforms including Proxmox (current), VMware vSphere, and/or OpenShift.
  • Strong experience with containerization technologies (Docker, containerd) and orchestration platforms (Kubernetes).
  • Experience operating workloads in AWS and/or Microsoft Azure environments.
  • Experience implementing observability, monitoring, and alerting using tools such as Prometheus and Grafana.

Additional Qualifications
  • Familiarity with systems programming languages such as C, C++, Rust, Go, and/or Julia.
  • Relevant certifications such as CompTIA A+, Azure Engineer, or similar are preferred.
  • Active government clearance or the ability to obtain one is preferred.

On-Call & Incident Response Expectations

This role includes participation in an on-call rotation supporting developer and production systems. The SRE is expected to respond to incidents outside of normal business hours as required, lead technical incident response efforts, communicate effectively with stakeholders during outages, and produce clear post-incident documentation and corrective action plans.

Compensation Range

$145,000–$165,000 per year (California). This range represents the anticipated base pay for this role; the final offer may vary based on qualifications, experience, and location.
  • Medical, Dental, & Vision - 100% covered premiums
  • Equity - Stock Options
  • 401(k) match

Equal Opportunity Statement

Bolt is committed to building a diverse and inclusive environment in which we recognize and value each other’s differences as well as fostering a culture that promotes its core values: Professionalism, Integrity, and Respect. As an equal opportunity employer, all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, genetic information, national origin, age, disability, or status as a protected veteran.

Location & Sponsorship

Please note that Bolt Graphics does not currently sponsor candidates for this role. This role is strictly based in Sunnyvale, CA and will require someone to be locally based, preferably in the Bay Area.

Vacancy posted 4 days ago
Similar jobs that could be interesting for you
Based on the Site Reliability Engineer in Sunnyvale, CA vacancy
  • $250k

     ...systems, eGain provides the single source of truth—explainable, reliable, and maintainable—that serves as the repository for all...  ...at scale. Position Overview As Director of Site Reliability Engineering, you will ensure that eGain’s AI knowledge management platform... 
    Suggested
    Work at office

    eGain Corporation

    Sunnyvale, CA
    1 day ago
  •  ...Overview: *Must have Apple experience* • At least 8+ years in a Reliability Engineering, DevOps or infrastructure focused role • Advanced experience with programming languages (Python, Java) • Passion for designing and building reliable systems • Strong sense... 
    Suggested

    Purple Drive

    Sunnyvale, CA
    6 hours ago
  •  ...keep the world running. Location: 5 on-site days a week in Sunnyvale, CA Headquarters. Our Team's Vision: Our Engineering team is driven by a culture that thrives...  ...basis, you will work on enhancing system reliability and scalability of Illumio SaaS products,... 
    Suggested
    Work experience placement
    Immediate start

    Illumio

    Sunnyvale, CA
    7 days ago
  •  ...Site Reliability Engineer (SRE) Location: Santa Clara Valley (Cupertino), California, Hybrid. Duration: 6+ Months Job Description Deploy, support and monitor new and existing services, platforms, and application stacks. Use scale testing to measure, tune... 
    Suggested

    Zortech Solutions

    Cupertino, CA
    1 day ago
  •  ...Senior Site Reliability Engineer Location: Remote Duration: 12 month contract to start IV Process: 1-3 Round IV process International Tech Top Skills: Java Python NodeJS -DevOps Engineer should work here too Main Responsibilities:... 
    Suggested
    Contract work
    Local area
    Remote work

    My3Tech Inc

    Sunnyvale, CA
    4 days ago
  • Job Description : Need to have experience with ticket support, azure, Splunk, ServiceNow, and any Java experience is a plus. Ideally candidates that come from an Enterprise background Handling tickets for the Walmart environment. Splunk, Servicenow...

    3B Staffing LLC

    Sunnyvale, CA
    6 hours ago
  • $145k - $175k

     ...Site Reliability Engineer (SRE) Bolt Graphics is a semiconductor startup based in Sunnyvale, CA building the fastest and most efficient graphics processors. We pride ourselves on our first principles approach to solving problems. We are energized by our mission to reduce... 
    Work at office
    Immediate start
    Work from home

    Bolt Graphics

    Sunnyvale, CA
    2 days ago
  • $181.69k - $213.75k

     ...Senior Site Reliability Engineer San Francisco, California; Santa Clara, California; Seattle, WA The Company You'll Join Carta connects founders, investors, and limited partners through world-class software, purpose-built for everyone in venture capital, private... 
    Full time
    Work at office

    Carta

    Santa Clara, CA
    6 hours ago
  •  ...keep the world running. Location: 5 on-site days a week in Sunnyvale, CA Headquarters. Our Team's Vision: Our Engineering team is shaping the future of...  ...are looking for an experienced Senior Site Reliability Engineer (SRE) with a strong background in... 
    Work experience placement
    Immediate start

    Illumio

    Sunnyvale, CA
    2 days ago
  • $150k - $175k

     ...Site Reliability Engineer At ASAPP, our mission is simple: deliver the best AI-powered customer experience—faster than anyone else. To achieve that, we're guided by principles that shape how we think, build, and execute. We value customer obsession, purposeful speed... 
    Remote work

    ASAPP

    Mountain View, CA
    1 day ago
  •  ...Location: Sunnyvale, CA (3x/ week onsite) Duration: 6 months SRE - Site Reliability Engineer Responsibilities: Engage with our product teams to understand requirements, design and implement resilient and scalable infrastructure solutions.... 

    Diverse Lynx

    Sunnyvale, CA
    1 day ago
  • $170k - $200k

     ...Site Reliability Engineer We are seeking a talented and motivated Site Reliability Engineer to join our engineering team. You will be responsible for building, maintaining, and troubleshooting cloud service/cluster, infrastructure, and monitoring systems to ensure high... 
    Full time
    Worldwide

    Edelman

    Sunnyvale, CA
    3 days ago
  • $148k - $235.75k

     ...Processes organization where you will be working as a Senior SRE Engineer. The position will be part of a fast-paced crew that develops...  ...: Manage NVIDIA's on-prem infrastructure. Maintain uptime, reliability and readiness of on-prem engineering cloud spread across... 
    Remote work

    NVIDIA

    Santa Clara, CA
    1 day ago
  •  ...Senior Site Reliability Engineer LeanData helps the world's fastest-growing companies automate, simplify, and accelerate revenue. We are looking for a Senior Site Reliability Engineer to lead the strategic evolution of our cloud infrastructure. Reporting directly... 
    Full time
    Work at office
    Flexible hours
    2 days per week

    LeanData

    Santa Clara, CA
    6 hours ago
  • Education Requirements, Ideal Experience: Associate’s degree in Industrial Engineering or IT related field Minimum of 0-3 years’ relevant experience Knowledge of the application of tools/techniques Experience in one coding language (Preferred) Experience in Database (Preferred... 

    FII

    Sunnyvale, CA
    1 day ago
  •  ...that keep the world running. Location: 5 on-site days a week in Sunnyvale, CA Headquarters. Our Team's Vision: Our Engineering team is shaping the future of cybersecurity...  ...are looking for an experienced Senior Site Reliability Engineer (SRE) with a strong background in... 
    Work experience placement

    Illumio

    Sunnyvale, CA
    2 days ago
  • $152k - $241.5k

     ...infrastructure platforms for automated host lifecycle management, fleet reliability/auto‑healing, E2E observability or data‑driven operations (...  ...languages such as Python, Go, Perl, or Ruby. Mentored other engineers and influenced technical direction through design reviews,... 

    NVIDIA Gruppe

    Santa Clara, CA
    1 day ago
  •  ...cybersecurity will depend on you Learn how Illumio approaches AI with integrity — view our Transparency Statement. Senior Backend Software Engineer (Python (Golang a plus)) Hybrid: 2 days in office/week in Sunnyvale, CA In this role, you will focus on the Azure Firewall... 
    Work at office
    2 days per week

    Illumio

    Sunnyvale, CA
    3 days ago
  • Site Reliability Engineer Onsite- Bay Area, CA Skills Relevant Skills and Experience What You’ll Do (Day-to-Day) Own and manage our cloud infrastructure (GCP or AWS, on-prem). Build, maintain, and optimize Kubernetes clusters (including GPU-backed clusters). Implement... 

    Amiri Recruiting

    Mountain View, CA
    9 days ago
  •  ...building an AI Data Center AIOps platform that turns raw, high‑volume telemetry into reliable, job‑centric insights and automation for GPU fleets. Join our team of innovative engineers who are building this platform and operating it (not the compute cluster): uptime, performance... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $152k - $241.5k

    Overview NVIDIA is looking for a Senior Site Reliability Engineer (SRE) to join our Compute Farm team and help build the next generation of our global services platform. The role focuses on keeping critical systems operational while leveraging AI technologies to deliver... 

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  •  ...Site Reliability Engineer, Enterprise Technology Services At Apple, groundbreaking ideas quickly transform into extraordinary products and services that delight millions worldwide. If you're passionate about engineering and operating robust, large-scale systems, imagine... 
    Worldwide
    Relocation

    Apple

    Sunnyvale, CA
    2 days ago
  • $180k - $260k

     ...effortless integration into customers' logistics operations. About the role We are seeking an experienced Senior/Staff Site Reliability Engineer to support the operation, monitoring, and scaling of our growing fleet of autonomous vehicles. In this role, you will work... 
    Odd job
    Work at office
    Remote work

    Gatik AI

    Mountain View, CA
    6 hours ago
  • $175k - $250k

     ...Staff Site Reliability Engineer Figure is an AI robotics company developing autonomous general-purpose humanoid robots. The goal of the company is to ship humanoid robots with human level intelligence. Its robots are engineered to perform a variety of tasks in the home... 
    Full time

    Figure

    Sunnyvale, CA
    6 hours ago
  • $217.57k - $260k

     ...Left Behind" to enable all people to have a secure digital identity. To learn more, visit Role Overview The Staff Site Reliability Engineer, Infrastructure role is building a high-scale infrastructure team responsible for owning environments with thousands of... 
    Full time
    Temporary work
    Work at office
    Remote work
    Flexible hours
    Shift work

    ID.me

    Mountain View, CA
    6 hours ago
  • $202k - $247k

     ...Principal Site Reliability Engineer At FortiCNAPP At Fortinet, we strive to provide a supportive, collaborative environment where people are empowered to do the best work of their careers. Our team members enjoy solving complex problems, and obsess over getting the... 
    Full time
    Worldwide

    Edelman

    Santa Clara, CA
    3 days ago
  • $252k - $308k

     ...Staff Site Reliability Engineer Mountain View, US About EarnIn As one of the first pioneers of earned wage access, our passion at EarnIn is building products that deliver real-time financial flexibility for those with the unique needs of living paycheck to paycheck... 
    Full time
    Work at office
    2 days per week

    Earnin

    Mountain View, CA
    3 days ago
  • $126k - $204.5k

     ...As part of this role, you will collaborate closely with our engineering teams to develop innovative solutions that provide clear and...  ...team to influence the operability of the product and ensure the reliability and availability of our services. Qualifications... 
    Full time
    Work at office

    Palo Alto Networks

    Santa Clara, CA
    4 days ago
  • $200k - $260k

     ...for enterprise trust, as we bring Work AI to every employee, in every company. About the Role: Glean is seeking a Site Reliability Engineering Lead to foster a culture of engineering excellence, drive technical strategy, and develop a high-performing,... 
    Work at office
    Home office
    Flexible hours

    Glean.info

    Mountain View, CA
    2 days ago
  •  ...Role Number: 200663929-3956 Summary We are seeking a proactive Site Reliability Engineer to champion the evolution of our production ecosystems. In this role, you will help drive the vision for our visibility, moving beyond simple uptime metrics to build a sophisticated... 
    Work experience placement
    Shift work

    Apple

    Sunnyvale, CA
    6 hours ago