Infrastructure & Site Reliability Engineer
$125k - $195k
Atomic Semi
About Atomic Semi

Atomic Semi is building a small, fast semiconductor fab. It’s already possible to build this with today’s technology and a few simplifications. We’ll build the tools ourselves so we can quickly iterate and improve.

We’re building a small team of exceptional, hands-on engineers to make this happen: mechanical, electrical, hardware, computer, and process. We’ll own the stack from atoms to architecture.

Our team is optimistic about the future and we want to continue pushing the limits of technology. Smaller is better. Faster is better. Building it ourselves is better.

We believe our team and lab can build anything. We’ve set up 3D printers, a wide array of microscopes, e-beam writers, and general fabrication equipment, and whatever is missing, we’ll just invent along the way.

Atomic was founded by Sam Zeloof and Jim Keller. Sam is best known for making chips in his garage, and Jim has been a leader in the semiconductor industry for the past 40 years.

About the role

We are seeking an Infrastructure & Site Reliability Engineer to design, build, deploy, and manage the on-prem backend infrastructure that powers our small, fast semiconductor fab. This is a broad role encompassing all aspects of backend infrastructure and services.

Our philosophy towards infra is minimal, understandable, on-site, and close to the hardware. You won’t find much Docker, cloud services, or Kubernetes. Instead, there is a lot of bare-metal Linux, systemd, and single-file binaries. You’ll see a lot of Rust and Go and occasionally some Python.

We’re open to a range of experience levels, from exceptional early-career engineers to senior and staff-level builders. There isn’t a single background we’re optimizing for. What matters most is that you’ve already built real things, you love getting close to the metal, and you show strong signs of engineering excellence.
If you’re excited by performance engineering, building complex features from scratch, and learning new domains quickly, this is a great place for you. A portfolio or GitHub is generally required to apply: show us the things you’ve built! For us, a good portfolio includes evidence of strong engineering skills and curiosity.

Responsibilities

- Design and implement lightweight, performant, and reliable software infrastructure to power a semiconductor fab.
- Procure, deploy, and manage our fleet of on-prem servers, virtual machines, and single-board computers running on semiconductor fabrication equipment.
- Deploy and manage backend services, e.g., Consul, Vault, Grafana, VictoriaMetrics, Alertmanager, Redpanda, Vector, Gitea, Postgres.
- Design and set up low-level networking components, e.g., service discovery, DNS, reverse proxies, TLS, S3-compatible storage, VPNs.
- Scale our observability platform: build systems to ingest and display both traditional system metrics and high-frequency telemetry from semiconductor fabrication equipment.
- Design and implement cross-site networking, replication, and backups.
- Automate OS image creation and deployment, as well as software build and deployment systems.
- Develop best practices and tools for security, authentication, authorization, and secrets management.
- Help build our in-house infrastructure-as-code tool.

Required Experience

- BS in Computer Science, Computer Engineering, or demonstrated exceptional skill in software engineering
- Deep experience in at least one statically typed, compiled language
- Backend infrastructure or SRE experience
- Code portfolio: show us something you’ve built, either professionally or personally. It could be a backend service, a homelab setup script, etc.

Nice-to-have

- Rust or Go experience
- On-prem infra experience

Working at Atomic Semi

We’re an early-stage hardware startup with solid funding, world-class advisors, and a lab/office in San Francisco, CA.
Compensation & Benefits

Compensation: Atomic Semi is committed to fair and equitable compensation practices. The annual salary range for this role is $125,000 – $195,000. Compensation is determined based on your qualifications and experience. Our total compensation package also includes generous equity in Atomic Semi.

Benefits: Atomic Semi offers the following benefits, subject to applicable eligibility requirements:

- Medical, Dental, and Vision insurance
- Generous Paid Time Off inclusive of Holidays and Sick Time
- Visa Sponsorship
- Life and Disability Insurance
- Paid Parental Leave
- 401(k) retirement plan
- Weekly Learning & Development opportunities
- Commuter Benefits including Parking and Late Night Uber rides from the office
- Lunches daily, Dinners 3x per week, Stocked Office Kitchen with Snacks and Spindrifts

We are an equal-opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or other legally protected statuses.

Export Control Analysis: This position involves access to technology that is subject to U.S. export controls. Any job offer made will be contingent upon the applicant’s capacity to serve in compliance with U.S. export controls.

