Infrastructure & Site Reliability Engineer

$125k - $195k

Atomicsemi

About Atomic Semi Atomic Semi is building a small, fast semiconductor fab. It’s already possible to build this with today’s technology and a few simplifications. We’ll build the tools ourselves so we can quickly iterate and improve. We’re building a small team of exceptional, hands-on engineers to make this happen. Mechanical, electrical, hardware, computer, and process. We’ll own the stack from atoms to architecture. Our team is optimistic about the future and we want to continue pushing the limits of technology. Smaller is better. Faster is better. Building it ourselves is better. We believe our team and lab can build anything. We’ve set up 3D printers, a wide array of microscopes, e-beam writers, general fabrication equipment - and whatever is missing, we’ll just invent along the way. Atomic was founded by Sam Zeloof and Jim Keller. Sam is best known for making chips in his garage, and Jim has been a leader in the semiconductor industry for the past 40 years. About the role We are seeking an Infrastructure & Site Reliability Engineer to design, build, deploy, and manage the on-prem backend infrastructure that powers our small, fast semiconductor fab. This is a broad role encompassing all aspects of backend infrastructure and services. Our philosophy towards infra is minimal, understandable, on-site, and close to the hardware. You won’t find much docker, cloud services, or kubernetes. Instead, there is a lot of bare-metal linux, systemd, and single file binaries. You’ll see a lot of rust and go and occasionally some python. We’re open to a range of experience levels — from exceptional early-career engineers to senior and staff-level builders. There isn’t a single background we’re optimizing for. What matters most is that you’ve already built real things, you love getting close to the metal, and show strong signs of engineering excellence. If you’re excited by performance engineering, building complex features from scratch, and learning new domains quickly, this is a great place for you. A portfolio or GitHub is generally required to apply: show us the things you’ve built! For us, a good portfolio includes evidence of strong engineering skills and curiosity. Responsibilities Design and implement light-weight, performant, and reliable software infrastructure to power a semiconductor fab. Procure, deploy and manage our fleet of on-prem servers, virtual machines, and single-board computers running on semiconductor fabrication equipment. Deploy and manage backend services, e.g., consul, vault, grafana, victoria-metrics, alertmanager, redpanda, vector, gitea, postgres Design and setup low level networking components, e.g., service discovery, DNS, reverse proxies, TLS, S3 compatible storage, VPNs Scale our observability platform: Build systems to ingest and display both traditional system metrics as well as high frequency telemetry from semiconductor fabrication equipment. Design and implement cross-site networking, replication and backups. Automate OS image creation and deployment and software build and deployment systems. Develop best practices and tools for security, authentication, authorization, and secrets management Help build our in-house infrastructure-as-code tool. Required Experience BS in Computer Science, Computer Engineering, or demonstrated exceptional skill in software engineering Deep experience in at least one statically-typed, compiled language Backend infrastructure or SRE experience Code portfolio – show us something you’ve built either professionally or personally. Could be a backend service, homelab setup script, etc. Nice-to-have Rust or go experience On-prem infra experience Working at Atomic Semi We’re an early-stage hardware startup with solid funding, world-class advisors, and a lab/office in San Francisco, CA. Compensation & Benefits Compensation: Atomic Semi is committed to fair and equitable compensation practices. The annual salary range for this role is $125,000 – $195,000. Compensation is determined based on your qualifications and experience. Our total compensation package also includes generous equity in Atomic Semi. Benefits: Atomic Semi offers the following benefits, subject to applicable eligibility requirements: Medical, Dental, and Vision insurance Generous Paid Time Off inclusive of Holidays and Sick Time Visa Sponsorship Life and Disability Insurance Paid Parental Leave 401(k) retirement plan Weekly Learning & Development opportunities Commuter Benefits including Parking and Late Night Uber rides from the office Lunches daily, Dinners 3x per week, Stocked Office Kitchen with Snacks and Spindrifts We are an equal-opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or other legally protected statuses. Export Control Analysis: This position involves access to technology that is subject to U.S. export controls. Any job offer made will be contingent upon the applicant’s capacity to serve in compliance with U.S. export controls. #J-18808-Ljbffr Atomicsemi

Apply

Vacancy posted 3 days ago

Similar jobs that could be interesting for youBased on the Infrastructure & Site Reliability Engineer in San Francisco, CA vacancy

Site Reliability Engineer, Frontier Systems Infrastructure
...and keep these hyperscale supercomputers reliable and efficient during the training of... ...models. About the Role We are looking for engineers to operate the next generation of... ...distributed systems engineering with hands-on infrastructure work on our largest datacenters. You...
Suggested
Dormont Manufacturing Co
San Francisco, CA
1 day ago
Site Reliability Engineer - AI Infrastructure
Site Reliability Engineer - AI Infrastructure Location: Global Remote / San Francisco · Full-Time About Andromeda Andromeda Cluster was founded by Nat Friedman and Daniel Gross to give early-stage startups access to the kind of scaled AI infrastructure once reserved only...
Suggested
Full time
Remote work
Andromeda Cluster
San Francisco, CA
3 days ago
Site Reliability Engineer (Senior or Staff), Infrastructure Security
$127k - $249k
Senior / Staff Engineer - SRE, InfraSec We are looking for an experienced Senior or Staff Engineer for our SRE, InfraSec team to guide the security of our cloud‑based infrastructure. You will be highly hands‑on technically while also mentoring a small team of SREs. The...
Suggested
Local area
Remote work
The Consulting Solutions
San Francisco, CA
3 days ago
Site Reliability Engineer
...About the role We’re hiring an SRE to join our engineering team at Plenful and take ownership of the reliability and performance of the systems that power our... ...improvements are fully implemented and measured. Infrastructure and Platform Operations: Improve the...
Suggested
Work at office
Remote work
Flexible hours
2 days per week
Plenful
San Francisco, CA
4 days ago
SRE/Infrastructure Engineer
...B run millions of sandboxes. Today our infrastructure runs on Nomad and Terraform across Google... .... We're looking for an infrastructure engineer who actually wants to live in Terraform... ...growing startup with in-person (4 days on-site, 1 day WFH) offices in San Francisco and...
Suggested
Live in
Work from home
Dormont Manufacturing Co
San Francisco, CA
2 days ago
Site Reliability Engineer (SRE) - AI Infrastructure
$300k
...work at the intersection of hyperscale infrastructure and AI, shaping the operational... ...petascale, and be part of a founding engineering team, this is the place to do it.... ...infrastructure-as-code, CI/CD pipelines, and reliability standards across thousands of nodes....
Permanent employment
Flexible hours
San Francisco, CA
more than 2 months ago
Senior Site Reliability Engineer
...About the job Senior Site Reliability Engineer About the Company Stellar is a decentralized, public blockchain that gives developers... ...reliability and scalability of our systems, design and improve the infrastructure behind our production environments, and automate...
TechChain Talent
San Francisco, CA
1 day ago
Site Reliability Engineer (SRE)
$170k - $250k
...Site Reliability Engineer (SRE) Location: San Francisco, CA / Palo Alto, CA Company Stage of Funding: Growth-Stage AI Infrastructure Company ($80M Raised) Office Type: Onsite (4 Days Per Week) Salary: $170,000-$250,000 + Competitive Equity Company Description...
Work at office
Visa sponsorship
Flexible hours
Recruiting from Scratch
San Francisco, CA
2 days ago
Site Reliability Engineer
...continue to scale. About the role We're hiring a Site Reliability Engineer (SRE) to ensure the reliability, performance, and scalability... ...on operating real systems at scale - not just building infrastructure, but deeply understanding how it behaves under load,...
Work at office
Remote work
Flexible hours
2 days per week
Plenful
San Francisco, CA
21 hours ago
Site Reliability Engineer (SRE)
...Site Reliability Engineer (SRE) FLUIX is building the AI operating system that plans, designs, and optimizes AI infrastructure. We are based in Silicon Valley. We specialize in providing AI-driven solutions for data centers and power providers, leveraging cutting-edge...
Work at office
Weekend work
Fluix AI
San Francisco, CA
21 hours ago
Site Reliability Engineer
$200k - $300k
...Site Reliability Engineer Title of Role: Site Reliability Engineer Location: San Francisco, onsite Company Stage of Funding: Venture... .... What You Will Do Design and implement robust infrastructure solutions to support scalable applications in a fast-paced...
Work at office
Recruiting from Scratch
San Francisco, CA
5 days ago
Senior Site Reliability Engineer
...significantly outperforms individual engineers. We combine language models with human... ...Role: We are seeking an experienced Site Reliability Engineer to join our Platform... ...automation platforms, and owning the infrastructure that powers our AI-driven analysis engine...
CodeRabbit
San Francisco, CA
2 days ago
Site Reliability Engineer
$150k - $250k
...Site Reliability Engineer role USC or GC only are considered at this time. San Francisco - Local to Bay area only but role... ...see candidates who can jump in and own this piece of our infrastructure! What our team says about this role Client is...
Work experience placement
Casual work
Local area
Immediate start
Remote work
3B Staffing LLC
San Francisco, CA
21 hours ago
Site Reliability Engineer
...SRE @ Clay In this role, you'll join our growing infrastructure team in building and fine-tuning our infrastructure to keep our services... ...ensure we achieve the right balance of developer velocity, reliability and performance, and cost efficiency. What You'll Bring...
clay.global
San Francisco, CA
4 days ago
Site Reliability Engineer
$150k
About The Role We are seeking an experienced Site Reliability Engineer (SRE) with a strong focus on DevSecOps to join our growing engineering... ..., security posture, and operational hygiene of our cloud infrastructure, APIs, and software supply chain. You will drive patch...
VantageScore
San Francisco, CA
21 hours ago
Site Reliability Engineer II
$98.58k - $138.02k
...Site Reliability Engineer II Restaurant365 is a SaaS company disrupting the restaurant industry! Our cloud-based platform provides a unique... ..., enhancing, and maintaining Restaurant365's cloud infrastructure and applications. Qualified candidates will demonstrate growing...
Work at office
Restaurant365
San Francisco, CA
4 days ago
Site Reliability Engineer
...Site Reliability Engineer We are looking for a dynamic engineer to join our rapidly growing SRE team. As an SRE, you will report to our... ...on cutting edge technologies. We operate a hybrid cloud infrastructure. As such you will be expected to work with public cloud providers...
Relocation package
1872 Consulting
San Francisco, CA
21 hours ago
Site Reliability Engineer
..., CA (5 Days In-Office) You are the infrastructure expert who enables our rapid product development... .... What We Look for in a Great Engineer You have the intensity and technical... ...release while maintaining the highest reliability. DevX Support: Support Developer...
Work at office
Latent
San Francisco, CA
2 days ago
Site Reliability Engineer
$113.4k - $162k
...Site Reliability Engineer San Francisco, CA We believe communication belongs to everyone. We exist to democratize phone service. TextNow... ...is looking for motivated Site Reliability Engineer to own infrastructure, monitoring, logging, ci/cd, reliability and everything in...
Temporary work
TextNow
San Francisco, CA
1 day ago
Senior Site Reliability Engineer
$160k - $250k
...integrating GPUs. Even with these data centers, we maintain a hybrid infrastructure with public clouds when the right fit. As we continue to... ...learning models, we also need to grow our DevOps and Site Reliability team to maintain the reliability of our enterprise SaaS...
Hive
San Francisco, CA
21 hours ago
Senior Site Reliability Engineer
...Computer Science - is poised to redefine computing. About the Role We're seeking a Site Reliability Engineer to ensure Hyperbolic's GPU marketplace and AI infrastructure operate with exceptional reliability, performance, and security. As an aggregator of compute...
Hyperbolic Labs
San Francisco, CA
21 hours ago
Sr. Site Reliability Engineer
$106k - $130k
...employment Visa sponsorship. Overall Purpose To create and maintain the next generation of application infrastructure and to be responsible for reliability, automation and scalability using and the latest best practices. Essential Functions Implement...
Hourly pay
Work experience placement
Work at office
Immediate start
Visa sponsorship
Work visa
Flexible hours
Early Warning Services
San Francisco, CA
4 days ago
Site Reliability Engineer (SRE)
$170k - $230k
...Site Reliability Engineer (SRE) Palo Alto / San Francisco Bay Area About Mithril Mithril is an AI infrastructure platform built to make GPU compute more accessible and affordable for the world's leading enterprises, AI startups, and the AI research community,...
Work at office
Local area
1 day per week
Mithril
San Francisco, CA
21 hours ago
Senior Infrastructure & Reliability Engineer - AI Platform
$157.7k - $277.8k
...time Location Type Hybrid Department Engineering, product & design Compensation SF &... ...must be available, performant, and reliable, 24/7. As an Infrastructure engineer, you'll be at the heart of... ...stipend Company-wide off-sites and team off-sites Competitive compensation...
Full time
Work at office
Local area
Flexible hours
Writer
San Francisco, CA
1 day ago
Senior Manager, Site Reliability Engineering - Infrastructure Platform
$232k - $319k
...secures AI by building the trusted, neutral infrastructure that enables organizations to safely... ...the service with great people and reliable, cost-effective, and efficient infrastructure... ...with architects and product engineering Build a world-class observability platform...
Permanent employment
Local area
Worldwide
Flexible hours
Okta
San Francisco, CA
more than 2 months ago
Senior Software Engineer - Site Reliability Engineering
...Udaip Cloud-Based Data And Ai Platform Engineer At U.S. Bank, we're on a journey to do our best. Helping the customers and businesses... ...level, UDAIP success includes industry leading hybrid cloud infrastructure, innovative data capabilities with leverage of many open-...
Temporary work
Work experience placement
Phenom People
San Francisco, CA
21 hours ago
Senior Site Reliability Engineer
US Corp. is seeking a Lead Site Reliability Engineer to spearhead our mission of delivering highly available and performant systems. With an... ...be responsible for designing and implementing automated infrastructure using Terraform, managing containerized workloads within...
Axiom Pursuits
San Francisco, CA
1 day ago
Senior Site Reliability Engineer
...onboard services and teams to the reliability tenets. Establish and maintain Service... ...scalable, reliable, and secure infrastructure, ensuring cloud‑native best practices... ...equivalent. 6+ years of experience in Site Reliability Engineering, managing infrastructure and...
OutSystems, Inc.
San Francisco, CA
1 day ago
Site Reliability Engineer
About HappyRobot HappyRobot is the infrastructure for enterprises to build and orchestrate AI... ...Role We're looking for an Infrastructure Engineer to take the lead on scaling our... ...high-trust role where you’ll shape how reliability is done - reducing incident load, building...
Worldwide
Shift work
Happyrobot Inc.
San Francisco, CA
1 day ago
Senior Site Reliability Engineer
$166.9k - $225.9k
...team operates as both a central engineering function and an embedded reliability practice. You'll be part of a close... ...we approach reliability. Our infrastructure runs on AWS across multiple accounts... ...bring 6+ years of experience in Site Reliability Engineering, Cloud...
Flexible hours
Drata
San Francisco, CA
2 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Infrastructure & Site Reliability Engineer. Be the first to apply!