Infrastructure & Site Reliability Engineer
$125k - $195k
Atomic Semi
About Atomic Semi
Atomic Semi is building a small, fast semiconductor fab. It’s already possible to build this with today’s technology and a few simplifications. We’ll build the tools ourselves so we can quickly iterate and improve.

We’re building a small team of exceptional, hands-on engineers to make this happen. Mechanical, electrical, hardware, computer, and process. We’ll own the stack from atoms to architecture.

Our team is optimistic about the future and we want to continue pushing the limits of technology. Smaller is better. Faster is better. Building it ourselves is better.

We believe our team and lab can build anything. We’ve set up 3D printers, a wide array of microscopes, e-beam writers, general fabrication equipment - and whatever is missing, we’ll just invent along the way.

Atomic was founded by Sam Zeloof and Jim Keller. Sam is best known for making chips in his garage, and Jim has been a leader in the semiconductor industry for the past 40 years.

About the role
We are seeking an Infrastructure & Site Reliability Engineer to design, build, deploy, and manage the on-prem backend infrastructure that powers our small, fast semiconductor fab. This is a broad role encompassing all aspects of backend infrastructure and services.

Our philosophy towards infra is minimal, understandable, on-site, and close to the hardware. You won’t find much Docker, cloud services, or Kubernetes. Instead, there is a lot of bare-metal Linux, systemd, and single-file binaries. You’ll see a lot of Rust and Go and occasionally some Python.

We’re open to a range of experience levels, from exceptional early-career engineers to senior and staff-level builders. There isn’t a single background we’re optimizing for. What matters most is that you’ve already built real things, you love getting close to the metal, and you show strong signs of engineering excellence.
If you’re excited by performance engineering, building complex features from scratch, and learning new domains quickly, this is a great place for you.

A portfolio or GitHub is generally required to apply: show us the things you’ve built! For us, a good portfolio includes evidence of strong engineering skills and curiosity.

Responsibilities
- Design and implement lightweight, performant, and reliable software infrastructure to power a semiconductor fab.
- Procure, deploy, and manage our fleet of on-prem servers, virtual machines, and single-board computers running on semiconductor fabrication equipment.
- Deploy and manage backend services, e.g., consul, vault, grafana, victoria-metrics, alertmanager, redpanda, vector, gitea, postgres.
- Design and set up low-level networking components, e.g., service discovery, DNS, reverse proxies, TLS, S3-compatible storage, VPNs.
- Scale our observability platform: build systems to ingest and display both traditional system metrics and high-frequency telemetry from semiconductor fabrication equipment.
- Design and implement cross-site networking, replication, and backups.
- Automate OS image creation and deployment, as well as software build and deployment systems.
- Develop best practices and tools for security, authentication, authorization, and secrets management.
- Help build our in-house infrastructure-as-code tool.

Required Experience
- BS in Computer Science, Computer Engineering, or demonstrated exceptional skill in software engineering
- Deep experience in at least one statically-typed, compiled language
- Backend infrastructure or SRE experience
- Code portfolio: show us something you’ve built either professionally or personally. Could be a backend service, homelab setup script, etc.

Nice-to-have
- Rust or Go experience
- On-prem infra experience

Working at Atomic Semi
We’re an early-stage hardware startup with solid funding, world-class advisors, and a lab/office in San Francisco, CA.
Compensation & Benefits
Compensation: Atomic Semi is committed to fair and equitable compensation practices. The annual salary range for this role is $125,000 – $195,000. Compensation is determined based on your qualifications and experience. Our total compensation package also includes generous equity in Atomic Semi.

Benefits: Atomic Semi offers the following benefits, subject to applicable eligibility requirements:
- Medical, Dental, and Vision insurance
- Generous Paid Time Off inclusive of Holidays and Sick Time
- Visa Sponsorship
- Life and Disability Insurance
- Paid Parental Leave
- 401(k) retirement plan
- Weekly Learning & Development opportunities
- Commuter Benefits including Parking and Late Night Uber rides from the office
- Lunches daily, Dinners 3x per week, Stocked Office Kitchen with Snacks and Spindrifts

We are an equal-opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or other legally protected statuses.

Export Control Analysis: This position involves access to technology that is subject to U.S. export controls. Any job offer made will be contingent upon the applicant’s capacity to serve in compliance with U.S. export controls.



