Site Reliability Engineer — HPC & Automation (Silicon Engineering)

$125k - $150k
Full-time

SpaceX

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars.

At SpaceX we’re leveraging our experience in building rockets and spacecraft to deploy Starlink, the world’s most advanced broadband internet system. Starlink is the world’s largest satellite constellation and is providing fast, reliable internet to millions of users worldwide. We design, build, test, and operate all parts of the system – thousands of satellites, consumer receivers that allow users to connect within minutes of unboxing, and the software that brings it all together. We’ve only begun to scratch the surface of Starlink’s potential global impact and are looking for best-in-class engineers to help maximize Starlink’s utility for communities and businesses around the globe.

We are seeking a motivated, proactive, and intellectually curious engineer who will work alongside world-class cross-disciplinary teams (systems, firmware, architecture, design, validation, product engineering, ASIC implementation). As a Site Reliability Engineer on the Silicon Engineering team you will get the opportunity to design, operate, scale, and automate the high performance computing infrastructure we use to develop the chips powering the world’s largest satellite constellation and a global internet service. This position will have a meaningful impact on Starlink silicon by enabling faster design iterations, simulations, and regression turnaround times that gate how fast our chip teams can ship.

RESPONSIBILITIES:

  • Deploy, upgrade, operate, maintain, and scale our suite of clusters and services
  • Collaborate with engineers to develop automated, full turnkey solutions for silicon simulation workflows to speed up project timelines
  • Manage our underlying infrastructure as code and use modern observability tools to provide a complete picture of cluster and infrastructure health
  • Operate the continuous integration pipeline, build and release systems, and version control across the environment
  • Identify and eliminate performance bottlenecks using measurement and creative engineering

BASIC QUALIFICATIONS:

  • Bachelor’s degree in computer science, information systems, or an engineering discipline; OR 2+ years of professional experience in system administration, high performance computing, or site reliability engineering
  • 1+ years of development experience with Bash, Python, and/or other programming languages
  • 1+ years of experience with Linux operating systems

PREFERRED SKILLS AND EXPERIENCE:

  • Familiarity with containerization technologies (e.g., Docker, Kubernetes)
  • Knowledge of computer system concepts (computer architecture, computer organization, operating systems and concurrency)
  • Experience with databases and data modeling (e.g., MySQL, PostgreSQL, SQLite)
  • Networking knowledge of TCP/IP
  • Experience with high performance computing and workload managers (e.g., Slurm, LSF)
  • Experience with Terraform, Ansible, Puppet, or similar automation frameworks
  • Experience building monitoring and alerting as code (e.g., Grafana, Prometheus, custom exporters)
  • Experience with CI/CD automation at scale (e.g., Jenkins, Bamboo, build systems)
  • Experience with infrastructure as code (IaC) tools for managing fleets of servers
  • Experience using and building REST API clients/servers
  • Experience with enterprise/networked storage automation (e.g., NetApp ONTAP REST API/CLI, NFS)
  • Experience with ASIC design flows and tools (e.g., Cadence, Synopsys, Ansys, Keysight, Siemens)
  • Strong desire to find performance bottlenecks and performance improvement techniques
  • Excellent communication skills with the ability to communicate with customers, peers, management, etc. in both formal and informal situations
  • Ability to quickly learn new tools and frameworks
  • Interest in or experience with AI/LLM-assisted tooling (e.g., Grok, Claude Code)

ADDITIONAL REQUIREMENTS:

Ability to work extended hours and weekends as needed to meet critical milestones

COMPENSATION AND BENEFITS:

Pay Range:
  • Level 1: $125,000.00 - $150,000.00
  • Level 2: $145,000.00 - $175,000.00

Your actual level and base salary will be determined on a case-by-case basis and may vary based on the following considerations: job-related knowledge and skills, education, and experience.

Base salary is just one part of your total rewards package at SpaceX. You may also be eligible for long-term incentives, in the form of company stock or long-term cash awards, as well as potential discretionary bonuses and the ability to purchase additional stock at a discount through an Employee Stock Purchase Plan. You will also receive access to comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short and long-term disability insurance, life insurance, paid parental leave, and various other discounts and perks. You may also accrue 3 weeks of paid vacation and will be eligible for 10 or more paid holidays per year. Employees in Washington State accrue paid sick time in compliance with state and federal law.

Company shuttles are offered to employees for roundtrip travel from select Seattle locations to the SpaceX Redmond office Monday to Friday.

ITAR REQUIREMENTS:

To conform to U.S. Government export regulations, applicant must be a (i) U.S. citizen or national, (ii) U.S. lawful, permanent resident (aka green card holder), (iii) Refugee under 8 U.S.C. § 1157, or (iv) Asylee under 8 U.S.C. § 1158, or be eligible to obtain the required authorizations from the U.S. Department of State. Learn more about the ITAR here.

SpaceX is an Equal Opportunity Employer; employment with SpaceX is governed on the basis of merit, competence and qualifications and will not be influenced in any manner by race, color, religion, gender, national origin/ethnicity, veteran status, disability status, age, sexual orientation, gender identity, marital status, mental or physical disability or any other legally protected status.

Applicants wishing to view a copy of SpaceX’s Affirmative Action Plan for veterans and individuals with disabilities, or applicants requiring reasonable accommodation to the application/interview process, should reach out to SpaceX (the contact email address is available via click.appcast.io).

Vacancy posted 18 hours ago
