Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Sr. SRE Platform Architect

Full-time

Bitdeer Technologies Group

About Bitdeer Technologies Group

Bitdeer is a world-leading technology company for AI and Bitcoin mining infrastructure.

Bitdeer is committed to providing comprehensive Bitcoin mining solutions for its customers and building AI computational infrastructure to support the AI revolution. Bitdeer handles complex processes involved in computing such as equipment procurement, transport logistics, data center design and construction, equipment management, and daily operations. Bitdeer also offers advanced cloud capabilities to customers with high demand for artificial intelligence.

Headquartered in Singapore, Bitdeer has deployed data centers across multiple countries, including the United States, Norway, Bhutan, and Ethiopia.

Position Overview

Bitdeer is seeking a visionary and hands-on Cloud SRE Architect to lead the design, development, and evolution of our next-generation public cloud platform. This role will oversee the end-to-end architecture across CPU, GPU, RDS, storage, networking, serverless, and AI services, ensuring global scalability, reliability, and performance. The ideal candidate is a strategic thinker with deep technical expertise in cloud infrastructure, platform engineering and AI systems, capable of bridging architecture vision with real-world engineering execution. You will collaborate closely with cross-functional teams and global partners to define our cloud technology roadmap, optimize multi-region deployments, and deliver world-class infrastructure and platform solutions that power large-scale AI and enterprise workloads.

Key Responsibilities

Own the end-to-end architecture of the NeoCloud SRE platform — the substrate that observes, protects, and operates a multi-region GPU rental fleet across self-built and OEM-rented data centers. You are the single point of architectural accountability across the platform's ~57 bounded contexts, ~12 frameworks, and three operational tiers (Edge DC → Regional Controller → Global Hub).

This role is for someone who writes the design, defends it under review, and shepherds it through the engineering squads that build it.

What You'll Do

  1. Write  and maintain the platform architecture document — keep the design coherent across all sections, frameworks, and tiers. The current document is your starting point.
  2. Review every framework-level change — new bounded context, new plugin kind, tier-deployment shift, schema change, naming change, cross-context contract change. Architecture changes ride GitOps PRs like any other artifact.
  3. Set design invariants — residency rules (raw data stays in Region), Tier 2 self-sufficiency budget (≥ 24 h), survival-uplink contracts, naming conventions, SLO catalogues, redaction-at-boundary rules.
  4. Run the plugin framework — every extension uses one uniform contract (Common + Domain manifest, lifecycle, observability). You author and evolve this contract.
  5. Decide tier placement — what runs at Edge DC vs Regional Controller vs Global Hub, with data-residency / compliance / availability tradeoffs explicit.
  6. Coordinate  with cloud-service teams and tenants — they author plugins, SDKs, dashboards, agent recipes that ride the platform. You set the contracts they consume.
  7. Coordinate with Security — joint ownership of vulnerability management, exposure management, joint operations. Security owns policy and risk acceptance; you own the operational mechanisms they ride.
  8. Pre-flight   roadmap items — for any new capability, produce a one-page design that fits the existing layered model (L1–L6), tier topology, naming conventions, and extension contracts before implementation starts.
  9. Defend   the design under review — say no to scope creep, special-case workarounds, and one-off integrations that don't fit the framework model. Say yes when a new plugin kind is genuinely needed.

Qualifications

  • 10+years of production SRE / platform-engineering / infra-architecture, including ≥ 3 years at architect level.
  •   Hands-on  with GPU / AI-compute infrastructure — NVIDIA GPU ops (DCGM, MIG, vGPU, NVLink/NVSwitch, XID semantics, NCCL), InfiniBand or RoCE fabrics (subnet manager, fabric partitioning, optical health), HPC storage (Lustre, NetApp/Pure/DDN/VAST, NVMe-oF).
  • Multi-region  observability at scale — metrics / logs / traces / profiles / analytics-lake substrate; recording rules, MWMBR burn-rate alerting, SLI/SLO discipline.
  • Cluster platforms  — first-hand experience with Kubernetes (control plane + GPU Operator + topology-aware scheduling) AND at least one of Slurm / Volcano / Kueue / Ray / KubeRay.
  • Data-center operations  — ZTP, BMC/IPMI/Redfish, BIOS/firmware lifecycle, RMA, multi-vendor OEM management (self-built + leased DC mix).
  • Strong DDD instincts — bounded contexts, public contracts, no shared databases, one-context-one-repo discipline.
  • Plugin framework design — you have built (or substantively contributed to) a real extension framework with a uniform manifest + lifecycle.
  • Writing fluency — you can author and maintain a multi-thousand-line architecture document under review without it drifting; you can also write a one-pager an executive will read.
  • Cross-team operating tempo  — design reviews, runbook authorship, on-call shadowing, post-mortem facilitation
  • Hyperscale or NeoCloud experience
  • BS/MS in Computer Science or similar

--------------------------------------------------------------------

Bitdeer is committed to providing equal employment opportunities in accordance with country, state, and local laws. Bitdeer does not discriminate against employees or applicants based on conditions such as race, color, gender identity and/or expression, sexual orientation, marital and/or parental status, religion, political opinion, nationality, ethnic background or social origin, social status, disability, age, indigenous status, and union.

Vacancy posted 7 hours ago
Similar jobs that could be interesting for youBased on the Sr. SRE Platform Architect in San Jose, CA vacancy
  •  ...Job Title Chief Architect / Sr. Architect – Distributed Cloud Job Overview F5 Distributed Cloud...  ...execution across our entire distributed SaaS platform. This senior leadership role is...  ...among product, security, infrastructure, SRE, and development teams to ensure coherent... 
    Platform
    Senior
    Local area
    Remote work

    F5 Networks

    San Jose, CA
    3 days ago
  • $268.8k - $403.2k

     ...individual can thrive. Job Title: Chief Architect / Sr. Architect - Distributed Cloud...  ...execution across our entire distributed SaaS platform. This is a senior leadership role responsible...  ...product, security, infrastructure, SRE, and development teams to ensure coherent... 
    Platform
    Senior
    Local area
    Remote work
    Home office

    F5

    San Jose, CA
    3 days ago
  • $185.9k - $278.9k

     ...Engineering Overview Qualcomm is looking for an experienced SoC architect to work on the next generation AI products in the datacenter....  ...Support multiple clients using VMs on the same HW platform Leverage industry best practices for security and isolation between... 
    Platform
    Senior
    Work experience placement
    Work from home

    Qualcomm

    Santa Clara, CA
    5 days ago
  • Senior DevOps & SRE Manager - Platform Reliability & Global Operations A senior technical leader responsible for reliability, scalability, security, and operational excellence of a complex, multi‑platform ecosystem spanning applications, workflows, event streaming, and... 
    Platform
    Senior
    Work at office
    3 days per week

    Qcells North America

    Santa Clara, CA
    5 days ago
  • $100k

     ...must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of...  ...systems, and software. We are seeking a CPU Performance Modeling Architect to help shape the next generation of high-performance RISC-V CPUs... 
    Platform
    Senior
    Permanent employment
    Full time

    Tenstorrent

    Santa Clara, CA
    4 days ago
  • $284.9k - $427.3k

     ...applications. We are looking for a highly experienced Server Product Architect to define the architecture of a Server SoC that meets critical...  .... Key Responsibilities Collaborate with chip and platform architects to define and develop product and SoC architecture... 
    Platform
    Senior
    Work from home

    Jobleads-US

    Santa Clara, CA
    6 days ago
  • $190.61k - $361.48k

     ...CPU Performance Architect The Role and Impact: As a CPU Performance Architect, you will play a pivotal role in shaping the future...  ...Santa Clara, US, Oregon, Hillsboro Business group: Silicon and Platform Engineering Group (SPE): Deliver breakthrough silicon and platform... 
    Platform
    Senior
    Local area
    Immediate start
    Shift work

    Chandler Chamber of Commerce

    Santa Clara, CA
    5 days ago
  •  ...NetSuite, SAP, Oracle, Workday, ServiceNow, Coupa, and similar platforms. ~ Experience with REST, SOAP, JSON, XML, OAuth, SFTP, EDI,...  ...engagements. Preferred Qualifications Boomi Professional Architect and/or Workato certifications. Experience with AI-driven... 
    Platform
    Senior

    Jade Global

    San Jose, CA
    1 day ago
  • 42dot is seeking a talented Sr. Staff Firmware Engineer in Sunnyvale, CA to design and implement critical secure boot systems for next-generation software-defined vehicles. The role requires expertise in embedded C programming and hardware security configurations. Applicants... 
    Platform
    Senior

    42dot

    Sunnyvale, CA
    2 days ago
  • A leading technology company is looking for a Java SRE Engineer to support large-scale cloud migrations and production systems on AWS...  ...and Kubernetes. You will lead migrations, design robust AWS EKS platforms, and implement deployment strategies. The ideal candidate has... 
    Platform
    Senior

    EITACIES Inc.

    Santa Clara, CA
    3 days ago
  • $356.5k

     ...NVIDIA Gruppe is seeking a Senior Software Architect in Santa Clara, California. This role involves co-designing next-generation data center platforms and developing scalable communications software to enhance Deep Learning and HPC applications. Candidates should have... 
    Platform
    Senior

    Jobleads-US

    Santa Clara, CA
    6 days ago
  • $189.24k - $266.76k

    42dot Inc. in Sunnyvale, United States is seeking a Sr. Staff Firmware Engineer to architect a next-generation OTA update framework for the Software-Defined Vehicle platform. This role emphasizes on-device software design for secure, high-availability firmware solutions... 
    Platform
    Senior

    42dot Inc.

    Sunnyvale, CA
    2 days ago
  • $212k - $386.3k

    Apple Inc. is seeking a senior professional in Machine Learning and AI to enhance user experiences through LLM-based question answering and generative AI features. The ideal candidate will have over 10 years of R&D experience in search and NLP, alongside a relevant MS or...
    Platform
    Senior

    Apple Inc.

    Santa Clara, CA
    6 days ago
  • Palo Alto Networks, Inc. is seeking a Principal Software Engineer to architect and enhance the observability platform across various systems and workflows. This role focuses on deep technical leadership, leveraging AI-embedded solutions and open-source technologies for... 
    Platform
    Senior

    Palo Alto Networks

    Santa Clara, CA
    4 days ago
  • $232k - $368k

    NVIDIA Gruppe is looking for a Senior Power and Performance Architect in Santa Clara, California. The role involves designing innovative...  ...solutions and optimizing power systems across varied platforms and products. Ideal candidates will have over 15 years of experience... 
    Platform
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    6 days ago
  • $189.24k - $266.76k

    42dot Inc. is looking for a Sr. Staff Firmware Engineer in Sunnyvale, California. In this position, you will design and implement...  ...ensuring high performance and reliability. Responsibilities include architecting the secure boot-chain and managing hardware isolation... 
    Platform
    Senior

    42dot Inc.

    Sunnyvale, CA
    3 days ago
  •  ...is looking for a Distinguished Technologist Mechanical Engineer to lead the development of complex networking systems and chassis platforms. The role requires over 15 years of experience in product development and a BS in Mechanical Engineering. The ideal candidate will... 
    Platform
    Senior

    Hewlett Packard Enterprise Development LP

    Sunnyvale, CA
    4 days ago
  • 42dot is seeking a Sr. Staff Firmware Engineer to develop next-generation firmware platforms for Hyundai's Software-Defined Vehicles. You will be involved in designing and implementing key components, ensuring high performance and safety. The ideal candidate has over 8... 
    Platform
    Senior

    42dot

    Sunnyvale, CA
    3 days ago
  • $176k - $333.5k

    NVIDIA Corporation in Santa Clara is seeking a Site Reliability Engineer (SRE) to design and maintain large-scale production systems focusing on reliability and observability. Candidates should have a BS in Computer Science or related field and 8+ years' experience in infrastructure... 
    Platform
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  • $211.8k - $317.8k

    Qualcomm in Santa Clara, California is seeking a Software Engineer to design and develop embedded and cloud edge software. You will work on implementing firmware for Qualcomm’s upcoming products and collaborate with hardware and firmware teams. The role requires a Bachelor...
    Platform
    Senior

    Qualcomm

    Santa Clara, CA
    6 days ago
  • $168k - $322k

    NVIDIA Corporation is looking for a Senior AI Platform Engineer in Santa Clara to build and maintain next-generation AI-powered products. The role focuses on defining AI-native infrastructure, scaling LLM/ML systems, and ensuring reliability across platforms. The ideal... 
    Platform
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    5 days ago
  • Nari is seeking an Electrical Systems Architect to lead the platform-level electronics architecture at their headquarters in Santa Clara, CA. This role involves defining the architecture for Agilent’s LC/MS products, balancing performance and reliability while collaborating... 
    Platform
    Senior

    Nari

    Santa Clara, CA
    2 days ago
  • $224k - $356.5k

    NVIDIA Gruppe in Santa Clara is looking for a strong technical leader for its DriveOS software architecture group. The ideal candidate will have in-depth knowledge of complex systems, solid experience in Embedded Systems, and a Master's degree, with 12+ years of relevant...
    Platform
    Senior
    Work experience placement

    Jobleads-US

    Santa Clara, CA
    2 days ago
  • $320k

    NVIDIA Gruppe in Santa Clara is seeking a Distinguished Software Architect to lead the design of next-generation data center platforms. This role demands deep expertise in HPC and networking, aiming to improve GPU communication technologies. You will research and implement... 
    Platform
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    6 days ago
  •  ...Ventures is looking for a Senior Staff SI/PI Engineer responsible for ensuring the electrical integrity of high-performance AI compute platforms. You will own the SI/PI strategy for next-generation AI accelerators and lead complex multi-chip package modeling. The ideal... 
    Platform
    Senior

    Entrada Ventures

    Santa Clara, CA
    3 days ago
  • Intel Corporation is seeking a Senior SoC Chiplet Architect in Santa Clara, CA. This role will define and lead the architecture strategy...  ..., and driving technical alignment across architecture and platform teams. A Bachelor's in Electrical Engineering and extensive SoC... 
    Platform
    Senior

    Intel Corporation

    Santa Clara, CA
    3 days ago
  • NVIDIA Gruppe in Santa Clara is seeking a Senior Systems Engineer to lead advancements in high-speed sensor streaming on the Holoscan platform. This role involves collaboration with top SDK developers and will require solid experience in C/C++/Python. The ideal candidate... 
    Platform
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    6 days ago
  • Digital Technologies, LLC is looking for experts to design, integrate, and deploy IAM products, particularly involving the Saviynt platform. Ideal candidates will have over 10 years of experience in enterprise software development, with a strong focus on identity and... 
    Platform
    Senior

    Digital Technologies Inc

    Santa Clara, CA
    5 days ago
  •  ...seeking a Senior Hardware Engineer to develop solutions for GPU products. You will collaborate in launching new GPU Accelerated Server Platforms optimized for AI and analytics. Your responsibilities include developing diagnostic tests, defining manufacturing screens, and... 
    Platform
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    5 days ago
  • $254.34k - $310.86k

    SiFive, Inc. in Santa Clara is seeking an experienced SoC architect to lead the development of high-performance system IPs, including...  ...and memory controllers. The successful candidate will define platform security requirements, collaborate with cross-functional teams... 
    Platform
    Senior

    SiFive, Inc.

    Santa Clara, CA
    5 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Sr. SRE Platform Architect. Be the first to apply!