Sr. SRE Platform Architect
Bitdeer Technologies Group
About Bitdeer Technologies Group
Bitdeer is a world-leading technology company for AI and Bitcoin mining infrastructure.
Bitdeer is committed to providing comprehensive Bitcoin mining solutions for its customers and building AI computational infrastructure to support the AI revolution. Bitdeer handles complex processes involved in computing such as equipment procurement, transport logistics, data center design and construction, equipment management, and daily operations. Bitdeer also offers advanced cloud capabilities to customers with high demand for artificial intelligence.
Headquartered in Singapore, Bitdeer has deployed data centers across multiple countries, including the United States, Norway, Bhutan, and Ethiopia.
Position Overview
Bitdeer is seeking a visionary and hands-on Cloud SRE Architect to lead the design, development, and evolution of our next-generation public cloud platform. This role will oversee the end-to-end architecture across CPU, GPU, RDS, storage, networking, serverless, and AI services, ensuring global scalability, reliability, and performance. The ideal candidate is a strategic thinker with deep technical expertise in cloud infrastructure, platform engineering and AI systems, capable of bridging architecture vision with real-world engineering execution. You will collaborate closely with cross-functional teams and global partners to define our cloud technology roadmap, optimize multi-region deployments, and deliver world-class infrastructure and platform solutions that power large-scale AI and enterprise workloads.
Key Responsibilities
Own the end-to-end architecture of the NeoCloud SRE platform — the substrate that observes, protects, and operates a multi-region GPU rental fleet across self-built and OEM-rented data centers. You are the single point of architectural accountability across the platform's ~57 bounded contexts, ~12 frameworks, and three operational tiers (Edge DC → Regional Controller → Global Hub).
This role is for someone who writes the design, defends it under review, and shepherds it through the engineering squads that build it.
What You'll Do
- Write and maintain the platform architecture document — keep the design coherent across all sections, frameworks, and tiers. The current document is your starting point.
- Review every framework-level change — new bounded context, new plugin kind, tier-deployment shift, schema change, naming change, cross-context contract change. Architecture changes ride GitOps PRs like any other artifact.
- Set design invariants — residency rules (raw data stays in Region), Tier 2 self-sufficiency budget (≥ 24 h), survival-uplink contracts, naming conventions, SLO catalogues, redaction-at-boundary rules.
- Run the plugin framework — every extension uses one uniform contract (Common + Domain manifest, lifecycle, observability). You author and evolve this contract.
- Decide tier placement — what runs at Edge DC vs Regional Controller vs Global Hub, with data-residency / compliance / availability tradeoffs explicit.
- Coordinate with cloud-service teams and tenants — they author plugins, SDKs, dashboards, agent recipes that ride the platform. You set the contracts they consume.
- Coordinate with Security — joint ownership of vulnerability management, exposure management, joint operations. Security owns policy and risk acceptance; you own the operational mechanisms they ride.
- Pre-flight roadmap items — for any new capability, produce a one-page design that fits the existing layered model (L1–L6), tier topology, naming conventions, and extension contracts before implementation starts.
- Defend the design under review — say no to scope creep, special-case workarounds, and one-off integrations that don't fit the framework model. Say yes when a new plugin kind is genuinely needed.
Qualifications
- 10+years of production SRE / platform-engineering / infra-architecture, including ≥ 3 years at architect level.
- Hands-on with GPU / AI-compute infrastructure — NVIDIA GPU ops (DCGM, MIG, vGPU, NVLink/NVSwitch, XID semantics, NCCL), InfiniBand or RoCE fabrics (subnet manager, fabric partitioning, optical health), HPC storage (Lustre, NetApp/Pure/DDN/VAST, NVMe-oF).
- Multi-region observability at scale — metrics / logs / traces / profiles / analytics-lake substrate; recording rules, MWMBR burn-rate alerting, SLI/SLO discipline.
- Cluster platforms — first-hand experience with Kubernetes (control plane + GPU Operator + topology-aware scheduling) AND at least one of Slurm / Volcano / Kueue / Ray / KubeRay.
- Data-center operations — ZTP, BMC/IPMI/Redfish, BIOS/firmware lifecycle, RMA, multi-vendor OEM management (self-built + leased DC mix).
- Strong DDD instincts — bounded contexts, public contracts, no shared databases, one-context-one-repo discipline.
- Plugin framework design — you have built (or substantively contributed to) a real extension framework with a uniform manifest + lifecycle.
- Writing fluency — you can author and maintain a multi-thousand-line architecture document under review without it drifting; you can also write a one-pager an executive will read.
- Cross-team operating tempo — design reviews, runbook authorship, on-call shadowing, post-mortem facilitation
- Hyperscale or NeoCloud experience
- BS/MS in Computer Science or similar
--------------------------------------------------------------------
Bitdeer is committed to providing equal employment opportunities in accordance with country, state, and local laws. Bitdeer does not discriminate against employees or applicants based on conditions such as race, color, gender identity and/or expression, sexual orientation, marital and/or parental status, religion, political opinion, nationality, ethnic background or social origin, social status, disability, age, indigenous status, and union.
- ...Job Title Chief Architect / Sr. Architect – Distributed Cloud Job Overview F5 Distributed Cloud... ...execution across our entire distributed SaaS platform. This senior leadership role is... ...among product, security, infrastructure, SRE, and development teams to ensure coherent...PlatformSeniorLocal areaRemote work
$268.8k - $403.2k
...individual can thrive. Job Title: Chief Architect / Sr. Architect - Distributed Cloud... ...execution across our entire distributed SaaS platform. This is a senior leadership role responsible... ...product, security, infrastructure, SRE, and development teams to ensure coherent...PlatformSeniorLocal areaRemote workHome office$185.9k - $278.9k
...Engineering Overview Qualcomm is looking for an experienced SoC architect to work on the next generation AI products in the datacenter.... ...Support multiple clients using VMs on the same HW platform Leverage industry best practices for security and isolation between...PlatformSeniorWork experience placementWork from home- Senior DevOps & SRE Manager - Platform Reliability & Global Operations A senior technical leader responsible for reliability, scalability, security, and operational excellence of a complex, multi‑platform ecosystem spanning applications, workflows, event streaming, and...PlatformSeniorWork at office3 days per week
$100k
...must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of... ...systems, and software. We are seeking a CPU Performance Modeling Architect to help shape the next generation of high-performance RISC-V CPUs...PlatformSeniorPermanent employmentFull time$284.9k - $427.3k
...applications. We are looking for a highly experienced Server Product Architect to define the architecture of a Server SoC that meets critical... .... Key Responsibilities Collaborate with chip and platform architects to define and develop product and SoC architecture...PlatformSeniorWork from home$190.61k - $361.48k
...CPU Performance Architect The Role and Impact: As a CPU Performance Architect, you will play a pivotal role in shaping the future... ...Santa Clara, US, Oregon, Hillsboro Business group: Silicon and Platform Engineering Group (SPE): Deliver breakthrough silicon and platform...PlatformSeniorLocal areaImmediate startShift work- ...NetSuite, SAP, Oracle, Workday, ServiceNow, Coupa, and similar platforms. ~ Experience with REST, SOAP, JSON, XML, OAuth, SFTP, EDI,... ...engagements. Preferred Qualifications Boomi Professional Architect and/or Workato certifications. Experience with AI-driven...PlatformSenior
- 42dot is seeking a talented Sr. Staff Firmware Engineer in Sunnyvale, CA to design and implement critical secure boot systems for next-generation software-defined vehicles. The role requires expertise in embedded C programming and hardware security configurations. Applicants...PlatformSenior
- A leading technology company is looking for a Java SRE Engineer to support large-scale cloud migrations and production systems on AWS... ...and Kubernetes. You will lead migrations, design robust AWS EKS platforms, and implement deployment strategies. The ideal candidate has...PlatformSenior
$356.5k
...NVIDIA Gruppe is seeking a Senior Software Architect in Santa Clara, California. This role involves co-designing next-generation data center platforms and developing scalable communications software to enhance Deep Learning and HPC applications. Candidates should have...PlatformSenior$189.24k - $266.76k
42dot Inc. in Sunnyvale, United States is seeking a Sr. Staff Firmware Engineer to architect a next-generation OTA update framework for the Software-Defined Vehicle platform. This role emphasizes on-device software design for secure, high-availability firmware solutions...PlatformSenior$212k - $386.3k
Apple Inc. is seeking a senior professional in Machine Learning and AI to enhance user experiences through LLM-based question answering and generative AI features. The ideal candidate will have over 10 years of R&D experience in search and NLP, alongside a relevant MS or...PlatformSenior- Palo Alto Networks, Inc. is seeking a Principal Software Engineer to architect and enhance the observability platform across various systems and workflows. This role focuses on deep technical leadership, leveraging AI-embedded solutions and open-source technologies for...PlatformSenior
$232k - $368k
NVIDIA Gruppe is looking for a Senior Power and Performance Architect in Santa Clara, California. The role involves designing innovative... ...solutions and optimizing power systems across varied platforms and products. Ideal candidates will have over 15 years of experience...PlatformSenior$189.24k - $266.76k
42dot Inc. is looking for a Sr. Staff Firmware Engineer in Sunnyvale, California. In this position, you will design and implement... ...ensuring high performance and reliability. Responsibilities include architecting the secure boot-chain and managing hardware isolation...PlatformSenior- ...is looking for a Distinguished Technologist Mechanical Engineer to lead the development of complex networking systems and chassis platforms. The role requires over 15 years of experience in product development and a BS in Mechanical Engineering. The ideal candidate will...PlatformSenior
- 42dot is seeking a Sr. Staff Firmware Engineer to develop next-generation firmware platforms for Hyundai's Software-Defined Vehicles. You will be involved in designing and implementing key components, ensuring high performance and safety. The ideal candidate has over 8...PlatformSenior
$176k - $333.5k
NVIDIA Corporation in Santa Clara is seeking a Site Reliability Engineer (SRE) to design and maintain large-scale production systems focusing on reliability and observability. Candidates should have a BS in Computer Science or related field and 8+ years' experience in infrastructure...PlatformSenior$211.8k - $317.8k
Qualcomm in Santa Clara, California is seeking a Software Engineer to design and develop embedded and cloud edge software. You will work on implementing firmware for Qualcomm’s upcoming products and collaborate with hardware and firmware teams. The role requires a Bachelor...PlatformSenior$168k - $322k
NVIDIA Corporation is looking for a Senior AI Platform Engineer in Santa Clara to build and maintain next-generation AI-powered products. The role focuses on defining AI-native infrastructure, scaling LLM/ML systems, and ensuring reliability across platforms. The ideal...PlatformSenior- Nari is seeking an Electrical Systems Architect to lead the platform-level electronics architecture at their headquarters in Santa Clara, CA. This role involves defining the architecture for Agilent’s LC/MS products, balancing performance and reliability while collaborating...PlatformSenior
$224k - $356.5k
NVIDIA Gruppe in Santa Clara is looking for a strong technical leader for its DriveOS software architecture group. The ideal candidate will have in-depth knowledge of complex systems, solid experience in Embedded Systems, and a Master's degree, with 12+ years of relevant...PlatformSeniorWork experience placement$320k
NVIDIA Gruppe in Santa Clara is seeking a Distinguished Software Architect to lead the design of next-generation data center platforms. This role demands deep expertise in HPC and networking, aiming to improve GPU communication technologies. You will research and implement...PlatformSenior- ...Ventures is looking for a Senior Staff SI/PI Engineer responsible for ensuring the electrical integrity of high-performance AI compute platforms. You will own the SI/PI strategy for next-generation AI accelerators and lead complex multi-chip package modeling. The ideal...PlatformSenior
- Intel Corporation is seeking a Senior SoC Chiplet Architect in Santa Clara, CA. This role will define and lead the architecture strategy... ..., and driving technical alignment across architecture and platform teams. A Bachelor's in Electrical Engineering and extensive SoC...PlatformSenior
- NVIDIA Gruppe in Santa Clara is seeking a Senior Systems Engineer to lead advancements in high-speed sensor streaming on the Holoscan platform. This role involves collaboration with top SDK developers and will require solid experience in C/C++/Python. The ideal candidate...PlatformSenior
- Digital Technologies, LLC is looking for experts to design, integrate, and deploy IAM products, particularly involving the Saviynt platform. Ideal candidates will have over 10 years of experience in enterprise software development, with a strong focus on identity and...PlatformSenior
- ...seeking a Senior Hardware Engineer to develop solutions for GPU products. You will collaborate in launching new GPU Accelerated Server Platforms optimized for AI and analytics. Your responsibilities include developing diagnostic tests, defining manufacturing screens, and...PlatformSenior
$254.34k - $310.86k
SiFive, Inc. in Santa Clara is seeking an experienced SoC architect to lead the development of high-performance system IPs, including... ...and memory controllers. The successful candidate will define platform security requirements, collaborate with cross-functional teams...PlatformSenior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Sr. SRE Platform Architect. Be the first to apply!
- senior cloud service delivery manager San Jose, CA
- senior business analyst contract San Jose, CA
- senior product design engineer San Jose, CA
- senior game producer San Jose, CA
- senior software manager San Jose, CA
- senior manager business analytics San Jose, CA
- senior marketing account manager San Jose, CA
- senior marketing manager San Jose, CA
- senior contracts analyst San Jose, CA
- sr operations manager San Jose, CA

