Rack Scale Serviceability & Telemetry Architect
Advanced Micro Devices , Inc.
WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate next‑generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover that the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career. Rack Scale Serviceability & Telemetry Architect THE TEAM AMD’s Data Center GPU Systems Architecture team defines next‑generation AMD Instinct platforms and complete rack‑scale solutions for hyperscale AI and HPC deployments. We work across silicon, GPU system firmware, server and board architecture, BMC/platform firmware, management software, security, validation, manufacturing, and ecosystem partners to turn product strategy into deployable, serviceable, production‑ready platforms. THE ROLE AMD is seeking a Principal Member of Technical Staff (PMTS) to own the architecture for rack‑scale serviceability and telemetry across AMD Instinct product lines and complete rack‑scale solutions. This is a highly visible technical leadership role responsible for defining the end‑to‑end manageability, observability, and serviceability architecture spanning node, chassis/tray, rack, and fleet domains. You will drive the strategy, architecture, execution, and delivery of standards‑based solutions for inventory, discovery, health monitoring, telemetry, eventing, diagnostics, firmware lifecycle management, and field service workflows across the full AMD rack‑scale stack. In this role, you will independently own a critical cross‑product architecture area and drive alignment across GPU/SoC architecture, server/platform architecture, BIOS/UEFI, BMC and embedded software, security, RAS, validation, ODM/OEM partners, and customer‑facing teams. The role spans early concept definition through bring‑up, validation, deployment, and post‑launch improvement. THE PERSON The ideal candidate is a deeply technical system architect with strong first‑principles thinking and a track record of delivering manageability, telemetry, and serviceability solutions for servers, accelerators, storage, networking, or rack‑scale AI/HPC platforms. You are equally comfortable setting long‑range technical direction and diving hands‑on into protocol definitions, interface design, telemetry models, bring‑up, debug, and root‑cause analysis. You thrive in ambiguity, influence without authority, raise execution quality across teams, and exemplify AMD’s values through direct, humble, collaborative, and inclusive leadership. KEY RESPONSIBILITIES Define and own the end‑to‑end rack‑scale serviceability and telemetry architecture for AMD Instinct‑based solutions, spanning node BMC, chassis/rack management, service processors/controllers, management network, and fleet‑level observability integration. Define the standards strategy and interface architecture using DMTF Redfish, PLDM, MCTP, and related specifications, maximizing standards compliance while establishing AMD/OEM extensions only where required. Drive OpenBMC‑based architecture and implementation direction for BMC and rack management controllers, including D‑Bus object models, bmcweb/Redfish requirements, sensor and FRU inventory models, logging, eventing, firmware update, and debug workflows. Architect telemetry frameworks for health, power, thermal, inventory, error, utilization, and service data. Define schemas, metric taxonomies, triggers, event models, aggregation, retention, and reporting strategies required for at‑scale observability and automated service operations. Define platform serviceability flows covering discovery, inventory correlation, fault isolation, diagnostics, crash‑dump and error capture, remote recovery, FRU replacement, firmware/driver update orchestration, and return‑to‑service procedures. Partner with GPU/SoC architects, board and system architects, firmware and software teams, security/RAS, validation, manufacturing, and customer engineering to translate requirements into production‑ready architecture and deliverables. Work closely with ODM/OEMs and ecosystem partners to review designs, close gaps, guide implementation trade‑offs, and deliver robust reference solutions and customer platforms on schedule. Drive validation and conformance strategy for manageability and telemetry, including interoperability, Redfish/PLDM compliance, fault injection, service workflow validation, scale testing, and field debug methodology. Influence future AMD Instinct platform roadmaps using insights from bring‑up, partner integrations, deployment learnings, and telemetry‑driven data. Represent AMD in relevant standards and open‑source communities, including DMTF and OpenBMC forums, and guide upstream/downstream strategy where appropriate. Mentor engineers and architects across the organization and serve as the senior technical point of contact for rack‑scale serviceability and telemetry. PREFERRED EXPERIENCE Expert level experiences in platform architecture, system management, BMC/embedded firmware, server manageability, or adjacent domains, including significant time in architect or technical leadership roles. Proven experience defining serviceability/manageability architecture for servers, accelerators, storage, networking, or rack‑scale infrastructure in datacenter, cloud, AI, or HPC environments. Deep knowledge of DMTF Redfish, including schema design, OEM extension strategy, eventing, update service, and telemetry concepts such as MetricReportDefinition/Metric Reports; strong understanding of PLDM/MCTP for platform inventory, monitoring, control, and update workflows. Strong hands‑on experience with OpenBMC, including Yocto/OpenEmbedded, D‑Bus, systemd, bmcweb/Redfish, phosphor services, firmware update flows, sensor frameworks, and log/event handling. Experience with embedded Linux, ARM‑based BMC SoCs, U‑Boot, Linux kernel/device driver concepts, device tree, and low‑level interfaces such as I2C/I3C, SPI, UART, GPIO, SMBus/PMBus, and related platform‑management buses. Strong understanding of server/platform RAS and serviceability features such as health monitoring, error logging, crash‑dump, diagnostics, inventory/FRU management, and remote recovery. Experience with secure manageability architectures, including secure boot, root of trust, attestation, firmware signing, SPDM, and protection of out‑of‑band management paths. Experience creating architecture specifications, product requirements, conformance plans, validation strategies, and design reviews that drive execution across multiple internal teams and external partners. Strong programming and scripting background in C/C++, Python, and shell, with the ability to debug across firmware, hardware, and system software boundaries. Experience with large‑scale telemetry or observability pipelines, metrics consumers, or fleet operations tooling is strongly preferred. Experience with AMD server or GPU platforms, AI/HPC system design, liquid cooling/power/thermal infrastructure, or OCP‑aligned rack architectures is a plus. Strong written and verbal communication skills with proven ability to influence senior engineering leadership, customers, and strategic partners. ACADEMIC CREDENTIALS Bachelor’s or Master’s degree in Computer Science, Computer Engineering, Electrical Engineering, or a related technical field. Advanced degree preferred. LOCATION Austin, Texas preferred. Other AMD datacenter engineering locations may be considered based on team alignment and business needs. This role is not eligible for visa sponsorship. Benefits offered are described: AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee‑based recruitment services. AMD and its subsidiaries are equal‑opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third‑party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process. AMD may use Artificial Intelligence to help screen, assess, or select applicants for this position. AMD’s “Responsible AI Policy” is available here. This posting is for an existing vacancy. #J-18808-Ljbffr
- ...Advanced Micro Devices is seeking a Principal Member of Technical Staff (PMTS) to define and own the architecture for rack-scale serviceability and telemetry. The role requires expertise in AMD Instinct solutions, focusing on end-to-end manageability and observability...Suggested
$123.1k - $273k
...specialist product advisors and technical architects. This team not only resolves technical... ...As a Success Architect specializing in Service Cloud, you will: Drive Customer Impact... ...practices. ~ Experience with large-scale, complex implementations, including SaaS...SuggestedImmediate startWorldwide- ...Senior Architect, Enterprise Architecture – Enterprise Servicing Career Architecture Title: IT Principal Architect, Enterprise Architecture Position Summary... ...workforce forecasting, staffing, and coaching Scaled servicing capabilities with improved agility and reduced...SuggestedLocal areaWork from home
- ...Delivery - Sr. Staff Engineer to own the architecture of the Lead Services platform in Austin, Texas. The role demands over 10 years of... ...software engineering experience with a strong background in large-scale systems and technical leadership. Key responsibilities include...Suggested
$134.05k - $221.21k
...The Americas Telco Services Delivery Managing Architect is instrumental in guiding technical direction and driving customer success for our most strategic customers. Through the knowledge of the Telco industry and Red Hat technologies, this experienced technical manager...SuggestedPermanent employmentFull timeContract workWork experience placementWork at officeRemote workFlexible hours$152.8k - $191k
...Description POSITION SUMMARY We are seeking a visionary Lead AI Architect to lead the design and implementation of next-generation AI... ...Developers to embed AI outputs directly into the flow of work (Service Console, Sales Cloud LWC, etc.). Act as the subject matter...Work at officeImmediate startWorldwide- ...designers, software engineers and systems architects, Graphcore enjoys a culture of... ...responsible for architecting a cohesive, AI rack scale platform optimized for trillion-parameter... ...plan, commuter benefits, wellness services and an Employee Assistance Programme (EAP...Flexible hours
$150.1k - $227k
...we are delivering a successful experience to every customer at scale. Cloud Success Readiness improves customer Adoption and Product... ...to deliver product innovation to our customers. As a Readiness Architect, you will be responsible for understanding the Industries Cloud...Shift work$126k - $229.8k
...for an experienced SoC architect to work on the next... ...server level to a full rack. Responsibilities:... ...Design/architect for easy serviceability Implement HW/SW... ...RAS, virtualization, telemetry, power, cooling, lifecycle... ...~ Good knowledge of scale-up and scale-out architectures...Work experience placement- ...States is seeking a Strategic Sourcing Leader to drive efficient sourcing strategies and manage supplier relationships in Professional Services. The ideal candidate will have over 7 years of experience in strategic sourcing and the ability to influence senior stakeholders....
$160k - $200k
...leading investment management firm is seeking a Principal Power Generation Architect Engineer. This role involves owning the reference architecture for on-site power generation solutions across mega-scale data centers. The ideal candidate should have a Bachelor’s degree in...Flexible hours- ...Growth to own the revenue engine across acquisition, analytics, and creative strategy. This hands-on role requires proven experience scaling a DTC business from $150M to over $250M, managing substantial paid media budgets, and leading high-performing teams. The ideal...Remote job
$153k - $187k
...innovative, and forward-thinking Sr. GTM AI Architect to lead the design and deployment of AI-... ...go deeper — providing dedicated focus to scale those efforts and drive adoption of the... ..., we offer a bonus after 7 years of service. Wellness Subsidy – We provide a subsidy...Currently hiringRemote workFlexible hours$96k - $175k
Principal IT Solution Delivery - Service Now CSM Delivery Architect page is loaded## Principal IT Solution Delivery - Service Now CSM Delivery Architectremote type: Hybridlocations: United States of America, Washington, Liberty Lake: United States of America, Texas, Austintime...ApprenticeshipInternshipWorldwideFlexible hours- ...that connect Sales, Marketing, Finance, and Operations need to scale with it. The Director of Revenue Operations owns that infrastructure... ...and document what you build so others can rely on it Financial services or compliance industry experience a plus Actively leverages AI...Full timeContract workWork at office
- ...teams across diverse healthcare types and scales. Technical understanding of health‑... ...orientation with understanding of consulting‑service challenges. Excellent oral and written... ...or equivalent. Licensed or credentialed architect preferred. 15+ years of experience in the...Full timeContract workTemporary workPart timeCasual workLocal areaFlexible hours
- ...enterprises to control risk, manage costs and scale efficiently for a data and AI led world.... ...government organizations, financial services, media and information technology... ...). To achieve this mission, you will architect and build data platform solutions that leverage...Local areaRemote work
$139.4k - $230k
...best-in-class solutions. As a Senior Architect, you will partner with Technology and Business... ..., agentic frameworks, and cloud AI services — and contribute insights that help... ...technologies. Successfully architected large scale technology initiatives. Leadership &...Work experience placementLocal area- ...Principal Ai Platforms Architect Locations: Atlanta | Austin | Boston | Brooklyn | Chicago... ..., Data & Digital Platforms, AI at Scale, Agile, Cybersecurity and Digitizing the... ...testing, and monitoring of models and AI services (LLMOps). Communication and Collaboration...
$70k - $100k
...infrastructure investment worldwide, our services are in great demand. We invite you to... ...seeking a talented and motivated Project Architect to join our Buildings + Places business... ...in your local community and on a global scale - that are transforming our industry and...Work at officeLocal areaWorldwideFlexible hours$220.92k - $311.89k
...our customers to design leadership products, global manufacturing scale and supply chain, through the continuous yield improvements to... ...foundry customers' products receive our utmost focus in terms of service, technology enablement and capacity commitments. Employees in...Local areaImmediate startShift work- ...leading integrated design practice. Our architects, engineers, interior designers, consultants... ...a range of healthcare project types and scales with proven ability to balance a book of... ...development challenges of a consulting-services provider Excellent oral and written...Full timeContract workTemporary workPart timeCasual workWork at officeLocal areaFlexible hours
- ...Security, is looking for an experienced ServiceNow ITAM Architect to join our cross-functional Service Management team. This individual will be a key team... ...and impactful organization as SailPoint continues to scale globally as the industry leader in Identity Security....Full timeTemporary workRemote workFlexible hours
$75 - $85 per hour
...MDM Architect | Austin, Texas, United States Job Summary: MDM Architect (AI-Driven Data Management... ..., directly impacting Health and Human Services data quality and analytics. Drive... ...Informatica and ETL development for large-scale data systems. - 8+ years designing and...$156.64k
...currently seeking a Senior Cloud Platform Architect to lead the vision, design, and... ...Lead new product development, including services evaluation, POC (proof of concept) development... ..., influencing architecture decisions at scale. Lead cross functional initiatives spanning...Remote workShift work$160k
...CI/CD, and infrastructure as code for AI services Internal APIs, reusable services, admin... ...shared tooling Observability, evaluation, telemetry, security controls, and feedback systems... ...that keep AI systems reliable at scale Review, debug, and refine AI‑generated code...Shift work- ...IDR is seeking a MDM Architect to join one of our top clients for a remote opportunity... ...implement master data solutions for a large-scale Master Data Management (MDM) environment... ...governance across the Health and Human Services System. Position Overview for...Work at officeRemote work
$103.3k - $287.6k
AI/ML Architect, MediaOS Platform,IQVIA Digital We are seeking a visionary AI/ML Architect... ...production-grade ML systems at scale Strong programming expertise in Python (... ...leading global provider of clinical research services, commercial insights and healthcare intelligence...Full timePart timeWork at officeImmediate startRemote workWorldwideFlexible hours$164.47k - $269.1k
...performance networking silicon. Our team architects next-generation networking solutions... ...86 clusters), including core selection, scaling strategy, and configuration tradeoffs... ...plane assist, offload execution, management services) # Drive compute architecture decisions...Local areaImmediate startShift work$196.64k - $328.35k
...Principal Integration Architect - IT/OT Convergence Location: Houston, TX, US Ann Arbor... ...industrial and operational environments at scale. This is an opportunity to shape complex... ..., including MuleSoft, Azure Integration Services, Boomi, Apigee, IBM webMethods, Workato,...Full timePart timeWork experience placementWork at officeLocal areaRelocationVisa sponsorshipFlexible hours
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Rack Scale Serviceability & Telemetry Architect. Be the first to apply!


