Rack Scale Serviceability & Telemetry Architect

Advanced Micro Devices, Inc.

WHAT YOU DO AT AMD CHANGES EVERYTHING

At AMD, our mission is to build great products that accelerate next‑generation computing experiences, from AI and data centers to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover that the real differentiator is our culture. We push the limits of innovation to solve the world’s most important challenges, striving for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.

THE TEAM

AMD’s Data Center GPU Systems Architecture team defines next‑generation AMD Instinct platforms and complete rack‑scale solutions for hyperscale AI and HPC deployments. We work across silicon, GPU system firmware, server and board architecture, BMC/platform firmware, management software, security, validation, manufacturing, and ecosystem partners to turn product strategy into deployable, serviceable, production‑ready platforms.

THE ROLE

AMD is seeking a Principal Member of Technical Staff (PMTS) to own the architecture for rack‑scale serviceability and telemetry across AMD Instinct product lines and complete rack‑scale solutions. This is a highly visible technical leadership role responsible for defining the end‑to‑end manageability, observability, and serviceability architecture spanning node, chassis/tray, rack, and fleet domains. You will drive the strategy, architecture, execution, and delivery of standards‑based solutions for inventory, discovery, health monitoring, telemetry, eventing, diagnostics, firmware lifecycle management, and field service workflows across the full AMD rack‑scale stack.

In this role, you will independently own a critical cross‑product architecture area and drive alignment across GPU/SoC architecture, server/platform architecture, BIOS/UEFI, BMC and embedded software, security, RAS, validation, ODM/OEM partners, and customer‑facing teams. The role spans early concept definition through bring‑up, validation, deployment, and post‑launch improvement.

THE PERSON

The ideal candidate is a deeply technical system architect with strong first‑principles thinking and a track record of delivering manageability, telemetry, and serviceability solutions for servers, accelerators, storage, networking, or rack‑scale AI/HPC platforms. You are equally comfortable setting long‑range technical direction and diving hands‑on into protocol definitions, interface design, telemetry models, bring‑up, debug, and root‑cause analysis. You thrive in ambiguity, influence without authority, raise execution quality across teams, and exemplify AMD’s values through direct, humble, collaborative, and inclusive leadership.

KEY RESPONSIBILITIES

  • Define and own the end‑to‑end rack‑scale serviceability and telemetry architecture for AMD Instinct‑based solutions, spanning node BMC, chassis/rack management, service processors/controllers, management network, and fleet‑level observability integration.
  • Define the standards strategy and interface architecture using DMTF Redfish, PLDM, MCTP, and related specifications, maximizing standards compliance while establishing AMD/OEM extensions only where required.
  • Drive OpenBMC‑based architecture and implementation direction for BMC and rack management controllers, including D‑Bus object models, bmcweb/Redfish requirements, sensor and FRU inventory models, logging, eventing, firmware update, and debug workflows.
  • Architect telemetry frameworks for health, power, thermal, inventory, error, utilization, and service data. Define schemas, metric taxonomies, triggers, event models, aggregation, retention, and reporting strategies required for at‑scale observability and automated service operations.
  • Define platform serviceability flows covering discovery, inventory correlation, fault isolation, diagnostics, crash‑dump and error capture, remote recovery, FRU replacement, firmware/driver update orchestration, and return‑to‑service procedures.
  • Partner with GPU/SoC architects, board and system architects, firmware and software teams, security/RAS, validation, manufacturing, and customer engineering to translate requirements into production‑ready architecture and deliverables.
  • Work closely with ODM/OEMs and ecosystem partners to review designs, close gaps, guide implementation trade‑offs, and deliver robust reference solutions and customer platforms on schedule.
  • Drive validation and conformance strategy for manageability and telemetry, including interoperability, Redfish/PLDM compliance, fault injection, service workflow validation, scale testing, and field debug methodology.
  • Influence future AMD Instinct platform roadmaps using insights from bring‑up, partner integrations, deployment learnings, and telemetry‑driven data.
  • Represent AMD in relevant standards and open‑source communities, including DMTF and OpenBMC forums, and guide upstream/downstream strategy where appropriate.
  • Mentor engineers and architects across the organization and serve as the senior technical point of contact for rack‑scale serviceability and telemetry.

PREFERRED EXPERIENCE

  • Expert‑level experience in platform architecture, system management, BMC/embedded firmware, server manageability, or adjacent domains, including significant time in architect or technical leadership roles.
  • Proven experience defining serviceability/manageability architecture for servers, accelerators, storage, networking, or rack‑scale infrastructure in datacenter, cloud, AI, or HPC environments.
  • Deep knowledge of DMTF Redfish, including schema design, OEM extension strategy, eventing, update service, and telemetry concepts such as MetricReportDefinition/Metric Reports; strong understanding of PLDM/MCTP for platform inventory, monitoring, control, and update workflows.
  • Strong hands‑on experience with OpenBMC, including Yocto/OpenEmbedded, D‑Bus, systemd, bmcweb/Redfish, phosphor services, firmware update flows, sensor frameworks, and log/event handling.
  • Experience with embedded Linux, ARM‑based BMC SoCs, U‑Boot, Linux kernel/device driver concepts, device tree, and low‑level interfaces such as I2C/I3C, SPI, UART, GPIO, SMBus/PMBus, and related platform‑management buses.
  • Strong understanding of server/platform RAS and serviceability features such as health monitoring, error logging, crash‑dump, diagnostics, inventory/FRU management, and remote recovery.
  • Experience with secure manageability architectures, including secure boot, root of trust, attestation, firmware signing, SPDM, and protection of out‑of‑band management paths.
  • Experience creating architecture specifications, product requirements, conformance plans, validation strategies, and design reviews that drive execution across multiple internal teams and external partners.
  • Strong programming and scripting background in C/C++, Python, and shell, with the ability to debug across firmware, hardware, and system software boundaries.
  • Experience with large‑scale telemetry or observability pipelines, metrics consumers, or fleet operations tooling is strongly preferred.
  • Experience with AMD server or GPU platforms, AI/HPC system design, liquid cooling/power/thermal infrastructure, or OCP‑aligned rack architectures is a plus.
  • Strong written and verbal communication skills with proven ability to influence senior engineering leadership, customers, and strategic partners.

ACADEMIC CREDENTIALS

Bachelor’s or Master’s degree in Computer Science, Computer Engineering, Electrical Engineering, or a related technical field. Advanced degree preferred.

LOCATION

Austin, Texas preferred. Other AMD datacenter engineering locations may be considered based on team alignment and business needs. This role is not eligible for visa sponsorship.

Benefits offered are described in AMD benefits at a glance.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee‑based recruitment services. AMD and its subsidiaries are equal‑opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third‑party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

AMD may use Artificial Intelligence to help screen, assess, or select applicants for this position. AMD’s “Responsible AI Policy” is available here.

This posting is for an existing vacancy.

Vacancy posted 4 days ago