Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

System Debug Engineer Manager, Cloud AI Infrastructure

$192k - $278k

Google Inc.

corporate_fare Google place Kirkland, WA, USA ; Austin, TX, USA Apply In accordance with Washington state law, we are highlighting our comprehensive benefits package, which is available to all eligible US based employees. Benefits for this role include: Health, dental, vision, life, disability insurance Retirement Benefits: 401(k) with company match Paid Time Off: 20 days of vacation per year, accruing at a rate of 6.15 hours per pay period for the first five years of employment Sick Time: 40 hours/year (increased to 69 hours/year for Seattle) including 5 discretionary sick days per instance Maternity Leave (Short-Term Disability + Baby Bonding): 28-30 weeks Baby Bonding Leave: 18 weeks Holidays: 13 paid days per year Note: By applying to this position you will have an opportunity to share your preferred working location from the following: Kirkland, WA, USA; Austin, TX, USA . Bachelor's degree in Computer Science or IT-related field, or equivalent practical experience. 8 years of experience with system design. 5 years of experience managing or leading a team. 5 years of experience with managing technical work, engineering strategy, and roadmaps. 5 years of experience with hardware debug (silicon debug, platform debug, IO interface, memory analysis). 3 years of experience with organizational design. Preferred qualifications: 5 years of experience working with vendors or customers. 3 years of experience with leadership development and career growth of employees. 3 years of experience in analyzing and troubleshooting distributed systems. 2 years of CPU, dGPU, or TPU debug or validation experience. Understanding of memory and high-speed IO technologies. About the job Systems Development Engineering (SDE) at Google is a role where you manage services and systems at scale. SDEs creatively put their engineering discipline to use automating the mundane and reducing toil. We don’t just write code to fix bugs, but emphasize the development of tools and solutions that fix classes of problems. We know it’s hard to control what you can’t measure – so we focus on observability: instrumenting first, then turning data into knowledge, and finally knowledge into action. We know that the operational efficiency of Google systems, services, virtual compute environments and the operating systems that power them impact the environment, not just the bottom line. We know that working together we can do more, and that community matters. Google brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame‑free environment. We promote self‑direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow. Together we engineer and build the infrastructure, tools, access and telemetry for systems that enable orchestration of Google‑scale services. Come build things that matter. As a part of the Google Cloud Support team, you will ensure customers maximize their investment. As a Systems Debug Engineer, you will be a trusted advisor driving hardware understanding and issue resolution. You will troubleshoot platform challenges, providing expert solutions that enable innovation. You will represent the customer, collaborate with engineering and product teams to drive continuous improvement across global cloud products and services. The US base salary range for this full‑time position is $192,000‑$278,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job‑related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process. Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google . Responsibilities Drive technical team performance across on‑call activities and system management by delivering leadership, mentorship, and career development while collaborating with primary responders to address system issues. Debug platform hardware, silicon, and AI/ML workloads to drive root‑cause resolution, develop permanent infrastructure improvements, and build tools for faster diagnosis through troubleshooting and reproduction. Collaborate cross‑functionally with Product, Quality, and Engineering teams to enhance product outcomes, and engage with Site Reliability Engineering (SRE) teams to ensure high‑quality production and reliability. Resolve customer challenges on AI/ML infrastructure through effective diagnosis, resolution, and the implementation of investigation tools to increase productivity for critical reported issues. Serve as a consultant and subject matter expert for internal stakeholders to resolve deployment and operational obstacles across AI infrastructure environments daily. Google is proud to be an equal opportunity and affirmative action employer. We are committed to building a workforce that is representative of the users we serve, creating a culture of belonging, and providing an equal employment opportunity regardless of race, creed, color, religion, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition (including breastfeeding), expecting or parents‑to‑be, criminal histories consistent with legal requirements, or any other basis protected by law. See also Google's EEO Policy, Know your rights: workplace discrimination is illegal, Belonging at Google, and How we hire. Google is a global company and, in order to facilitate efficient collaboration and communication globally, English proficiency is a requirement for all roles unless stated otherwise in the job posting. To all recruitment agencies: Google does not accept agency resumes. Please do not forward resumes to our jobs alias, Google employees, or any other organization location. Google is not responsible for any fees related to unsolicited resumes. #J-18808-Ljbffr Google Inc.

Vacancy posted 2 days ago
Similar jobs that could be interesting for youBased on the System Debug Engineer Manager, Cloud AI Infrastructure in Austin, TX vacancy
  • $163k - $237k

     ...experience with systems automation, and...  ...with technical infrastructure (e.g.,...  ...resolve complex AI infrastructure...  ...Systems Development Engineering (SDE) at Google...  ...role where you manage services and systems...  .... The Google Cloud Support team...  .... As a Systems Debug Engineer, you will... 
    Suggested
    Full time
    Temporary work

    Google Inc.

    Austin, TX
    4 days ago
  • $200k - $300k

     ...Senior Engineering Manager page is loaded## Senior Engineering...  ...building products rooted in AI, automation, and...  ...our team focused on cloud governance, IAM,...  ...integrity of Zendesk’s systems while accelerating product...  ...environment infrastructure; etc.* Partner with Corp... 
    Suggested
    Work at office
    Remote work

    Zendesk Group

    Austin, TX
    12 hours ago
  • $163k - $237k

    Google is seeking a Systems Debug Engineer located in Austin, Texas. In this role, you'll manage services and systems at scale, focusing...  ...customer issues related to AI/ML workloads. You'll also participate...  ...automation and technical infrastructure. The compensation package... 
    Suggested

    Google

    Austin, TX
    2 days ago
  • Site Reliability Engineer (Edge Services), Infrastructure Services Austin,...  ...distributed systems and seamless user...  ...be comfortable debugging protocol‑level issues...  ...configuring and managing modern...  ...Experience managing cloud environments (AWS...  ...applying Generative AI tools within SRE... 
    Suggested
    Shift work

    Apple Inc.

    Austin, TX
    2 days ago
  • $81k - $260k

    Micron Technology, Inc. is seeking a candidate with extensive experience in system bring-up and debugging to drive technical collaborations with partners like Intel and AMD. The role requires solid knowledge of computer hardware systems and the ability to work in a fast... 
    Suggested

    Micron Technology, Inc.

    Austin, TX
    2 days ago
  • $170k - $220k

    At Electric Mind, Engineering is where strategy meets action. Our team helps...  ...designs and built modern cloud‑based data platforms that support analytics, AI, and machine learning workloads...  ...Experience and understanding in Infrastructure‑as‑Code Implement and integrate... 

    Electric Mind

    Austin, TX
    2 days ago
  • $183k - $265k

     ..., Technology, Engineering, Mathematics,...  ...experience in cloud computing or a...  ...of experience managing a software engineering...  ...developing AI/Generative AI...  ...(RAG) systems. Experience in...  ...solutions within infrastructures, ensuring data...  ...consult, but codes, debugs and jointly... 
    Full time

    Google Inc.

    Austin, TX
    4 days ago
  •  ...platform company based in Texas is looking for a mid-level engineer to develop and maintain robust cloud and on-premise platform infrastructure. In this role, you will be responsible for deploying Teradata systems across AWS, Azure, and Google Cloud, tuning performance... 

    Teradata

    Austin, TX
    3 days ago
  • $100k - $135k

     ...Sr. Product Manager Private Cloud Solutions As a Sr. Product Manager at...  ...Collaborate with Product, Engineering, Finance, Operations, Sales...  ...Technical knowledge of IT infrastructure (networking, compute, and storage...  ...tools (e.g., Miro), AI tools Familiarity with Salesforce... 
    Flexible hours

    Otava, Inc.

    Austin, TX
    9 days ago
  •  ...the most complete cloud analytics and data platform for AI. By delivering harmonized...  ...’ll Do As Senior Manager, Cloud Marketplace...  ...to-end operational engine that powers...  ...platform as Teradata’s system of record for co-sell...  ...marketplace and co-sell infrastructure — including SKU/... 
    Permanent employment
    Contract work
    Flexible hours

    Teradata Corporation (SE)

    Austin, TX
    1 day ago
  • Compunnel, Inc. is looking for an experienced Systems Engineer to support large-scale technology transformations. The role focuses...  ...enterprise systems and involves working with AI-assisted workflows and cloud-native applications. Ideal candidates have strong expertise... 

    Compunnel, Inc.

    Austin, TX
    2 days ago
  •  ...future of how clients manage their money by...  ...data, analytics, and AI capabilities through modern cloud platforms and reusable engineering solutions that accelerate...  ...cloud automation and infrastructure as code to enhance efficiency...  ..., Information Systems, or a related technical... 
    Work at office

    Charles Schwab

    Austin, TX
    3 days ago
  • $118.3k - $251.6k

     ...Description The OCI AI Infrastructure Network...  ...running on Oracle Cloud Infrastructure. As a Senior Manager, you will lead a...  ...fabrics and supporting systems. This role...  ...combined with software engineering experience. You...  ...guide the design, debugging, and enhancement... 
    Temporary work
    Flexible hours
    Night shift

    Oracle

    Austin, TX
    2 days ago
  • eBay Inc. is seeking an Engineering Manager to lead its Cloud Security Team, specializing in Identity and Access Management (IAM). The role involves guiding the development of secure identity systems and adopting AI-powered solutions to enhance security and operational... 

    eBay Inc.

    Austin, TX
    1 day ago
  • $218k - $323.95k

     ...Principal Engineer, Data Platform Infrastructure And Performance Engineering This role is the technical authority...  ..., data pipelines, messaging systems, caching infrastructure, and performance...  ..., own technical strategy, pioneer AI-first engineering practices, and mentor... 
    Work at office
    Local area
    Immediate start
    Flexible hours

    PayPal

    Austin, TX
    1 day ago
  • $100k

     ...on cutting-edge AI technology,...  ...'s AI Software Infrastructure team builds the...  ...workloads, and manage large-scale AI...  ...on Tenstorrent systems. This role ishybrid...  ...infrastructure engineer with experience...  ..., and debugging production issues...  ...infrastructure differs from cloud-native... 
    Permanent employment

    Tenstorrent

    Austin, TX
    4 days ago
  • $82.5k - $199.5k

     ...Job Description Principal Product Manager - Cloud Database & AI Innovation Location: Redwood...  ...database cloud services in Oracle Cloud Infrastructure with wide adoption across the globe...  ...to work collaboratively with engineering on product requirements and designs... 
    Temporary work
    Flexible hours

    Oracle

    Austin, TX
    4 days ago
  •  ...software, and servers into oneunified system. You’ll join a team of engineers and architects who are dedicated to...  ...secure web services and scalable infrastructure for highly available applications....  ...skills. Comfort building and using AI-assisted workflows that enhance efficiency... 

    Apple Inc.

    Austin, TX
    3 days ago
  • $114.4k - $157.3k

    Procore Technologies, Inc. is searching for a Finance Manager, Product & Technology in Austin, TX, to provide financial insights and forecasting for cloud infrastructure and software investments. In this position, you will drive expense forecasting, manage technology portfolios... 

    Procore Technologies, Inc.

    Austin, TX
    19 hours ago
  •  ...Technical Product Manager, AI Cloud Networking Mirantis is the Kubernetes-native AI infrastructure company, enabling organizations to build and operate scalable, secure...  ...Kubernetes orchestration, Mirantis empowers platform engineering teams to deliver composable, production‑... 

    jobs.frontdoordefense.com - Jobboard

    Austin, TX
    3 days ago
  • $119.2k - $175.45k

     ...analytics platform engineering team is at the...  ...company. As a Senior Systems Engineer, you'll...  ...passionate about cloud technologies,...  ...architecture with an AI‑enabled...  ...application deployment and management. BI platforms...  ...Build and maintain Infrastructure as Code using... 
    H1b
    Relocation package
    Flexible hours

    General Motors

    Austin, TX
    4 days ago
  •  ...computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture...  ...for System and Silicon debug of AMD EPYC Server &...  ...failures working with engineering teams across AMD. Candidate...  ...on time with quality. Manage and track technical... 

    Advanced Micro Devices

    Austin, TX
    2 days ago
  • $231.5k - $272k

     ...Role: Confluent's Cloud Networking team is responsible...  ...-maker alongside engineering leadership on the...  ...retention, attach rates, and infrastructure cost efficiency...  ...~8+ years of product management experience, with at least...  ...indicators ~ Practices AI-native product... 
    Full time
    Remote work

    Confluent

    Austin, TX
    7 days ago
  • $147.05k - $230.85k

    Expert Systems Engineer, Applied AI & On-Device Technology (Windows...  ...processes, not only cloud pipelines. Optimize...  ...& Device Management: Build automation frameworks...  ...team’s automation lab infrastructure for continuous...  ...workflows System‑level debugging and performance... 
    Full time
    Work at office
    Local area
    Relocation
    Flexible hours
    Shift work

    Hewlett Packard Enterprise

    Austin, TX
    4 days ago
  •  ...and Modernization teams with AI-enabled Applications and...  ...dynamic and experienced Software Engineering Manager to join our talented team...  ...coding, testing, and debugging, to maintain high standards...  ...Experience with containers, cloud infrastructure, and DevOps patterns Familiarity... 
    Full time
    Work at office

    Govini

    Austin, TX
    8 days ago
  • $125.5k - $230.2k

     ...and Decision Science - Data Engineering - Manager We are looking for a dynamic...  ...and implementing complex cloud analytics solutions with a strong...  ...sources (e.g., ERPs, POS systems, e-commerce platforms) to enable...  ...markets. Enabled by data, AI and advanced technology, EY... 
    Summer holiday
    Flexible hours

    EY

    Austin, TX
    1 day ago
  •  ...generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of...  ...THE ROLE: The Platform Systems Engineer, Power Enablement will drive the...  ...hardware, and software Coordinate debug of issues and drive them to closure... 

    Advanced Micro Devices

    Austin, TX
    4 days ago
  •  ...Requirements Site Reliability Engineer - Infrastructure and Automation Austin...  ...report to the Sr. Manager of Engineering and...  ...across public and private cloud environments. Manage...  ...debt, and optimizing system performance. Use...  ...development tools, including AI‑assisted workflows, to... 

    Electronic Arts

    Austin, TX
    3 days ago
  • $218.8k - $335.3k

     ...is seeking an experienced Staff AI/ML Engineer to join the AV ML Infra team....  ...designing and implementing scalable ML infrastructure solutions and requires over 8...  ...in large-scale distributed systems. Candidates should be proficient in cloud technologies such as GCP/Azure... 

    General Motors

    Austin, TX
    4 days ago
  •  ...offering comprehensive engineering, supply chain, and...  ...the globe.Job Family: Systems & SecurityJob Profile...  ...within our Enterprise and Infrastructure division by applying...  ...-level validation, debugging, and customer sample...  ...embedded remote management standards, interfaces... 
    Work at office
    Local area
    Remote work
    Worldwide

    Jabil Malaysia

    Austin, TX
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to System Debug Engineer Manager, Cloud AI Infrastructure. Be the first to apply!