Senior AIOps SRE for AI Data Center Platform

NVIDIA Gruppe

NVIDIA Gruppe in Santa Clara is seeking an experienced engineer to build an AI Data Center AIOps platform. The ideal candidate will have a strong background in Kubernetes and automation, ensuring the reliability of GPU fleet management. Key responsibilities include monitoring platform health, owning infrastructure deployments, and leading incident resolution. Candidates should possess 5+ years of experience in production systems and a degree in CS/CE. A competitive salary and generous benefits package are offered.

Vacancy posted 4 days ago
Similar jobs that could be interesting for you
Based on the Senior AIOps SRE for AI Data Center Platform vacancy in Santa Clara, CA
  •  ...tapping into the unlimited potential of AI to define the next era of computing. An...  ...Overview You will be building an AI Data Center AIOps platform that turns raw, high‑volume telemetry into...  ...production distributed systems as SRE/DevOps/Platform Ops. Proven ownership of... 
    Platform
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    4 days ago
  • $184k - $356.5k

     ...is seeking an experienced Network Solutions Architect Engineer to help deploy next-generation AI networking platforms. You will guide architecture decisions across data centers, support on-site setups, and provide solutions to key technology customers. The role requires... 
    Platform
    Senior

    NVIDIA Gruppe

    Santa Clara, CA
    2 days ago
  • $159k - $231k

    Google Inc. is seeking a Senior Hardware Systems Design Engineer in Sunnyvale, CA...  ...role involves working on innovative ML/AI hardware systems for data center projects that push the boundaries of technology. As a member of the Platforms Infrastructure team, you will lead... 
    Platform
    Senior

    Google Inc.

    Sunnyvale, CA
    3 days ago
  • A leading technology company is seeking a Senior Technical Marketing Engineer to focus on AI data center networking in Santa Clara. This role involves delivering content on NVIDIA’s networking platforms, articulating challenges with high-density deployments, and developing... 
    Platform
    Senior

    NVIDIA

    Santa Clara, CA
    3 days ago
  •  ...Senior Leadership Technical Program Manager Google's projects...  ...programs and teams. Google Data Centers (GDC) make up one of the largest...  ...Systems, Tooling, Data and AI team, will lead a high-impact...  ...technology optimization and platform solutions, advanced data analytics... 
    Platform
    Senior

    Google

    Sunnyvale, CA
    2 days ago
  • $182k - $242k

     ...Senior Software Engineer, Applied AI CoreWeave is The Essential Cloud for AI™. Built...  ...pioneers, CoreWeave delivers a platform of technology, tools, and...  ...Engineering team in the Data Infrastructure organization...  ...in our office and data center locations ~ A casual work... 
    Platform
    Senior
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    5 days ago
  • $190.61k - $361.48k

    Job Overview Join Intel's AI Revolution. Intel's new AI SoC organization...  ..., from edge devices to data‑center accelerators. We are seeking...  ...across teams. Mentor senior engineers and provide technical...  ...defining or influencing SoC or platform architecture roadmap across multiple... 
    Platform
    Senior
    Local area
    Shift work

    Intel Corporation

    Santa Clara, CA
    5 days ago
  • $189k - $210k

    Job Overview Cohesity is a leader in AI‑powered data security and management. Aided by an extensive...  ...get value from data across the data center, edge, and cloud. Cohesity helps...  ...product managers in the industry to own a platform built from the ground up for agentic AI... 
    Platform
    Senior
    Work at office
    Flexible hours
    Shift work
    2 days per week
    3 days per week

    Cohesity Inc.

    Santa Clara, CA
    1 day ago
  •  ...-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded...  ...center of that mission: making AMD the platform of choice for the most demanding AI...  .... THE OPPORTUNITY We're looking for a senior software engineer who combines deep... 
    Platform
    Senior
    Shift work

    Advanced Micro Devices, Inc.

    Santa Clara, CA
    4 days ago
  •  ...is looking for an experienced Network Solutions Architect Engineer to help bring our next-generation AI networking platforms into production at customer data centers. What you will be doing: Partner with AI-native / consumer internet customers on large data center GPU... 
    Platform
    Senior

    NVIDIA

    Santa Clara, CA
    1 day ago
  • $262k - $365k

    Senior Staff Research Scientist, Google Cloud AI Research Google Sunnyvale, CA, USA Requirements...  ...machine (and deep) learning, data mining, natural language...  ...compilers for mobile platforms, as well as core search...  ...worldwide. We’re at the center of amazing work at... 
    Platform
    Senior
    Full time
    Worldwide

    Google Inc.

    Sunnyvale, CA
    1 day ago
  •  ...generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems....  ...future of AI and beyond. THE ROLE: As a Senior Manager (Individual Contributor), AI...  ...Data Center GPU and AI infrastructure platforms. This partner-facing role focuses on driving... 
    Platform
    Senior

    Advanced Micro Devices, Inc.

    Santa Clara, CA
    2 days ago
  • $174k - $252k

    Senior Software Engineer, Embedded Systems/Firmware, AI and Infrastructure Sunnyvale, CA, USA Bachelor’...  ...years of experience with data structures/algorithms....  ...and maintaining our data centers to building the next generation of Google platforms, we make Google's product... 
    Platform
    Senior
    Full time
    Worldwide

    Google Inc.

    Sunnyvale, CA
    4 days ago
  • $160k - $253k

     ...NVIDIA, we are at the forefront of AI and accelerated computing, redefining...  ...as the powerful NVIDIA GB300 NVL72 platform, delivering unmatched data center-scale performance. What you'll be doing...  ...status, risks, and insights to senior leadership. What we need to see:... 
    Platform
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    1 day ago
  • $262k - $365k

    Senior Accelerators Systems Software Architect, AI Transformation Google Sunnyvale, CA, USA Bachelor's degree or...  ...that interacts with hardware. Experience with data center servers and AI platforms. Preferred Qualifications Master’s degree... 
    Platform
    Senior
    Worldwide

    Google Inc.

    Sunnyvale, CA
    3 days ago
  • $208k - $327.75k

    NVIDIA is driving a vision for AI factories that convert tokens to intelligence at scale...  .... Collaborate with NCP operators, SRE teams, and hardware vendor partners to integrate...  ...management experience in infrastructure, platform, or MLOps areas, or equivalent background.... 
    Platform
    Senior
    Live in

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $397.8k

     ...We are hiring a Senior Director to lead the Cloud AI Platform Solutions organization. This team helps customers deploy Neoverse SoC platforms in systems...  ...Experience leading customer delivery roles in the data center ecosystem (customer engineering, technical account leadership... 
    Platform
    Senior

    Arm Limited

    San Jose, CA
    22 hours ago
  • A global technology leader is looking for an experienced SRE software engineer in Cupertino, California, to build and enhance compute...  ...infrastructure for Apple's services. The role involves developing AI-powered tooling, automating deployment, and ensuring that services... 
    Platform
    Senior

    Apple Inc.

    Cupertino, CA
    3 days ago
  • $124k - $195.5k

     ...tapping into the unlimited potential of AI to define the next era of computing. An...  ...a lasting impact on the world. NVIDIA platforms are at the center of generative AI, autonomous driving, industrial...  ...robots, medical instruments and data centers across the world where GPU... 
    Platform
    Senior
    Full time

    NVIDIA

    Santa Clara, CA
    1 day ago
  •  ...-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded...  ...we advance your career. THE ROLE This Senior Finance Manager role is a high-...  ...operations to align financial plans with platform roadmaps and execution priorities Support... 
    Platform
    Senior

    Advanced Micro Devices, Inc.

    San Jose, CA
    2 days ago
  •  ...seeking a Datacenter Product Engineer to join its Datacenter team in Santa Clara, California. This role focuses on launching AI supercomputing platforms and supporting GPU production. The ideal candidate will collaborate with NPI teams and implement process improvements... 
    Platform
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    5 days ago
  • Senior Solution Architect - AI / GPU Cloud Mountain View, California, United States GMI Cloud is seeking...  ...Partner with Infrastructure, Data Center Ops, and Engineering teams Identify...  ...product and engineering to improve the platform Required Qualifications Technical Background... 
    Platform
    Senior

    Glint Tech Solutions LLC

    Mountain View, CA
    3 days ago
  • NVIDIA Corporation is hiring a Senior System Engineer in Santa Clara, California. In this role, you will ensure functionality and validation of GPU rack platforms before mass deployment. Collaborating with global teams, you will optimize hardware and debug early server... 
    Platform
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • $164k - $205k

     ...to apply for the Senior Software Engineer...  ...Cohesity Get AI-powered advice on this...  ...leader in AI-powered data security and...  ...— across the data center, edge, and cloud. Cohesity...  ...infinitely scalable platform. Implement...  ...Engineer Software (AIOps for NGFW) Senior... 
    Platform
    Senior
    Full time
    Work at office
    Remote work
    2 days per week
    3 days per week

    Cohesity

    Santa Clara, CA
    3 days ago
  • $145k - $175k

     ...vertically integrated AI infrastructure company...  ...energy, manufacturing, data center construction, and cloud...  ...GPU compute power. As a Senior Cloud Support Engineer,...  ...Collaboration: Work closely with SRE, Networking, and...  ...other public cloud platforms (e.g., AWS, Azure, GCP)... 
    Platform
    Senior
    Full time
    Temporary work

    Epoch Biodesign

    Sunnyvale, CA
    4 days ago
  • A leading technology company is seeking a Senior Software Architect to innovate server systems for deep learning applications. This role involves leading software activities and system architecture design, requiring deep expertise in server performance and collaboration... 
    Platform
    Senior

    NVIDIA

    Santa Clara, CA
    2 days ago
  •  ...Senior Enterprise Sales Executive AI Inference Infrastructure Tensordyne (formerly Recogni) is building the next generation of AI inference infrastructure...  ...startups at a pivotal inflection point, as our platform moves from development to market launch. We're hiring... 
    Platform
    Senior

    Tensordyne

    Sunnyvale, CA
    4 days ago
  • $184k - $287.5k

     ...with developers, scientific researchers, and data scientists, gaining experience across a...  ...solution partners targeting our computing platform. Working closely with customers to help them...  ...performance and power efficiency of AI inference workloads on Kubernetes. Some travel... 
    Platform
    Senior

    NVIDIA Corporation

    Santa Clara, CA
    4 days ago
  •  ...building the next generation of AI inference infrastructure for...  ...environments. Role Description: - Senior Sales Executive, Hyperscale &...  ..., and major enterprise cloud platforms. Company Description...  ...is a high-impact role at the center of Tensordyne's growth, requiring... 
    Platform
    Senior
    Full time
    Remote work

    Tensordyne

    Sunnyvale, CA
    1 day ago
  •  ...Systems builds the world’s largest AI chip with wafer-scale...  ...applications. About The Role Senior Director of Technical Product...  ...strategy for our AI infrastructure platform. This leader will operate at the...  ...Clear, precise communicator. Data-driven decision maker. Operates... 
    Platform
    Senior

    Cerebras

    Sunnyvale, CA
    3 days ago
