Senior AIOps SRE for AI Data Center Platform
NVIDIA Gruppe
NVIDIA Gruppe in Santa Clara is seeking an experienced engineer to build an AI Data Center AIOps platform. The ideal candidate will have a strong background in Kubernetes and automation, ensuring the reliability of GPU fleet management. Key responsibilities include monitoring platform health, owning infrastructure deployments, and leading incident resolution. Candidates should possess 5+ years of experience in production systems and a degree in CS/CE. A competitive salary and generous benefits package are offered. #J-18808-Ljbffr NVIDIA Gruppe
- ...tapping into the unlimited potential of AI to define the next era of computing. An... ...Overview You will be building an AI Data Center AIOps platform that turns raw, high‑volume telemetry into... ...production distributed systems as SRE/DevOps/Platform Ops. Proven ownership of...PlatformSenior
$184k - $356.5k
...is seeking an experienced Network Solutions Architect Engineer to help deploy next-generation AI networking platforms. You will guide architecture decisions across data centers, support on-site setups, and provide solutions to key technology customers. The role requires...PlatformSenior$159k - $231k
Google Inc. is seeking a Senior Hardware Systems Design Engineer in Sunnyvale, CA... ...role involves working on innovative ML/AI hardware systems for data center projects that push the boundaries of technology. As a member of the Platforms Infrastructure team, you will lead...PlatformSenior- A leading technology company is seeking a Senior Technical Marketing Engineer to focus on AI data center networking in Santa Clara. This role involves delivering content on NVIDIA’s networking platforms, articulating challenges with high-density deployments, and developing...PlatformSenior
- ...Senior Leadership Technical Program Manager Google's projects... ...programs and teams. Google Data Centers (GDC) make up one of the largest... ...Systems, Tooling, Data and AI team, will lead a high-impact... ...technology optimization and platform solutions, advanced data analytics...PlatformSenior
$182k - $242k
...Senior Software Engineer, Applied AI CoreWeave is The Essential Cloud for AI™. Built... ...pioneers, CoreWeave delivers a platform of technology, tools, and... ...Engineering team in the Data Infrastructure organization... ...in our office and data center locations ~ A casual work...PlatformSeniorPermanent employmentTemporary workCasual workWork at officeFlexible hours$190.61k - $361.48k
Job Overview Join Intel's AI Revolution. Intel's new AI SoC organization... ..., from edge devices to data‑center accelerators. We are seeking... ...across teams. Mentor senior engineers and provide technical... ...defining or influencing SoC or platform architecture roadmap across multiple...PlatformSeniorLocal areaShift work$189k - $210k
Job Overview Cohesity is a leader in AI‑powered data security and management. Aided by an extensive... ...get value from data across the data center, edge, and cloud. Cohesity helps... ...product managers in the industry to own a platform built from the ground up for agentic AI...PlatformSeniorWork at officeFlexible hoursShift work2 days per week3 days per week- ...-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded... ...center of that mission: making AMD the platform of choice for the most demanding AI... .... THE OPPORTUNITY We're looking for a senior software engineer who combines deep...PlatformSeniorShift work
- ...is looking for an experienced Network Solutions Architect Engineer to help bring our next-generation AI networking platforms into production at customer data centers. What you will be doing: Partner with AI-native / consumer internet customers on large data center GPU...PlatformSenior
$262k - $365k
Senior Staff Research Scientist, Google Cloud AI Research Google Sunnyvale, CA, USA Requirements... ...machine (and deep) learning, data mining, natural language... ...compilers for mobile platforms, as well as core search... ...worldwide. We’re at the center of amazing work at...PlatformSeniorFull timeWorldwide- ...generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems.... ...future of AI and beyond. THE ROLE: As a Senior Manager (Individual Contributor), AI... ...Data Center GPU and AI infrastructure platforms. This partner-facing role focuses on driving...PlatformSenior
$174k - $252k
Senior Software Engineer, Embedded Systems/Firmware, AI and Infrastructure Sunnyvale, CA, USA Bachelor’... ...years of experience with data structures/algorithms.... ...and maintaining our data centers to building the next generation of Google platforms, we make Google's product...PlatformSeniorFull timeWorldwide$160k - $253k
...NVIDIA, we are at the forefront of AI and accelerated computing, redefining... ...as the powerful NVIDIA GB300 NVL72 platform, delivering unmatched data center-scale performance.**What you'll be doing... ...status, risks, and insights to senior leadership.**What we need to see:***...PlatformSenior$262k - $365k
Senior Accelerators Systems Software Architect, AI Transformation corporate_fare Google place Sunnyvale, CA, USA Apply Bachelor's degree or... ...that interacts with hardware. Experience with data center servers and AI platforms. Preferred Qualifications Master’s degree...PlatformSeniorWorldwide$208k - $327.75k
NVIDIA is driving a vision for AI factories that convert tokens to intelligence at scale... .... Collaborate with NCP operators, SRE teams, and hardware vendor partners to integrate... ...management experience in infrastructure, platform, or MLOps areas, or equivalent background....PlatformSeniorLive in$397.8k
...We are hiring a Senior Director to lead the Cloud AI Platform Solutions organization. This team helps customers deploy Neoverse SoC platforms in systems... ...Experience leading customer delivery roles in the data center ecosystem (customer engineering, technical account leadership...PlatformSenior- A global technology leader is looking for an experienced SRE software engineer in Cupertino, California, to build and enhance compute... ...infrastructure for Apple's services. The role involves developing AI-powered tooling, automating deployment, and ensuring that services...PlatformSenior
$124k - $195.5k
...tapping into the unlimited potential of AI to define the next era of computing. An... ...a lasting impact on the world. NVIDIA platforms are at the center of generative AI, autonomous driving, industrial... ...robots, medical instruments and data centers across the world where GPU...PlatformSeniorFull time- ...-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded... ...we advance your career. THE ROLE This Senior Finance Manager role is a high-... ...operations to align financial plans with platform roadmaps and execution priorities Support...PlatformSenior
- ...seeking a Datacenter Product Engineer to join its Datacenter team in Santa Clara, California. This role focuses on launching AI supercomputing platforms and supporting GPU production. The ideal candidate will collaborate with NPI teams and implement process improvements...PlatformSenior
- Senior Solution Architect - AI / GPU Cloud Mountain View, California, United States GMI Cloud is seeking... ...Partner with Infrastructure, Data Center Ops, and Engineering teams Identify... ...product and engineering to improve the platform Required Qualifications Technical Background...PlatformSenior
- NVIDIA Corporation is hiring a Senior System Engineer in Santa Clara, California. In this role, you will ensure functionality and validation of GPU rack platforms before mass deployment. Collaborating with global teams, you will optimize hardware and debug early server...PlatformSenior
$164k - $205k
...to apply for the Senior Software Engineer... ...Cohesity Get AI-powered advice on this... ...leader in AI-powered data security and... ...— across the data center, edge, and cloud. Cohesity... ...infinitely scalable platform. Implement... ...Engineer Software (AIOps for NGFW) Senior...PlatformSeniorFull timeWork at officeRemote work2 days per week3 days per week$145k - $175k
...vertically integrated AI infrastructure company... ...energy, manufacturing, data center construction, and cloud... ...GPU compute power. As a Senior Cloud Support Engineer,... ...Collaboration: Work closely with SRE, Networking, and... ...other public cloud platforms (e.g., AWS, Azure, GCP)...PlatformSeniorFull timeTemporary work- A leading technology company is seeking a Senior Software Architect to innovate server systems for deep learning applications. This role involves leading software activities and system architecture design, requiring deep expertise in server performance and collaboration...PlatformSenior
- ...Senior Enterprise Sales Executive AI Inference Infrastructure Tensordyne (formerly Recogni) is building the next generation of AI inference infrastructure... ...startups at a pivotal inflection point, as our platform moves from development to market launch. We're hiring...PlatformSenior
$184k - $287.5k
...with developers, scientific researchers, and data scientists, gaining experience across a... ...solution partners targeting our computing platform* Working closely with customers to help them... ...performance and power efficiency of AI inference workloads on Kubernetes* Some travel...PlatformSenior- ...building the next generation of AI inference infrastructure for... ...environments. Role Description: - Senior Sales Executive, Hyperscale &... ..., and major enterprise cloud platforms. Company Description... ...is a high-impact role at the center of Tensordyne's growth, requiring...PlatformSeniorFull timeRemote work
- ...Systems builds the world’s largest AI chip with wafer-scale... ...applications. About The Role Senior Director of Technical Product... ...strategy for our AI infrastructure platform. This leader will operate at the... ...Clear, precise communicator. Data-driven decision maker. Operates...PlatformSenior
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior AIOps SRE for AI Data Center Platform. Be the first to apply!
- senior cloud service delivery manager Santa Clara, CA
- senior business analyst contract Santa Clara, CA
- senior product design engineer Santa Clara, CA
- senior game producer Santa Clara, CA
- senior software manager Santa Clara, CA
- senior manager business analytics Santa Clara, CA
- senior marketing account manager Santa Clara, CA
- senior marketing manager Santa Clara, CA
- senior contracts analyst Santa Clara, CA
- sr operations manager Santa Clara, CA


