Senior AIOps SRE for AI Data Center Platform
NVIDIA Gruppe
NVIDIA Gruppe in Santa Clara is seeking an experienced engineer to build an AI Data Center AIOps platform. The ideal candidate will have a strong background in Kubernetes and automation, ensuring the reliability of GPU fleet management. Key responsibilities include monitoring platform health, owning infrastructure deployments, and leading incident resolution. Candidates should possess 5+ years of experience in production systems and a degree in CS/CE. A competitive salary and generous benefits package are offered.
- ...tapping into the unlimited potential of AI to define the next era of computing. An... ...Overview You will be building an AI Data Center AIOps platform that turns raw, high‑volume telemetry into... ...production distributed systems as SRE/DevOps/Platform Ops. Proven ownership of... Platform, Senior
- ...Architect focused on security architecture for Client and Data Center SoCs. You'll drive AI-driven tools for analysis and patching vulnerabilities... ...The ideal candidate will need extensive experience in SOC/platform security architecture and proficiency with AI tools. A... Platform, Senior
$184k - $356.5k
...is seeking an experienced Network Solutions Architect Engineer to help deploy next-generation AI networking platforms. You will guide architecture decisions across data centers, support on-site setups, and provide solutions to key technology customers. The role requires... Platform, Senior
$136k - $212.75k
NVIDIA Gruppe in Santa Clara is hiring a Senior Power System Engineer to lead the development... ...high-current power systems for advanced AI accelerators. The role involves architecting power delivery systems for data center platforms and collaborating with cross-functional... Platform, Senior
- ...A leading technology company is seeking a Senior Technical Marketing Engineer to focus on AI data center networking in Santa Clara. This role involves delivering content on NVIDIA’s networking platforms, articulating challenges with high-density deployments, and developing... Platform, Senior
- ...Senior Solution Network Architect, Enterprise Products Responsibilities... ...solutions for enterprise AI/ML systems Craft detailed... ...understanding of Ethernet, InfiniBand, data center LAN, WAN, WiFi, and SD... ...for on-prem cloud-native platforms Architectural Design Skills:... Platform, Senior, Remote work
- ...Systems builds the world's largest AI chip, 56 times larger than... ...Sr. TPM role owns site and data center operations programs... ...metrics, and operational risks to senior leadership Required Background... ...basics ~ Hardware-centric platforms Proven ability to define... Platform, Senior
$190.61k - $361.48k
...Job Overview Join Intel's AI Revolution. Intel's new AI SoC... ...applications, from edge devices to data‑center accelerators. We are seeking... ...across teams. Mentor senior engineers and provide technical... ...defining or influencing SoC or platform architecture roadmap across multiple... Platform, Senior, Local area, Shift work
$189k - $210k
...About Cohesity Cohesity is a leader in AI-powered data security and management. It makes it easy... ...get value from data across the data center, edge, and cloud. The company defends against... .... Have experience building a Gen AI Platform from beginning to end. Job Description... Platform, Senior, Hourly pay, Full time, Work at office, Shift work, 2 days per week, 3 days per week
$152k - $241.5k
...NVIDIA seeks a senior software engineer to join the AI Networking co-design and benchmark R&D team. In this... ...utilization of system resources at data center scale. The role involves working on... ...with diverse hardware and platforms, such as Host Channel Adapters (HCAs... Platform, Senior
$148k - $235.75k
...NVIDIA is looking for a Senior AI Compute Engineer to join its Infrastructure... ...to revolutionize deep learning and data analytics, and to power data centers. Join the team building many of the... ...GPFS. Familiarity with OEM GPU platforms NVIDIA is widely considered... Platform, Senior, Remote work
$224k - $356.5k
...Station (Galaxy) is NVIDIA’s workstation-class AI computer, built on GB300 Blackwell GPUs with NVLink interconnect, delivering data-center-grade AI compute in a deskside form factor... ...-GPU, high-bandwidth architecture of this platform. We are looking for a deeply technical... Platform, Senior, Local area
- ...Business Area Engineering Seniority Level Mid-Senior level... ...to transform complex data into clear and actionable... ...databases, and AI. Cloudera is looking for... ...AI and machine learning platform. You will be responsible... ...provider or in private data centers. You’ll work with... Platform, Senior, Work from home, Worldwide, Flexible hours
- ...-generation computing experiences, from AI and data centers to PCs, gaming and embedded systems. Grounded... ...technology. THE PERSON: As a Senior Staff Software Developer, you will be... ...AI agents, ensuring AMD is the platform of choice for the most demanding workloads... Platform, Senior
- ...is looking for an experienced Network Solutions Architect Engineer to help bring our next-generation AI networking platforms into production at customer data centers. Do you want to be part of a team that brings new AI hardware and software technologies to production... Platform, Senior, Remote work
$152k - $241.5k
...brings Artificial Intelligence (AI) to some of the biggest... ...Learning (ML), Deep Learning (DL), Data Analytics and other related topics... ...on various Cloud Computing Platforms. As part of the NVIDIA... .../containers, Kubernetes, data center deployments, etc. ~ Effective... Platform, Senior, Local area, Remote work
$172.43k - $230.95k
...Senior Software Engineer for the AI Model Lifecycle Team Crusoe is on a mission to accelerate the abundance... ...across energy, manufacturing, data center construction, and cloud services.... ...in building a comprehensive managed platform for the entire application development... Platform, Senior, Temporary work
$208k - $327.75k
...NVIDIA is driving a vision for AI factories that convert tokens to intelligence at scale... .... Collaborate with NCP operators, SRE teams, and hardware vendor partners to integrate... ...management experience in infrastructure, platform, or MLOps areas, or equivalent background.... Platform, Senior, Live in
- ...‑generation computing experiences, from AI and data centers to PCs, gaming and embedded systems. Grounded... .... THE ROLE The Enterprise AI Partner Senior Specialist role is an opportunity to... ...vendors that align with company platform and ecosystem priorities. Ideate, drive... Platform, Senior
$187.2k - $208k
...location. Cohesity is a leader in AI-powered data security and management.... ...from data, across the data center, edge, and cloud. Cohesity... ...selling practices. As the Senior Enablement AI Specialist, you... ...teams that own the core AI platform, providing application‑level... Platform, Senior, Full time, Work at office, 2 days per week, 3 days per week
$262k - $365k
Senior Staff Research Scientist, Google Cloud AI Research Google Sunnyvale, CA, USA Requirements... ...machine (and deep) learning, data mining, natural language... ...compilers for mobile platforms, as well as core search... ...worldwide. We’re at the center of amazing work at... Platform, Senior, Full time, Worldwide
- ...generation computing experiences, from AI and data centers to PCs, gaming and embedded systems.... ...future of AI and beyond. The Role As a Senior Manager (Individual Contributor), AI... ...Data Center GPU and AI infrastructure platforms. This partner‑facing role focuses on driving... Platform, Senior
$174k - $252k
Senior Software Engineer, Embedded Systems/Firmware, AI and Infrastructure Sunnyvale, CA, USA Bachelor’... ...years of experience with data structures/algorithms.... ...and maintaining our data centers to building the next generation of Google platforms, we make Google's product... Platform, Senior, Full time, Worldwide
$174k - $252k
Senior Software Engineer, Performance, AI and Infrastructure Google Sunnyvale, CA, USA Bachelor... ...performance, large scale systems data analysis, visualization... ...and maintaining our data centers to building the next generation of Google platforms, we make Google's product portfolio... Platform, Senior, Full time, Worldwide
$166k - $244k
Senior Software Engineer, Infrastructure, Google Cloud AI Note: By applying... ...of experience with data structures/algorithms.... ...providing the essential platforms that enable developers... ...Global Networking, Data Center operations, systems... Platform, Senior, Full time, Worldwide
$166k - $244k
Senior Software Engineer, AI/ML GenAI, Google Cloud AI Google Sunnyvale, CA, USA Qualifications... ...evaluation, optimization, data processing, debugging). 3... ...worldwide. We’re at the center of amazing work at Google... ...services, and offers platforms that developers use to build... Platform, Senior, Full time, Worldwide
$131k - $175k
...Senior Hardware Systems Engineer – AI Rack & Cluster Infrastructure Arista Networks is an industry leader in data-driven, client-to-cloud networking for large data center, campus and routing environments. What sets... ..., ensuring Arista platforms integrate cleanly into... Platform, Senior, Remote work, Flexible hours
- ...A global technology leader is looking for an experienced SRE software engineer in Cupertino, California, to build and enhance compute... ...infrastructure for Apple's services. The role involves developing AI-powered tooling, automating deployment, and ensuring that services... Platform, Senior
$165k - $242k
...The Essential Cloud for AI™. Built for pioneers by... ..., CoreWeave delivers a platform of technology, tools,... ...What You'll Do: As a Senior Software Engineer II (IC... ...and performance using data and operational metrics... ...in our office and data center locations ~ A casual... Platform, Senior, Permanent employment, Temporary work, Casual work, Work at office, Flexible hours
- ...located in Santa Clara, is seeking a talented engineer to join their platform SWQA team, focusing on server integration and automation. The... ...with OS automation, strong Linux skills, and a background in AI tools. You will contribute to the development and execution of NVIDIA... Platform, Senior