Distinguished Engineer, Storage - AI Cloud

$320k

NVIDIA

# Distinguished Engineer, Storage - AI Cloud

locations: US, CA, Santa Clara
time type: Full time
posted on: Posted Yesterday
job requisition id: JR2018037

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology and amazing people. Today, we're tapping into the unlimited potential of AI to define the next era of computing: an era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent. As an NVIDIAN, you'll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

**AI Cloud Data Storage**

The NVIDIA DGXC Storage org handles some of the fastest training and inference tasks. Every GPU cycle depends on a storage platform built to keep tens of thousands of accelerators continuously busy. It maintains exabytes of data securely and powers the largest AI workloads worldwide across cloud, neocloud, and on-prem setups. With the growth of accelerated computing, storage is essential. It can make the difference between effective GPU use and wasted potential, and between launching a frontier model on time and missing the deadline by months.

We seek a Distinguished Engineer to lead NVIDIA's storage strategy for AI Cloud across the Neocloud Provider (NCP) and Cloud Service Provider (CSP) ecosystem. You will direct the architecture of high-performance parallel file systems, object stores, and block storage at exabyte scale. You will stay hands-on, collaborating with engineers, SREs, partners, and storage vendors. You will apply NVIDIA's AI tools to increase your own productivity and that of the people you work with.

This is a rare opportunity to define the storage framework of the AI era at the company that introduced accelerated computing.

## **What you'll be doing:**

* Lead the multi-year technical plan for AI Cloud Storage expansion across NCPs: determine the reference architecture, capabilities, performance and durability SLOs, qualification methodology, and roadmap for the high-performance file, object, and block storage that each NCP must offer to qualify for NVIDIA GPU allocation.
* Serve as the chief storage architect with deep hands-on involvement. Lead key reviews of storage builds and investigate root causes of complex production problems. Develop prototype reference implementations to reduce risk in new initiatives. Make final technical decisions on NCP storage deliveries using measurable SLOs. Apply AI tools heavily to amplify your technical influence throughout the program.
* Define the standard for "production-ready" in NCP storage, including durability and availability SLOs measured in 9s. Ensure sustained efficiency per TiB, observability, blast-radius containment, and reduced operational toil. Influence GPU delivery gating by requiring AI Cloud to accept GPU capacity only after verifying storage-focused ancillary services.
* Develop and guide the architectural direction by working closely with collaborators in training, inference, and accelerated-computing product lines. Coordinate with site-reliability, operations, networking, and security colleagues. Work with external cloud providers, neocloud operators, and storage vendors to align on a common architecture.
* Develop the open-source path forward for AI storage. Establish and guide an open-source strategy that broadens the AI storage ecosystem. Advocate for a GitHub-first, security-first stance. Engage deeply with upstream open-source communities. Formalize the APIs, SDKs, and protocols that allow partners and the industry to build, integrate, and create with NVIDIA at the AI storage level.
* Lead an engineering culture centered on AI tools. Regularly use modern AI coding and agentic tools in your daily tasks. Show what 10× engineering means at NVIDIA. Distribute patterns, prompts, and evaluation harnesses across the storage organization.
* Partner with peer Distinguished and Principal storage architects across the organization to tackle the most difficult, long-term technical challenges. Make automation the only acceptable solution for infrastructure management tasks like live software upgrades, node and drive replacements, capacity rebalancing, cross-DC data movement, and dataset lifecycle. Establish root-cause analysis and corrective action rigor on every major incident. Design the storage layer for workloads spanning the next several GPU generations, including disaggregated inference with storage-backed KV caching, large-scale write-once-read-many inference patterns, exabyte regional object stores, and cross-DC dataset versioning and copy management.
* Mentor and develop senior, principal, and distinguished engineers across the storage organization and nearby business units. Raise the technical bar broadly. Represent NVIDIA externally in standards bodies, open-source communities, customer briefings, and industry forums (FAST, SC, OCP, SNIA, Linux Storage Summit).

**What we need to see:**

* BS, MS, or PhD in Computer Science, Electrical Engineering, or a related field, or equivalent experience.
* 18+ years of hands-on engineering experience in storage technology, including extensive work with a high-performance parallel file system such as Lustre, GPFS / Spectrum Scale, WEKA, VAST, BeeGFS, DAOS, or an equivalent, handling data at multi-petabyte scale. Candidates must also have wide-ranging expertise in object storage (S3 / Swift-class) and block storage (NVMe-oF, NVMesh-class, iSCSI).
* A track record of designing and managing storage platforms at exabyte scale for performance-critical workloads (AI training, HPC, video, or hyperscale data lakes), including direct responsibility for durability, availability, and performance SLOs measured in 9s.
* Demonstrated ability to set technical strategy across business units and partner organizations. You have driven multi-year storage architectures adopted by multiple teams, vendors, or customers. You can point to measurable outcomes such as GPU utilization lift, $/PB reduction, incidents eliminated, and time-to-bring-up compressed.
* You are 100% hands-on in engineering. You write and review production code yourself. When a bug requires it, you read Lustre, NFS, kernel, NVMe-oF, or SPDK source code. You also run scale tests or recovery drills personally instead of delegating.
* Strong proficiency in at least one systems language (C, C++, Rust, or Go) and proficiency in Python; comfortable in the Linux kernel storage and networking stacks (block layer, RDMA / RoCE / InfiniBand, NVMe, page cache, VFS, multipath).
* Daily use of advanced AI coding and autonomous tools, with specific examples showing how you accelerated building, coding, debugging, validation, and operations, along with your perspective on future trends.
* Excellent written and verbal communication. You can write a one-pager that aligns a VP. You can also write a six-pager that aligns an entire org. You can explain a deep technical trade-off to an SRE, a vendor CTO, and an internal customer in the same week.
* Comfort operating in a 24/7 production environment where storage incidents directly impact GPU revenue, with a security-first approach baked into every build.

**Ways to stand out from the crowd:**

* Proven background in designing or managing storage solutions for AI training or inference at 10k+ GPU scale, demonstrating clear improvements in GPU utilization or reductions in I/O bottlenecks.
* Open-source contributions or maintainership in Lustre, NFS, SPDK, NVMe / NVMe-oF, CSI, Ceph, MinIO, RocksDB, or related projects.
* Built or led a disaggregated-inference or Inference-Time-Compute storage architecture: KV caching to fast in-cluster or GPU-adjacent storage, WORM at scale, storage-aware scheduling, or database-integrated inference.
* Public technical contributions (patents, peer-reviewed papers at FAST, SOSP, NSDI, OSDI, or ATC, keynote talks, or RFCs) that demonstrate expertise and leadership in storage for AI infrastructure.

NVIDIA led the way in accelerated computing. Today, our AI infrastructure drives global intelligence, changing industries worldwide. The AI Cloud Storage group forms the base that keeps the world's largest GPU fleet productive. Every model trained, every inference served, and every checkpoint saved passes through systems we develop, construct, and manage.

Widely considered to be one of the technology world's most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 320,000 USD - 488,750 USD. You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until May 17, 2026.

Vacancy posted 2 days ago
Similar jobs that could be interesting for you
Based on the Distinguished Engineer, Storage - AI Cloud in Santa Clara, CA vacancy
  •  ...Hewlett Packard Enterprise Development LP is seeking a Distinguished Technologist & Director for AI Strategy and Enablement in Milpitas, California....  ...with extensive experience in AI/ML, software engineering, and cloud infrastructures. Candidates should be ready to engage... 
    Cloud

    Hewlett Packard Enterprise Development LP

    Milpitas, CA
    4 days ago
  •  ...Distinguished Technologist, Private Cloud AI This role has been designed as 'Hybrid' with an expectation that you will work on average 2 days per week...  ...reusable AI components and agents and partner with engineering to take POCs into scalable, production-grade services... 
    Cloud
    Work at office
    2 days per week

    Hewlett Packard Enterprise

    San Jose, CA
    a month ago
  • $140k - $210k

     ...hiring a Developer Relations Engineer to help developers understand...  ...and use Tigris to solve real storage and architecture challenges. This...  ...engaging Solid experience in AI/ML (critical to the use cases...  ...open You know your way around cloud infrastructure, storage systems... 
    Cloud
    Full time
    Remote work

    Tigris Data, Inc.

    Sunnyvale, CA
    5 days ago
  • $150k - $300k

    Distinguished Engineer, Applied AI. Locations: Palo Alto, CA. Time type: Full time. Posted on: Posted...  ...customers. We are building an AI-powered CRM platform using agentic capabilities...  ...with AWS, GCP, Azure, or another cloud service. 6+ years of experience in CI/CD... 
    Cloud
    Hourly pay
    Work experience placement
    Local area
    Flexible hours
    Shift work

    GEICO

    Palo Alto, CA
    3 days ago
  •  ...Senior Distinguished Technologist – Pre-Sales AI & Data Center Networking  This role has been designated as ‘...  ...Packard Enterprise is the global edge-to-cloud company advancing the way people...  ...of innovation. Our Sales Engineering team empowers customers and partners... 
    Cloud
    Work experience placement
    Remote work
    Work from home

    HPE

    San Jose, CA
    4 days ago
  • $320k

    Distinguished Engineer - Accelerated Apache Spark NVIDIA is seeking a Distinguished Engineer for the...  ...Multi-node GPU deployments will reduce cloud computing costs and lower latency of large...  ...for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA... 
    Cloud
    Work experience placement

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • Distinguished Technologist & Director, AI Strategy and Enablement (OCTO & HPE Labs). Locations: Milpitas,...  ...Packard Enterprise is the global edge-to-cloud company advancing the way people...  ...and maintainable capabilities for engineering and field teams. Your charter... 
    Cloud
    Work experience placement
    Work at office
    2 days per week

    Hewlett Packard Enterprise Development LP

    Milpitas, CA
    2 days ago
  • $365k

    Distinguished Engineer and UTL, Google Cloud Security Google Sunnyvale, CA, USA Director+ Bachelor's degree in Computer Science, a related technical field...  ...cloud computing, or a related field. Experience with AI and machine learning, and a passion for applying these... 
    Cloud
    Full time
    Worldwide

    Google Inc.

    Sunnyvale, CA
    2 days ago
  •  ...A leading cloud infrastructure company is seeking a Solutions Architect experienced in storage and cloud computing. This role involves developing tailored solutions for customers and showcasing CoreWeave’s offerings. Ideal candidates have strong technical skills, experience... 
    Cloud
    Remote work

    CoreWeave

    Sunnyvale, CA
    4 days ago
  • $144k - $209k

    Product Engineer, Servers and Storage Systems Google Sunnyvale, CA, USA Apply Bachelor's degree in Engineering...  ...for a seamless user experience. The AI and Infrastructure team is redefining...  ...customers include Googlers, Google Cloud customers, and billions of Google users... 
    Cloud
    Full time
    Contract work
    Worldwide

    Google Inc.

    Sunnyvale, CA
    1 day ago
  • $209.77k - $314.3k

     ...Distinguished Engineer Marvell's semiconductor solutions are the essential building blocks of the data infrastructure that connects our world. Across enterprise, cloud and AI, and carrier architectures, our innovative technology is enabling new possibilities. At... 
    Cloud
    Permanent employment
    Internship
    Work from home

    Marvell

    Santa Clara, CA
    2 days ago
  • $320k

    Distinguished Engineer - Rack Scale Architecture. Distinguished...  ...the unlimited potential of AI to define the next era of computing...  ...growing enterprise and cloud provider businesses. Each...  ...architectures. Knowledge in storage and networking technologies.... 
    Cloud
    Shift work

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • $115k - $135k

     ...Performance Engineer We are looking for a Performance Engineer to...  ...Lenovo's cross-device Personal AI that works across phones, PCs,...  .... Qira combines on-device and cloud intelligence to provide fast,...  ...tablets), infrastructure (server, storage, edge, high performance... 
    Cloud
    Work at office
    Local area
    Remote work

    Lenovo

    San Jose, CA
    2 days ago
  • $279.5k - $419.5k

     ...high-performance, energy efficient AI compute. Ampere is part of the...  ...driving sustainable computing for AI, Cloud, and edge applications. Join us at...  ...the role Our Hardware Design Engineering organization is seeking a Distinguished Signal Integrity Engineer to be part... 
    Cloud

    Ampere Computing

    Santa Clara, CA
    1 day ago
  • $320k

     ...data centers is the ability to engineer integrated system designs in...  ...a Global Connectivity Distinguished Engineer to accelerate next-generation...  ...and subsea—that interconnect AI Factories. You will act as...  ...leadership role within a Hyperscale Cloud Provider or a Tier-1 Global... 
    Cloud

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • **Distinguished Technologist, ASIC Design Architect** We are seeking an experienced ASIC Design Architect to own the hardware architecture...  ...traditional IT environments. Most are transitioning to a secure, cloud-enabled, mobile-friendly infrastructure. Many rely on a... 
    Cloud
    Local area

    Hewlett Packard Enterprise Development LP

    Sunnyvale, CA
    5 days ago
  •  ...Packard Enterprise is the global edge-to-cloud company advancing the way people live...  ...Open up opportunities with HPE. Role Distinguished Technologist, ASIC Design Architect — We...  ...to meet PPA targets. Mentor and grow engineering teams; promote architecture best practices... 
    Cloud
    Work experience placement
    Work at office

    Hewlett Packard Enterprise

    Sunnyvale, CA
    3 days ago
  • NVIDIA Corporation is hiring a Distinguished Engineer for AI Cloud Storage in Santa Clara, California. This critical role involves leading the storage strategy across various cloud providers and requires a deep technical background in storage technologies at an exabyte... 
    Cloud

    NVIDIA Corporation

    Santa Clara, CA
    2 days ago
  • $365k

    A leading technology company is seeking a Distinguished Engineer and Uber Tech Lead for its Google Cloud Security team in Sunnyvale, CA. In this pivotal role, you'll drive the long-term vision at the intersection of AI and cybersecurity. Responsibilities include leading... 
    Cloud

    Google Inc.

    Sunnyvale, CA
    2 days ago
  • $351.75k

    Job Summary We are seeking a visionary Distinguished Engineer to drive our AI‑First Transformation, bridging the gap between AI technology and business impact. You will re‑imagine how our enterprise operates by rewriting operational workflows to drive efficiency, personalization... 

    Palo Alto Networks, Inc.

    Santa Clara, CA
    4 days ago
  •  ...experienced Senior DevOps Engineer to join NVIDIA’s...  ...CI failures to clearly distinguish infrastructure issues from...  ...experience using AWS or similar cloud platforms to support CI...  ...including networking, storage, performance, and...  ...vacancy. NVIDIA uses AI tools in its recruiting... 
    Cloud
    Night shift

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $320k

     ...and HGX, are core to our enterprise and cloud offerings. We seek a technical architect...  ...BS or MS in Computer Science, Electrical Engineering, or related field (or equivalent...  ..., cuDNN, DOCA). Knowledge of enterprise storage architectures and distributed parallel processing... 
    Cloud
    Shift work

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $180k - $270k

     ...A leading data storage company located in Santa Clara is seeking a Senior Software Engineer for their Hyperscale Line of Business. This role involves designing innovative...  ...algorithms and solutions for high-performance cloud storage. Candidates should have 5+ years of... 
    Cloud

    Pure Storage

    Santa Clara, CA
    4 days ago
  • $150k - $230k

     ...industry leader in data-driven, client-to-cloud networking for large data center, campus...  ...several prestigious awards, such as Best Engineering Team, Best Company for Diversity, Compensation...  ..., LRO, and LPO—the next generation of AI data center networking solutions. Lead... 
    Cloud

    Arista Networks

    Santa Clara, CA
    20 days ago
  •  ...to harness the power of production-grade AI agents, without the need for specialized skills...  .... Job title Security Compliance Engineer Position overview We are seeking a...  ...The ideal candidate will be adept at using cloud technologies, particularly AWS, and have... 
    Cloud
    Flexible hours

    Brevian.ai

    Sunnyvale, CA
    3 days ago
  • $149k - $216k

     ...experience. 6 years of experience with cloud native architecture in a customer-facing...  ...years of experience with conversational AI technology. Experience building or leveraging...  ...the product marketing management and engineering teams to stay on top of industry trends and... 
    Cloud
    Full time

    Google Inc.

    Sunnyvale, CA
    5 days ago
  •  ...Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel...  ...10 times faster than GPU-based hyperscale cloud inference services. This order of...  ...experienced AI Infrastructure Operations Engineer to manage and operate our cutting-edge machine... 
    Cloud

    CEREBRAS SYSTEMS INC.

    Sunnyvale, CA
    5 days ago
  •  ...Role: Materials Science AI Engineer Location: Santa Clara, CA - 5D Onsite Duration: 6-12+ Months Contract Must Have Skills...  ...aggregating and structuring training data, statistical theory, and cloud-based compute for parallelized, scalable, and automated workflows... 
    Cloud
    Contract work
    Work experience placement

    Cardinal Integrated Technologies, Inc.

    Santa Clara, CA
    2 days ago
  • $139k - $204k

     ...Job Description CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform...  ...more at What You'll Do: We're looking for a Senior Storage Engineer, Control Plane to play a key role in designing, building, and... 
    Cloud
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    3 days ago
  • $165k - $220k

     ...Job Description CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a...  ...CX organization aligns closely with the internal and customer engineering teams, offering valuable insights from the field and having the... 
    Cloud
    Permanent employment
    Temporary work
    Casual work
    Work at office
    Flexible hours

    CoreWeave

    Sunnyvale, CA
    22 days ago
