Senior Systems Engineer, Storage - DGX Cloud
$208k - $333.5k
NVIDIA Corporation
# Senior Systems Engineer, Storage - DGX Cloud

locations: US, CA, Remote; US, NC, Remote; US, IL, Remote; US, CO, Remote; US, OR, Remote
time type: Full time
posted on: Posted Today
job requisition id: JR2019408

Systems Engineering is an engineering discipline focused on building, automating, and operating the platforms and tooling that deliver large-scale production systems with high efficiency, reliability, and velocity. It combines software and systems engineering practices across infrastructure automation, containerized platforms, storage, telemetry, and observability. Systems engineers are highly specialized and possess expertise across domains such as Kubernetes and container orchestration, infrastructure-as-code, CI/CD, storage systems, monitoring, and analytical troubleshooting. Their responsibilities center on deploying and operating reliable, automated platforms and on building the tools and services that keep storage and data infrastructure healthy and performant.

Our team at NVIDIA ensures that our internal and external facing GPU cloud services are deployed reliably, observable end-to-end, and continuously improved through automation. We enable developers to ship changes safely through repeatable CI/CD pipelines and Kubernetes-based deployments while keeping an eye on capacity, latency, and performance. A core part of this work is an SRE mindset: eliminating manual toil through automation, building self-service tooling, and growing the efficiency of production systems. We use a breadth of tools and approaches to tackle a broad spectrum of problems, and practices such as blameless postmortems, proactive identification of failure modes, and iterative improvement are key to product quality and to an interesting, dynamic day-to-day.

Our culture of diversity, intellectual curiosity, problem-solving, and openness is important to our success. Our organization brings together people with a wide variety of backgrounds, experiences, and perspectives. We encourage them to collaborate, think big, and take risks in a blame-free environment. We promote self-direction to work on meaningful projects while striving to build an environment that provides the support and mentorship needed to learn and grow.

**What You Will Be Doing:**

* Design, deploy, and operate solutions on Kubernetes for large-scale storage and data platforms, including the manifests, Helm charts, and operators that run them.
* Build tools, services, and automation that improve the lifecycle of storage and data systems – from provisioning and configuration through deployment, scaling, and day-2 operations.
* Develop and operate telemetry and observability for production systems – metrics, logging, tracing, dashboards, and alerting – so that system health, availability, and latency are measurable and actionable.
* Apply strong analytical troubleshooting skills to diagnose and resolve complex issues across distributed, containerized infrastructure.
* Work closely with peers and partner teams to improve the lifecycle of services, from inception and design through deployment, operation, and refinement.
* Scale systems sustainably through automation, infrastructure-as-code, and CI/CD, and evolve systems by pushing for changes that improve reliability and velocity.
* Support services before they go live through activities such as deployment automation, capacity planning, and launch and readiness reviews.
* Practice sustainable incident response and postmortems, and participate in an on-call rotation to support production systems.

**What We Need To See:**

* BS degree (or equivalent experience) in Computer Science or related technical field involving coding.
* 12+ years of practical experience.
* Hands-on experience with Kubernetes – deploying, configuring, and operating workloads and solutions on Kubernetes in production.
* Experience building tools and services for storage, data, or platform infrastructure, with solid software design fundamentals (algorithms, data structures, complexity analysis) on large-scale Linux-based systems.
* Experience building and operating telemetry and observability using tools such as Prometheus, InfluxDB, Grafana, and the Elastic stack.
* Strong analytical troubleshooting skills with a systematic, root-cause-driven approach to identifying and resolving complex problems.
* Proficiency in one or more of the following: Python, Go, or Java.
* Good knowledge of infrastructure configuration management and infrastructure-as-code tools such as Ansible, Chef, Puppet, ArgoCD, Git Pipelines, and Terraform.

**Ways to Stand Out from the Crowd:**

* Customer-first mindset with a focus on customer satisfaction and a passion for ensuring customer success.
* Experience with Git, code review, pipelines, and CI/CD. Experience using or running large private and public cloud systems based on Kubernetes, OpenStack, and Docker.
* Interest in crafting, analyzing, and fixing large-scale distributed systems, with strong debugging skills and a systematic problem-solving approach.
* Experience designing storage- or data-focused tooling and automating their operations at scale.
* Thrive in collaborative environments and enjoy working with various teams, and are flexible in adapting to different working styles.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 208,000 USD - 333,500 USD for Level 5, and 256,000 USD - 414,000 USD for Level 6.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until June 12, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Vacancy posted 2 days ago
