Senior System Software Engineer - DevOps and Infrastructure Automation
$184k - $287.5kNVIDIA
Become a Senior System Software Engineer on NVIDIA's AI Inference Operations Team, focusing on DevOps and Infrastructure Automation. Join a company revolutionizing computer graphics, PC gaming, and accelerated computing. You will be working alongside a team of passionate and skilled engineers who are continuously building better tools to deploy and manage this infrastructure. With your help, we will forge the next generation of compute infrastructure. If you thrive at the intersection of systems programming, cloud-native infrastructure, and developer productivity, this is your opportunity to make a lasting impact at a leading technology company.
What you'll be doing:
Design, build, and operate the infrastructure backbone powering AI inference products — reliable, performant, and scalable at every layer!
Own Kubernetes deployments end-to-end across cloud and on-prem: runbooks, canary checks, post-deploy validation, and rollbacks when needed.
Architect CI/CD pipelines for automated build, test, packaging, and release of inference libraries and their container-based software stacks.
Build observability that actually tells the truth about platform health — dashboards, logs, metrics, automated checks — and lead first-level incident triage with clean, actionable handoffs to engineering.
Manage cloud and on-prem environments with infrastructure-as-code (Terraform, Ansible, Helm, Crossplane), and chip away at toil using GitHub Actions, GitLab CI, and custom tooling.
Own the security posture for infrastructure components: vulnerability scans, CVE remediation, and compliance with internal policies.
Collaborate closely with deep learning framework engineers, compiler teams, and platform architects to streamline end-to-end deployment!
What we need to see:
BS/MS in CS/CE or equivalent experience, plus 7+ years operating production distributed systems (SRE / DevOps / Platform Ops).
Deep Kubernetes expertise — components, subsystems, on-prem setup, and hands-on debugging of telemetry-heavy microservices across AWS, Azure, GCP, and on-prem.
Strong CI/CD chops (GitLab CI, GitHub Actions), Git-based workflows, Linux systems programming, and scripting in Python and Bash.
IaC fluency (Terraform, Ansible, Helm, Crossplane) and containerization depth (Docker, containerd, OCI).
Proven reliability ownership — SLOs/SLIs, on-call, incident response, and post-incident reviews that drive measurable improvements — backed by hands-on experience with observability stacks like Prometheus, Grafana, and Loki.
A clear communicator who writes runbooks people actually use!
Ways to stand out from the crowd:
MLOps experience — crafting, deploying, and operating machine learning pipelines end to end.
Experience in open-source development workflows and community engagement on projects like Triton Inference Server or ONNX Runtime.
Familiarity with GPU software stacks — CUDA, cuDNN, TensorRT, and inference serving frameworks.
Experience building custom test automation frameworks and using data-driven metrics to improve platform health and developer efficiency.
Demonstrated ability to debug complex issues spanning kernel modules, container runtimes, and distributed networking.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.
You will also be eligible for equity and benefits ( .
Applications for this job will be accepted at least until May 16, 2026.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
- A leading tech firm in San Jose is seeking a Senior Infrastructure Engineer focused on development and automation. Responsibilities include creating scalable infrastructure... ...should have over 6 years of experience in software development and significant expertise in Java,...SeniorDevops
$120.3k - $194.53k
...forefront of cloud-native infrastructure, where reliability,... ..., and intelligent automation define the future of operations. As a Senior Site Reliability Engineer, you will design... ...build intelligent systems that predict... ...years of experience in DevOps, Site Reliability,...SeniorDevopsFull timeWork at officeVisa sponsorshipWork visa$155k - $230k
A leading cybersecurity company in Santa Clara is seeking a Senior/Staff Software Engineer to provide technology leadership in their DevOps Team. You'll design and manage resilient infrastructures and implement CI/CD pipelines while mentoring junior engineers. The ideal...SeniorDevops- A technology company in Santa Clara is seeking a Senior Software Infrastructure Engineer to develop and maintain software tools for its engineering team... ...7 years of experience in software infrastructure and DevOps. This position offers a hybrid work model and emphasizes...SeniorDevops
$175k - $290k
...Senior Software Infrastructure Engineer Santa Clara, CA This role is part of the Software Infrastructure... ...enable development of ML accelerator systems across both hardware and software... ...REST APIs, containers, and modern DevOps practices, with the ability to debug...SeniorDevopsRemote work- Stellar IT Solutions LLC is seeking a Senior DevOps Engineer in Santa Clara, CA to design, build, and scale infrastructure for a site-builder/network automation platform. This role involves transitioning the platform to a production-ready DevOps model, focusing on CI/CD...SeniorDevopsLong term contract
$120k - $145k
Fortinet, Inc. is seeking a Staff SRE to scale FortiSASE’s cloud infrastructure. The ideal candidate will have over 7 years of SRE/DevOps experience, focusing on design and implementation of multi-cloud systems. Responsibilities include leading initiatives across teams,...SeniorDevops- ...Software Infrastructure Engineer, Senior Staff At d-Matrix, we are focused on unleashing the potential of generative... ...development of our ML accelerator systems both on hardware and software. The... ...in software infrastructures and DevOps. Proficient in C/C++ and Python....SeniorDevopsWork experience placementRemote work3 days per week
$148k - $235.75k
...informative telemetry and data systems that provide real-time... ..., distributed infrastructure. As an engineer on our team, you will play... ...an outstanding mix of core software engineering, data management... ...with EDA (Electronic Design Automation) workflows and tools used...Senior$224k - $356.5k
...NVIDIA is hiring engineers to scale up the introduction of... ...architecture into its EDA Infrastructure. We expect you to have a... ...introductions (NPIs), distributed systems, familiarity with software testing and deployment,... ...at scale infrastructure, DevOps and/or SRE practices and/...SeniorDevops$150k - $250k
...at the forefront of software and hardware innovation... ...You Will Do Linux Systems Administration... ...(Bash, Python) for automation System monitoring,... ...and schema changes Infrastructure & DevOps Containerization technologies... ..., quality, and engineering teams Preferred Experience...SeniorDevopsFull timeRemote work3 days per week$184k - $287.5k
...skilled and experienced Senior DevOps Engineer to join NVIDIA’s... ...deep expertise in CI/CD infrastructure along with hands‑on experience... ...supporting robotics software, including ROS 2–based systems. In this role, you... ...infrastructure and automation, partner closely with...SeniorDevopsNight shift$184k - $287.5k
...experienced GPU and network systems Solutions Architect & Engineer. Do you want to be part of a... ...Intelligence (AI) hardware and software technologies to production... ...Infiniband, and Data Center infrastructure (power/cooling) Knowledge of DevOps/MLOps technologies such as...SeniorDevopsRemote work$152k - $241.5k
...expertise in backend infrastructure, inference and cloud-... ...product management, engineering, and business teams to... ...Collaborate with DevOps teams to orchestrate... ...Engineering, advancing AI/ML systems from proof of concept... ...orchestration softwares (Airflow, Argo, etc),...SeniorDevops- ...The FlexAI Compute Infrastructure Platform provides an... ...orchestration, security, and automation" under the hood.... ...is looking for a Senior Backend Engineer (Infrastructure & AI... ...the core backend systems powering our next-generation... ...Work with DevOps/SRE on CI/CD, deployment...SeniorDevopsWork at office
$175k - $290k
...an investment in ours. The Role: Software Infrastructure Engineer, Senior Staff The role requires you to be part... ...development of our ML accelerator systems both on hardware and software. The... ...experience in software infrastructures and DevOps. Proficient in C/C++, Python....SeniorDevopsFull timeWork experience placementRemote work$384k
NVIDIA is seeking a Senior Director, System Software Engineering, to lead strategy and execution... ...system software that automates GPU management at scale,... ...closely with security, DevOps, research, and product organizations... ...architecture, cloud infrastructure, or large-scale systems...SeniorDevopsFull time$200k - $322k
...We're hiring a Senior Staff Software Engineer to own the engineering... ...NVIDIA enterprise systems. You'll partner with... ...strategic, AI infused automated resolution systems... ...full stack from infrastructure to user facing tools... ...Enterprise Support or Devops ~ Experience with...SeniorDevops- An innovative AI solutions company is seeking a Senior DevOps Engineer to architect and maintain the core infrastructure supporting cutting-edge AI applications. The... ...deployments, and championing best practices in system reliability. Ideal candidates should have over...SeniorDevopsFull timeRemote workFlexible hours
$152k - $241.5k
Senior Infrastructure Software Engineer, Deep Learning Libraries page is loaded## Senior Infrastructure... ...experts to design the systems that enable NVIDIA to... ...* Building scalable automation for build, test,... ...GitLab pipelines, Azure DevOps)* Experience in HTML5, CSS...SeniorDevops$70 - $75 per hour
...Multi-Cloud IaC: Architect and maintain modular, reusable infrastructure components across AWS and GCP using Terraform. Utilize Terragrunt... ...: Own the deployment lifecycle using ArgoCD. Implement automated sync policies, rollouts, and "Self-Healing" infrastructure...SeniorDevopsRemote work$180k - $250k
A high-growth AI infrastructure company in San Jose is looking for a Cloud Infrastructure / DevOps Engineer to design and maintain production-grade cloud infrastructure. The ideal candidate will have over 5 years of experience and strong hands-on knowledge of Kubernetes...SeniorDevops$155k - $230k
...digital future. As the Senior/Staff Software Engineer - Infrastructure and Devops, you will be a part of our Infrastructure... ...DevOps processes and tools. Automating and accelerating the development... ...Computer Science, Information Systems, or equivalent. ~6+...SeniorDevopsTemporary workH1bWorldwideShift work$133.4k - $200k
Insider, Inc. is seeking a Senior DevOps Engineer in San Jose, California, to contribute to its infrastructure strategy focusing on automation and performance in cloud and on-premise environments. The role requires expertise in Kubernetes, Docker, and LLM technologies,...SeniorDevops$200k - $322k
...Santa Clara is seeking a Senior Staff Software Engineer. The candidate will design... ...AI workflows and integrate automation across various platforms.... ...years of experience in SRE or DevOps. The role involves full... ...managing enterprise-scale systems. Base salary ranges from $...SeniorDevopsFull time- A leading technology firm is seeking a Senior DevOps Engineer to enhance the reliability and automation of its cutting-edge Desktop-as-a-Service platform. Responsibilities... ..., managing Kubernetes clusters, and ensuring infrastructure security. Candidates should have strong Python...SeniorDevops
- ServiceNow is looking for an experienced infrastructure engineer to build and scale the Moveworks AI cloud platform. The candidate will collaborate closely with various teams, focusing on DevOps, design, and security. Responsibilities include managing cloud infrastructure...SeniorDevopsFlexible hours
- ...in Palo Alto, CA is seeking a Senior DevOps Engineer to manage cloud-based medical analysis software. The ideal candidate will... ...management and enhances cloud infrastructure. Responsibilities include CI/... ...setup, performance tuning, and system monitoring. Join our innovative...SeniorDevopsFull time
- ...delivery, and intelligent automation. We are looking to add a software engineer who will contribute to... ..., Cursor, Git, Azure DevOps, React, and AI Agents to... ...automated testing, and infrastructure as code. Familiarity... ...Ability to work with legacy systems while contributing to...SeniorDevopsRemote workFlexible hours
- We are seeking a Senior Infrastructure Engineer with a strong focus on development and automation to build scalable, reliable... ...to enhance system efficiency and reliability... ...best practices in DevOps, CI/CD, and cloud automation... ...of experience in software development,...SeniorDevops
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior System Software Engineer - DevOps and Infrastructure Automation. Be the first to apply!
- systems software developer Santa Clara, CA
- IT system engineer Santa Clara, CA
- system programmer Santa Clara, CA
- data infrastructure engineer Santa Clara, CA
- infrastructure engineering manager Santa Clara, CA
- remote infrastructure engineer Santa Clara, CA
- principal infrastructure engineer Santa Clara, CA
- senior infrastructure engineer Santa Clara, CA
- security infrastructure engineer Santa Clara, CA
- infrastructure engineer Santa Clara, CA

