Senior System Software Engineer - DevOps and Infrastructure Automation

$184k - $287.5k

NVIDIA

Become a Senior System Software Engineer on NVIDIA's AI Inference Operations Team, focusing on DevOps and Infrastructure Automation. Join a company revolutionizing computer graphics, PC gaming, and accelerated computing. You will be working alongside a team of passionate and skilled engineers who are continuously building better tools to deploy and manage this infrastructure. With your help, we will forge the next generation of compute infrastructure. If you thrive at the intersection of systems programming, cloud-native infrastructure, and developer productivity, this is your opportunity to make a lasting impact at a leading technology company.

What you'll be doing:

  • Design, build, and operate the infrastructure backbone powering AI inference products — reliable, performant, and scalable at every layer!

  • Own Kubernetes deployments end-to-end across cloud and on-prem: runbooks, canary checks, post-deploy validation, and rollbacks when needed.

  • Architect CI/CD pipelines for automated build, test, packaging, and release of inference libraries and their container-based software stacks.

  • Build observability that actually tells the truth about platform health — dashboards, logs, metrics, automated checks — and lead first-level incident triage with clean, actionable handoffs to engineering.

  • Manage cloud and on-prem environments with infrastructure-as-code (Terraform, Ansible, Helm, Crossplane), and chip away at toil using GitHub Actions, GitLab CI, and custom tooling.

  • Own the security posture for infrastructure components: vulnerability scans, CVE remediation, and compliance with internal policies.

  • Collaborate closely with deep learning framework engineers, compiler teams, and platform architects to streamline end-to-end deployment!

What we need to see:

  • BS/MS in CS/CE or equivalent experience, plus 7+ years operating production distributed systems (SRE / DevOps / Platform Ops).

  • Deep Kubernetes expertise — components, subsystems, on-prem setup, and hands-on debugging of telemetry-heavy microservices across AWS, Azure, GCP, and on-prem.

  • Strong CI/CD chops (GitLab CI, GitHub Actions), Git-based workflows, Linux systems programming, and scripting in Python and Bash.

  • IaC fluency (Terraform, Ansible, Helm, Crossplane) and containerization depth (Docker, containerd, OCI).

  • Proven reliability ownership — SLOs/SLIs, on-call, incident response, and post-incident reviews that drive measurable improvements — backed by hands-on experience with observability stacks like Prometheus, Grafana, and Loki.

  • A clear communicator who writes runbooks people actually use!

Ways to stand out from the crowd:

  • MLOps experience — crafting, deploying, and operating machine learning pipelines end to end.

  • Experience in open-source development workflows and community engagement on projects like Triton Inference Server or ONNX Runtime.

  • Familiarity with GPU software stacks — CUDA, cuDNN, TensorRT, and inference serving frameworks.

  • Experience building custom test automation frameworks and using data-driven metrics to improve platform health and developer efficiency.

  • Demonstrated ability to debug complex issues spanning kernel modules, container runtimes, and distributed networking.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 184,000 USD - 287,500 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until May 16, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Vacancy posted 5 days ago
