Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Principal Engineer Software

$147k - $237.5k

Palo Alto Networks

Senior Data Center & OpenShift Operations Engineer Position Overview: The Senior Data Center Operations Engineer is responsible for the core of our high‑availability infrastructure. This role bridges the gap between physical hardware and the Red Hold OpenShift Container Platform (OCP). Your mission is to ensure 99.99% availability by architecting resilient physical layouts and automating the deployment, scaling, and self‑healing of our production clusters. Key Responsibilities High‑Availability (HA) Infrastructure: Monitor and maintain data center systems with a focus on Zero Single Point of Failure (ZSPoF) architecture for OpenShift control planes and worker nodes. Cluster Reliability Engineering: Implement and manage OpenShift 4.x clusters across multiple power and cooling zones to ensure 99.99% uptime. Disaster Recovery & Business Continuity: Design, test, and execute automated failover strategies and backup/restore procedures using OADP (Velero) and Red Hold ACM. Automated Maintenance: Perform routine maintenance and upgrades using GitOps (ArgoCD) and the Machine Config Operator to ensure zero‑downtime node evacuations and patching. Complex Troubleshooting: Resolve deep‑stack hardware and software issues, from faulty GPU firmware to OpenShift SDN (OVN‑Kubernetes) network latencies. Vendor & Lifecycle Management: Coordinate with vendors for specialized hardware (e.g., NVIDIA, Dell, Cisco) while maintaining strict security and firmware compliance. Efficiency & Capacity Architecture: Optimize rack density for high‑performance GPU clusters while managing thermal loads and power distribution (PDU) to prevent circuit‑trip outages. Observability Implementation: Maintain accurate documentation and integrate hardware health metrics (IPMI/SNMP) into Prometheus/Grafana for proactive alerting. Physical Deployment: Rack and stack high‑density GPU servers, ensuring redundant power‑pathing and high‑speed (100G/200G) InfiniBand or Ethernet cabling. Hardware Lifecycle: Perform precision physical installation and replacement of critical components (CPUs, GPUs, NVMe storage) in a live production environment without impacting cluster quorum. Qualifications Education: Bachelor’s degree in Computer Science, IT, or equivalent experience. Platform Expertise: 5+ years of experience specifically operating Red Hold OpenShift (OCP) in a production environment. Hardware Fluency: Deep experience racking/stacking and cabling high‑density GPU systems (e.g., NVIDIA DGX or similar) and specialized AI/ML hardware. Infrastructure as Code (IaC): Advanced proficiency in Ansible or Pulumi for automating bare‑metal provisioning and cluster configuration. Scripting: Strong Python and Bash skills for developing custom health‑check scripts and API integrations. Linux Mastery: Expert‑level CoreOS and RHEL administration, including kernel tuning and systemd management. Networking: Solid understanding of BGP, VLAN tagging, LACP, and Load Balancing (F5/NGINX) essential for cluster ingress. Virtualization & Storage: Experience with vSphere or KVM, and persistent storage solutions like OpenShift Data Foundation (ODF) or Ceph. Tooling: Familiarity with DCIM tools (Netbox) and monitoring stacks (ELK/Loki, etc.). Physical Requirements Lifting: Ability to lift and move equipment up to 50 pounds. Environment: Comfortable working in high‑decibel, climate‑controlled data center aisles. Dexterity: Capable of standing, walking, and performing precision cabling in tight rack spaces for extended periods. Travel: May require occasional travel to remote data center sites or edge locations. Benefits Competitive salary commensurate with high‑availability expertise. Comprehensive health, dental, and vision insurance. 401(k) retirement plan with company match. Compensation Disclosure The compensation offered for this position will depend on qualifications, experience, and work location. For candidates who receive an offer at the posted level, the starting base salary is expected to be in the range $147,000.00 – $237,500.00 per year. The offered compensation may also include restricted stock units and a bonus. Equal Employment Opportunity We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at View email address on click.appcast.io. Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics. All your information will be kept confidential according to EEO guidelines. #J-18808-Ljbffr Palo Alto Networks, Inc.

Vacancy posted 3 days ago
Similar jobs that could be interesting for youBased on the Principal Engineer Software in Santa Clara, CA vacancy
  •  ...r - G P U S o f t w a r e A r c h i t e c t THE ROLE: As GPU Software Architect, you will provide technical leadership at the intersection...  ...of technical leadership across distributed, cross‑functional engineering teams. Strong background in systems software, firmware,... 
    Suggested
    Remote work

    Advanced Micro Devices

    Santa Clara, CA
    3 days ago
  •  ...the kind of precision that drives great outcomes. The Team Engineering—our engineering team is at the core of our products and...  ...enabled by a secure digital environment. Job Summary As a Principal Software Engineer, you will provide technical leadership in designing... 
    Suggested
    Full time
    Work at office

    Palo Alto Networks

    Santa Clara, CA
    1 day ago
  • $147k - $237.5k

    Job Summary We are seeking a Principal Software Engineer to join our Machine Identity Management CyberArk team, focused on building and scaling frontend experiences that enable visibility, control, and orchestration of machine identities. Responsibilities Lead the design... 
    Suggested

    Palo Alto Networks, Inc.

    Santa Clara, CA
    3 days ago
  • $147k - $237.5k

    Our Mission At Palo Alto Networks®, we’re united by a shared mission—to protect our digital way of life. We thrive at the intersection of innovation and impact, solving real‑world problems with cutting‑edge technology and bold thinking. Here, everyone has a voice, and every...
    Suggested
    Full time
    Work at office
    Local area
    Visa sponsorship
    Work visa

    Palo Alto Networks

    Santa Clara, CA
    5 days ago
  •  ...us all evolve, together. Your Career We are the WildFire Team in the Content Delivered Security Service (CDSS) organization. Our engineering and Security Research team is at the core of our products and delivers the best of security services in the cloud to prevent... 
    Suggested
    Full time
    Work at office
    Visa sponsorship
    Work visa
    Flexible hours

    Palo Alto Networks, Inc.

    Santa Clara, CA
    4 days ago
  • $170k - $277k

    Job Summary We are seeking a Senior or Principal Backend Engineer for our Santa Clara Headquarters center to join the Talon team, which is responsible for our Enterprise Browser. In this role, you will be instrumental in developing a key security solution that protects... 

    Palo Alto Networks

    Santa Clara, CA
    1 day ago
  • $170k - $277k

    Job Summary We are seeking a Module FW Integration Engineer - For Module Integration and Cellular Certification to drive software validation, automation, and the cellular certification lifecycle for our wireless products. This role bridges embedded application development... 

    Palo Alto Networks, Inc.

    Santa Clara, CA
    1 day ago
  • $147k - $237.5k

     ...lifecycle. Experience implementing LLM-based features in frontend applications. Familiarity with AI model integration and prompt engineering for optimizing UI performance and accessibility. Collaborative Engineering & Execution: Contribute to infrastructure design,... 

    Palo Alto Networks

    Santa Clara, CA
    1 day ago
  • MixMode is seeking a Principal Software Engineer to innovate and develop kernels for our advanced AI compute engine based in Santa Clara, CA. This role demands expertise in software development for next-generation hardware with a strong requirement for knowledge in C/C++... 

    MixMode

    Santa Clara, CA
    1 day ago
  • $233.99k - $330.34k

    Job Details Job Description: We are seeking a highly capable Principal Engineer to help shape the future of datacenter and cloud software. In this pivotal role, you will engage deeply with customer technologists, define optimization opportunities across the software stack... 
    Internship
    Local area
    Immediate start
    Shift work

    Intel Corporation

    Santa Clara, CA
    4 days ago
  • $241.8k - $409.2k

     ...GPGPU Software Architect/ Principal Engineer XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electric vertical take-off and... 
    Full time

    XPENG

    Santa Clara, CA
    23 hours ago
  •  ...Requirements We Are Synopsys is the leader in engineering solutions from silicon to systems, enabling customers to...  ...and can analyze and resolve intricate design verification and software validation issues. Your strong communication abilities enable... 

    Synopsys

    Sunnyvale, CA
    4 days ago
  • $272k - $431.25k

    We are seeking software engineers to work on next-generation high-speed interconnect technologies. Our charter is to develop the most demanding high-speed IO applications a GPU or high-performance computing server will encounter in its lifecycle, by collaborating closely... 

    NVIDIA Gruppe

    Santa Clara, CA
    5 days ago
  • $211.8k - $317.8k

    Company Qualcomm Technologies, Inc. Job Area Engineering Group, Engineering Group > Software Engineering General Summary Hiring for Sr. Staff Engineer and Principal Engineer level. Relocation Required: Candidates must be willing to relocate to Santa Clara, CA or Austin,... 
    Work experience placement
    Relocation

    Qualcomm

    Santa Clara, CA
    3 days ago
  • $147k - $237.5k

     ...Palo Alto Networks, Inc. is seeking a Principal Software Engineer to design and implement Threat Intelligence Services for cloud features. The role includes all phases of product development and collaboration with teams to achieve high-quality releases. Ideal candidates... 

    Palo Alto Networks

    Santa Clara, CA
    23 hours ago
  •  ...Job Description Job Description Senior Principal Engineer, Software/Firmware Salary : 250,000 - 280,000/yr Location : Santa Clara, CA (onsite) Must be eligible to work in the United States. Sponsorship is unavailable. Relocation assistance is unavailable... 
    Relocation package

    Acloche-Direct Hire

    Santa Clara, CA
    20 days ago
  • $200k - $260k

     ...Job Description Job Description Job Opportunity: Senior Principal Engineer, Software/Firmware Location: Onsite, Santa Clara, CA, US Industry: Engineering / Architecture Salary: USD $200,000 – $260,000 / year Sponsorship: None available at this time... 
    Visa sponsorship

    Fox Point Recruitment LLc

    Santa Clara, CA
    23 days ago
  • $200k - $260k

     ...Position Title: Senior Principal Engineer, Software/Firmware - Coherent Optical Module Firmware/SoC-based Embedded Platforms/CPO (Confidential Client) Location: Santa Clara, CA | Onsite Employment Type: Permanent Compensation ~ Salary: $200,000 - $260,000/yr... 
    Permanent employment

    YK Solutions LLC

    Santa Clara, CA
    6 days ago
  • $185k - $278k

     ...empowering the creation of high-performance silicon chips and software content. Join us to transform the future through continuous technological...  ...them into scalable solutions. * Debugging OS and engineering issues within our provided Linux environment. * Collaborating... 
    Remote work

    Synopsys

    Sunnyvale, CA
    2 days ago
  • $147k - $237.5k

     ...platform, consisting of XDR, XSIAM, XSOAR, and XPANSE. As a Principal Site Reliability Engineer within the Cortex DevOps team, you will serve as a...  ..., Docker, and cloud-native architectures. ~ Strong software engineering and automation skills using Python, Linux, Terraform... 
    Full time
    Work at office

    Palo Alto Networks

    Santa Clara, CA
    1 day ago
  • CoreWeave is hiring a Principal Engineer to lead the design and evolution of its AI infrastructure's cluster orchestration systems. The role demands expertise in Kubernetes and Slurm, working directly on technology that influences how efficiently GPUs are utilized. Your... 

    Dormont Manufacturing Co

    Sunnyvale, CA
    5 days ago
  • About the Role We are seeking a skilled and passionate Back-End Engineer with a strong background in data engineering to join our growing team. In this role, you will be responsible for designing, developing, and maintaining scalable and reliable data pipelines and back... 
    Permanent employment
    Contract work
    Local area

    Robotics Technologies LLC

    Sunnyvale, CA
    2 days ago
  • $175.8k - $293k

     ...they can seize a competitive advantage. We're looking for a Principal AI Engineer to architect, build, and harden the agentic AI systems that...  ...years of building and operating scalable, production‑grade software, with significant recent depth in AI/ML. Proven experience... 

    BMC Software, Inc.

    Santa Clara, CA
    4 days ago
  • $105k - $169.05k

    Position Overview As Principal Applications Integration Engineer, you will be responsible for designing, developing, and maintaining integration solutions to connect various software applications and systems within the organization. This role involves working closely with... 
    Temporary work
    Local area

    6947-SHOCKWAVE MEDICAL INC. Legal Entity

    Santa Clara, CA
    3 days ago
  • $145k - $236k

    GlobalFoundries is seeking a Principal Field Applications Engineer in Santa Clara, CA. The role involves providing ongoing support to customers, developing technical presentations, and managing client relationships. The ideal candidate will have a Bachelor's Degree in Electrical... 

    GlobalFoundries

    Santa Clara, CA
    11 days ago
  •  ...infrastructure, where reliability, scale, and intelligent automation define the future of operations. As a Senior Site Reliability Engineer, you will design and operate the platforms that power our applications across GCP, AWS, and global data centers - and you'll push... 
    Visa sponsorship
    Work visa

    Palo Alto Networks

    Santa Clara, CA
    4 days ago
  • $167k - $270.5k

    Palo Alto Networks, Inc. is seeking a Technical Leader to develop AI applications within the GTM/CX domain. This role involves defining the architecture for scalable AI/ML systems and leading the design of intelligent agents. Ideal candidates will have 15+ years of experience...

    Palo Alto Networks, Inc.

    Santa Clara, CA
    5 days ago
  •  ...workloads with ultra high‑speed inference. We're hiring a Principal Engineer for our Inference Cloud Platform. This team owns the cloud layer...  .... Skills & Qualifications 10+ years of experience in software engineering, with substantial individual contributor experience... 

    Cerebras Systems, Inc.

    Sunnyvale, CA
    1 day ago
  • $272k - $431.25k

    NVIDIA Corporation seeks a Principal AI and ML Infra Software Engineer in Santa Clara, California, to enhance the efficiency of AI/ML research on GPU Clusters. The role involves collaboration with various teams, monitoring infrastructure performance, and implementing improvements... 

    Jobleads-US

    Santa Clara, CA
    3 days ago
  •  ...NVIDIA Gruppe is seeking a Principal AI and ML Infra Software Engineer to join our Hardware Infrastructure team in Santa Clara, CA. In this role, you'll work closely with AI research teams to enhance efficiency by addressing infrastructure deficiencies for GPU Clusters... 

    Jobleads-US

    Santa Clara, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Principal Engineer Software. Be the first to apply!