Principal Engineer Software
$147k - $237.5kPalo Alto Networks
Senior Data Center & OpenShift Operations Engineer Position Overview: The Senior Data Center Operations Engineer is responsible for the core of our high‑availability infrastructure. This role bridges the gap between physical hardware and the Red Hold OpenShift Container Platform (OCP). Your mission is to ensure 99.99% availability by architecting resilient physical layouts and automating the deployment, scaling, and self‑healing of our production clusters. Key Responsibilities High‑Availability (HA) Infrastructure: Monitor and maintain data center systems with a focus on Zero Single Point of Failure (ZSPoF) architecture for OpenShift control planes and worker nodes. Cluster Reliability Engineering: Implement and manage OpenShift 4.x clusters across multiple power and cooling zones to ensure 99.99% uptime. Disaster Recovery & Business Continuity: Design, test, and execute automated failover strategies and backup/restore procedures using OADP (Velero) and Red Hold ACM. Automated Maintenance: Perform routine maintenance and upgrades using GitOps (ArgoCD) and the Machine Config Operator to ensure zero‑downtime node evacuations and patching. Complex Troubleshooting: Resolve deep‑stack hardware and software issues, from faulty GPU firmware to OpenShift SDN (OVN‑Kubernetes) network latencies. Vendor & Lifecycle Management: Coordinate with vendors for specialized hardware (e.g., NVIDIA, Dell, Cisco) while maintaining strict security and firmware compliance. Efficiency & Capacity Architecture: Optimize rack density for high‑performance GPU clusters while managing thermal loads and power distribution (PDU) to prevent circuit‑trip outages. Observability Implementation: Maintain accurate documentation and integrate hardware health metrics (IPMI/SNMP) into Prometheus/Grafana for proactive alerting. Physical Deployment: Rack and stack high‑density GPU servers, ensuring redundant power‑pathing and high‑speed (100G/200G) InfiniBand or Ethernet cabling. Hardware Lifecycle: Perform precision physical installation and replacement of critical components (CPUs, GPUs, NVMe storage) in a live production environment without impacting cluster quorum. Qualifications Education: Bachelor’s degree in Computer Science, IT, or equivalent experience. Platform Expertise: 5+ years of experience specifically operating Red Hold OpenShift (OCP) in a production environment. Hardware Fluency: Deep experience racking/stacking and cabling high‑density GPU systems (e.g., NVIDIA DGX or similar) and specialized AI/ML hardware. Infrastructure as Code (IaC): Advanced proficiency in Ansible or Pulumi for automating bare‑metal provisioning and cluster configuration. Scripting: Strong Python and Bash skills for developing custom health‑check scripts and API integrations. Linux Mastery: Expert‑level CoreOS and RHEL administration, including kernel tuning and systemd management. Networking: Solid understanding of BGP, VLAN tagging, LACP, and Load Balancing (F5/NGINX) essential for cluster ingress. Virtualization & Storage: Experience with vSphere or KVM, and persistent storage solutions like OpenShift Data Foundation (ODF) or Ceph. Tooling: Familiarity with DCIM tools (Netbox) and monitoring stacks (ELK/Loki, etc.). Physical Requirements Lifting: Ability to lift and move equipment up to 50 pounds. Environment: Comfortable working in high‑decibel, climate‑controlled data center aisles. Dexterity: Capable of standing, walking, and performing precision cabling in tight rack spaces for extended periods. Travel: May require occasional travel to remote data center sites or edge locations. Benefits Competitive salary commensurate with high‑availability expertise. Comprehensive health, dental, and vision insurance. 401(k) retirement plan with company match. Compensation Disclosure The compensation offered for this position will depend on qualifications, experience, and work location. For candidates who receive an offer at the posted level, the starting base salary is expected to be in the range $147,000.00 – $237,500.00 per year. The offered compensation may also include restricted stock units and a bonus. Equal Employment Opportunity We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at View email address on click.appcast.io. Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics. All your information will be kept confidential according to EEO guidelines. #J-18808-Ljbffr Palo Alto Networks, Inc.
- ...r - G P U S o f t w a r e A r c h i t e c t THE ROLE: As GPU Software Architect, you will provide technical leadership at the intersection... ...of technical leadership across distributed, cross‑functional engineering teams. Strong background in systems software, firmware,...SuggestedRemote work
- ...the kind of precision that drives great outcomes. The Team Engineering—our engineering team is at the core of our products and... ...enabled by a secure digital environment. Job Summary As a Principal Software Engineer, you will provide technical leadership in designing...SuggestedFull timeWork at office
$147k - $237.5k
Job Summary We are seeking a Principal Software Engineer to join our Machine Identity Management CyberArk team, focused on building and scaling frontend experiences that enable visibility, control, and orchestration of machine identities. Responsibilities Lead the design...Suggested$147k - $237.5k
Our Mission At Palo Alto Networks®, we’re united by a shared mission—to protect our digital way of life. We thrive at the intersection of innovation and impact, solving real‑world problems with cutting‑edge technology and bold thinking. Here, everyone has a voice, and every...SuggestedFull timeWork at officeLocal areaVisa sponsorshipWork visa- ...us all evolve, together. Your Career We are the WildFire Team in the Content Delivered Security Service (CDSS) organization. Our engineering and Security Research team is at the core of our products and delivers the best of security services in the cloud to prevent...SuggestedFull timeWork at officeVisa sponsorshipWork visaFlexible hours
$170k - $277k
Job Summary We are seeking a Senior or Principal Backend Engineer for our Santa Clara Headquarters center to join the Talon team, which is responsible for our Enterprise Browser. In this role, you will be instrumental in developing a key security solution that protects...$170k - $277k
Job Summary We are seeking a Module FW Integration Engineer - For Module Integration and Cellular Certification to drive software validation, automation, and the cellular certification lifecycle for our wireless products. This role bridges embedded application development...$147k - $237.5k
...lifecycle. Experience implementing LLM-based features in frontend applications. Familiarity with AI model integration and prompt engineering for optimizing UI performance and accessibility. Collaborative Engineering & Execution: Contribute to infrastructure design,...- MixMode is seeking a Principal Software Engineer to innovate and develop kernels for our advanced AI compute engine based in Santa Clara, CA. This role demands expertise in software development for next-generation hardware with a strong requirement for knowledge in C/C++...
$233.99k - $330.34k
Job Details Job Description: We are seeking a highly capable Principal Engineer to help shape the future of datacenter and cloud software. In this pivotal role, you will engage deeply with customer technologists, define optimization opportunities across the software stack...InternshipLocal areaImmediate startShift work$241.8k - $409.2k
...GPGPU Software Architect/ Principal Engineer XPENG is a leading smart technology company at the forefront of innovation, integrating advanced AI and autonomous driving technologies into its vehicles, including electric vehicles (EVs), electric vertical take-off and...Full time- ...Requirements We Are Synopsys is the leader in engineering solutions from silicon to systems, enabling customers to... ...and can analyze and resolve intricate design verification and software validation issues. Your strong communication abilities enable...
$272k - $431.25k
We are seeking software engineers to work on next-generation high-speed interconnect technologies. Our charter is to develop the most demanding high-speed IO applications a GPU or high-performance computing server will encounter in its lifecycle, by collaborating closely...$211.8k - $317.8k
Company Qualcomm Technologies, Inc. Job Area Engineering Group, Engineering Group > Software Engineering General Summary Hiring for Sr. Staff Engineer and Principal Engineer level. Relocation Required: Candidates must be willing to relocate to Santa Clara, CA or Austin,...Work experience placementRelocation$147k - $237.5k
...Palo Alto Networks, Inc. is seeking a Principal Software Engineer to design and implement Threat Intelligence Services for cloud features. The role includes all phases of product development and collaboration with teams to achieve high-quality releases. Ideal candidates...- ...Job Description Job Description Senior Principal Engineer, Software/Firmware Salary : 250,000 - 280,000/yr Location : Santa Clara, CA (onsite) Must be eligible to work in the United States. Sponsorship is unavailable. Relocation assistance is unavailable...Relocation package
$200k - $260k
...Job Description Job Description Job Opportunity: Senior Principal Engineer, Software/Firmware Location: Onsite, Santa Clara, CA, US Industry: Engineering / Architecture Salary: USD $200,000 – $260,000 / year Sponsorship: None available at this time...Visa sponsorship$200k - $260k
...Position Title: Senior Principal Engineer, Software/Firmware - Coherent Optical Module Firmware/SoC-based Embedded Platforms/CPO (Confidential Client) Location: Santa Clara, CA | Onsite Employment Type: Permanent Compensation ~ Salary: $200,000 - $260,000/yr...Permanent employment$185k - $278k
...empowering the creation of high-performance silicon chips and software content. Join us to transform the future through continuous technological... ...them into scalable solutions. * Debugging OS and engineering issues within our provided Linux environment. * Collaborating...Remote work$147k - $237.5k
...platform, consisting of XDR, XSIAM, XSOAR, and XPANSE. As a Principal Site Reliability Engineer within the Cortex DevOps team, you will serve as a... ..., Docker, and cloud-native architectures. ~ Strong software engineering and automation skills using Python, Linux, Terraform...Full timeWork at office- CoreWeave is hiring a Principal Engineer to lead the design and evolution of its AI infrastructure's cluster orchestration systems. The role demands expertise in Kubernetes and Slurm, working directly on technology that influences how efficiently GPUs are utilized. Your...
- About the Role We are seeking a skilled and passionate Back-End Engineer with a strong background in data engineering to join our growing team. In this role, you will be responsible for designing, developing, and maintaining scalable and reliable data pipelines and back...Permanent employmentContract workLocal area
$175.8k - $293k
...they can seize a competitive advantage. We're looking for a Principal AI Engineer to architect, build, and harden the agentic AI systems that... ...years of building and operating scalable, production‑grade software, with significant recent depth in AI/ML. Proven experience...$105k - $169.05k
Position Overview As Principal Applications Integration Engineer, you will be responsible for designing, developing, and maintaining integration solutions to connect various software applications and systems within the organization. This role involves working closely with...Temporary workLocal area$145k - $236k
GlobalFoundries is seeking a Principal Field Applications Engineer in Santa Clara, CA. The role involves providing ongoing support to customers, developing technical presentations, and managing client relationships. The ideal candidate will have a Bachelor's Degree in Electrical...- ...infrastructure, where reliability, scale, and intelligent automation define the future of operations. As a Senior Site Reliability Engineer, you will design and operate the platforms that power our applications across GCP, AWS, and global data centers - and you'll push...Visa sponsorshipWork visa
$167k - $270.5k
Palo Alto Networks, Inc. is seeking a Technical Leader to develop AI applications within the GTM/CX domain. This role involves defining the architecture for scalable AI/ML systems and leading the design of intelligent agents. Ideal candidates will have 15+ years of experience...- ...workloads with ultra high‑speed inference. We're hiring a Principal Engineer for our Inference Cloud Platform. This team owns the cloud layer... .... Skills & Qualifications 10+ years of experience in software engineering, with substantial individual contributor experience...
$272k - $431.25k
NVIDIA Corporation seeks a Principal AI and ML Infra Software Engineer in Santa Clara, California, to enhance the efficiency of AI/ML research on GPU Clusters. The role involves collaboration with various teams, monitoring infrastructure performance, and implementing improvements...- ...NVIDIA Gruppe is seeking a Principal AI and ML Infra Software Engineer to join our Hardware Infrastructure team in Santa Clara, CA. In this role, you'll work closely with AI research teams to enhance efficiency by addressing infrastructure deficiencies for GPU Clusters...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Principal Engineer Software. Be the first to apply!
- chief design engineer Santa Clara, CA
- principal developer Santa Clara, CA
- engineering director Santa Clara, CA
- principal data engineer Santa Clara, CA
- director of product engineering Santa Clara, CA
- senior chief engineer Santa Clara, CA
- chief engineer Santa Clara, CA
- data center chief engineer Santa Clara, CA
- senior civil engineer project manager Santa Clara, CA
- director systems engineering Santa Clara, CA

