Principal Cloud and Production Operations Engineer
Qode
Principal Cloud And Production Operations Engineer
The Principal Cloud and Production Operations Engineer serves as the senior technical authority responsible for architecting, automating, and optimizing hybrid and cloud-native production environments that power critical customer-facing services and enterprise applications.
This role combines deep cloud infrastructure expertise with strong production reliability and operational engineering skills. The Principal Engineer acts as both architect and hands-on builder, ensuring scalability, resilience, and security across multi-cloud and on-prem environments.
Reporting to the Associate Director of IT and Infrastructure, this position will collaborate closely with Engineering, DevOps, Security, and IT Operations to drive a culture of automation, observability, and continuous improvement across the production ecosystem.
Key Responsibilities:
Cloud Architecture and Engineering
•Design, implement, and maintain cloud and hybrid infrastructure supporting production workloads, enterprise systems, and CI/CD pipelines
•Lead the adoption of infrastructure-as-code (IaC) using Terraform, CloudFormation, or similar tools to enable repeatable, auditable, and secure deployments
•Architect scalable and fault-tolerant solutions across OCI, AWS, Azure, and on-prem data centers, ensuring high availability and cost efficiency
•Evaluate emerging cloud services and technologies for applicability to business needs and long-term scalability goals
Production Operations and Reliability
•Serve as the technical lead for production operations, ensuring uptime, performance, and reliability of customer-facing and internal systems
•Develop and maintain observability frameworks leveraging metrics, logs, and traces to ensure proactive detection and rapid response
•Partner with engineering teams to implement SRE-inspired practices, including service level objectives (SLOs), error budgets, and post-incident reviews
•Drive root cause analysis, performance tuning, and continuous improvement of production services
Automation and CI/CD Enablement
•Collaborate with DevOps and application engineering teams to build and optimize automated deployment pipelines supporting frequent, low-risk releases
•Integrate security and compliance checks into CI/CD workflows to ensure production readiness and alignment with internal standards
•Design self-healing infrastructure and automated rollback mechanisms to reduce operational risk
•Ensure secure and reliable configuration management and environment orchestration using tools such as Ansible, Chef, or Puppet
Operational Governance and Collaboration
•Establish and enforce operational best practices for monitoring, patching, and change management across production systems
•Lead production readiness reviews for new releases and large-scale changes
•Collaborate with the Security and Compliance teams to ensure systems adhere to policy, hardening standards, and regulatory requirements
•Participate in and occasionally lead on-call rotations for critical production systems, ensuring rapid triage and resolution
Leadership and Mentorship
•Act as a technical mentor to cloud and infrastructure engineers, fostering a culture of knowledge sharing and engineering excellence
•Lead architectural reviews, design sessions, and capacity planning discussions
•Serve as a trusted advisor to management on cloud modernization, resilience engineering, and cost optimization strategies
Qualifications:
•Bachelor's degree in Computer Science, Information Systems, or related field; Master's preferred
•10+ years of experience in cloud and infrastructure engineering, including 3+ years in a senior or principal role
•Expertise with OCI (preferred), AWS and/or Azure cloud services, including networking, compute, storage, and identity management
•Proven experience managing production-scale environments supporting mission-critical applications and services
•Strong proficiency in:
-Infrastructure-as-code (Terraform, CloudFormation)
-CI/CD and DevOps toolchains (Jenkins, GitLab, ArgoCD)
-Container orchestration (Kubernetes, Docker)
-Monitoring and observability platforms (Prometheus, Grafana, Datadog, ELK)
-Scripting and automation (Python, Bash, PowerShell)
•Solid understanding of security, compliance, and networking principles in hybrid environments
•Exceptional analytical, problem-solving, and incident management skills
•Demonstrated ability to lead complex, cross-functional initiatives from concept to execution
Preferred Experience:
•Experience in high-availability SaaS or networking environments
•Knowledge of FinOps, cost optimization, and multi-cloud governance frameworks
•Familiarity with Zero Trust, identity federation, and cloud access security model
•Exposure to AI/ML infrastructure or data-driven pipelines is a plus
- ...Senior Principal Video Player And Video Encoding Engineer Oracle Cloud Infrastructure is building Oracle Video @ Edge, a next-generation streaming platform for... ...Programming experience in JavaScript, TypeScript, C++, Java, Go, Python, or similar production languages....PrincipalCloud
- ...Principal Data Platform Engineer The Principal Data Platform Engineer is a senior... ...reliability, and maintainability. Operating across multiple scrum... ...delivering reusable data products, documentation, and clear... ...Strong experience with modern cloud data platforms, including...PrincipalCloudImmediate start
$109k - $185k
...Principal FinOps Analyst At MiniMed, you can begin a lifelong career of... ...optimization across the organization's cloud footprint. This role partners with engineering, finance, and business leaders... ...-efficient architectural and operational decisions. This role ensures...PrincipalCloudWork at officeLocal areaFlexible hours- ...Sr. Cloud Network Automation Engineer SME Automation Framework Development: Design, develop, and implement automation workflows using Ansible... ...automation processes to improve performance and reduce operational overhead. Collaboration and Mentorship: Work...Cloud
$147k - $237.5k
...Engineering Manager Palo Alto Networks® is shaping the future with technology that is... ...transforming the way people and organizations operate in the cloud, at the network edge, and everywhere... ...Access. Partner closely with Product Managers and cross-functional groups to...PrincipalCloud$250.5k - $335.9k
...Sr Principal Site Reliability Engineer P5/P6: SRE Lead, Content Distribution Engineering... ...our streaming and digital products in new and immersive ways,... ...this group builds and operates delight millions of consumers... ..., in both datacenter and cloud environments....PrincipalCloudLocal areaWorldwide- ...Your Role As a Full Stack Engineer in the healthcare space, you'... ...integrations ~ Experience developing cloud-native applications (Azure... ...Effective collaboration with product, design, QA, security, and platform teams ~ Ability to operate in dynamic, evolving...PrincipalCloudWork at officeShift work2 days per week
- ...Principal Site Reliability Engineer At Palo Alto Networks®, we're united by a shared mission—to protect... ...as the technical authority for our cloud-native infrastructure. You aren't just... ...Mastery: Expert-level experience managing production K8s workloads (preferably within GKE...PrincipalCloudFull timeWork at officeVisa sponsorshipWork visaShift work
$164k - $278k
...About the Role The Senior Principal Integration Architect will focus... ...Data & Analytics, Security, Product, and Business leadership to... ...B/EDI integrations Hybrid cloud and on-prem data flows Drive... ...and analytics platforms Operational data stores Define patterns...PrincipalCloudWork at officeLocal areaFlexible hours$147k - $237.5k
...Principal Software Automation/Test Engineer At Palo Alto Networks®, we're united by a shared mission—to protect... ...Networks, we're not just building products — we're redefining what's possible... ...team that's leading the charge in cloud security innovation, we want to hear...PrincipalCloudFull timeWork at office$151.6k - $245.3k
...Site Reliability Engineer Palo Alto Networks runs a large hybrid infrastructure and is one... ...and security experts Design, build, and operate reliable, secure Cloud infrastructure Ensure that applications are production-ready, scalable, and reliable Develop tools...PrincipalCloud$120.1k - $251.6k
...The Data Center Infrastructure Construction team at Oracle Cloud Infrastructure is a dynamic group of professionals dedicated to... ...conditions and locations, as well as reflect Oracle's differing products, industries and lines of business. Candidates are typically placed...PrincipalCloudTemporary workFlexible hours$142.5k - $190k
A prominent entertainment agency is seeking a Principal Architect to lead the design of technology infrastructure spanning on-premises and cloud environments. This role focuses on Microsoft Azure, driving zero-trust security models, and creating a multi-year infrastructure...PrincipalCloud- ...Lead Cloud Engineering And Production Operations Engineer This role acts as a hands-on technical lead, driving cloud engineering initiatives, automating infrastructure, and ensuring high-availability and performance across customer-facing systems. The Lead Engineer...Cloud
$167k - $270.5k
...experience motion. The Sr. Principal/Principal person will... ...processes and operations workflows. Establish... ...development from prototype to production, ensuring scalability,... ...Partner with data engineering to design high-quality... ...architecture, and cloud platforms (AWS/Azure/GCP...PrincipalCloudFull timeWork at office$185.2k - $299.48k
...Principal AI Engineer For The Enterprise AI Platform At Palo Alto Networks... ...powered solutions. You will operate where AI innovation, top-tier... ...data science, engineering, and product stakeholders to translate... ...streaming data platforms, and cloud AI/ML platforms (e.g., GCP...PrincipalCloudFull timeWork at office$147k - $237.5k
...Principal Engineer At Palo Alto Networks®, we're united by a shared mission—to protect our digital... ...join us to build the next generation Cloud Security to discover cloud resources,... ...you to collaborate with cross-functional product management, development and quality...PrincipalCloudFull timeWork at officeVisa sponsorshipWork visa$99.6k - $223.4k
...Oracle Video @ Edge Engineer Oracle Cloud Infrastructure (OCI) is building Oracle Video @ Edge (OVE), a next-generation video delivery platform... ...cross-functionally with networking, playback, and product teams Drive architectural decisions and technical strategy...PrincipalCloudTemporary workFlexible hours$168k - $195k
...Principal Lead Analyst of DART At Corebridge Financial, we believe... ...strategies through IT and operations services and ensures the... ...workstreams (Forensics, Network, Cloud, Legal, and PR). Crisis... ...infrastructure. Detection Engineering Oversight: Collaborate with...PrincipalCloudWork at officeLocal areaImmediate startRemote workShift work$109k - $185k
...Principal Technical Product Owner At MiniMed, you can begin a lifelong career... ...technical bridge between AI Engineering, Data Science, and cross-functional... ...platform concepts, and cloud AI services to effectively... ...(WSJF, RICE, MoSCoW) and operate within a SAFe environment including...PrincipalCloudTemporary workWork at officeLocal areaFlexible hours- ...Full Stack Engineer As a Full Stack Engineer in the healthcare... ...technical authority across multiple products, platforms, or domains,... ...components Ensure applications are cloud-ready and compatible with... ...teams ~ Ability to operate in dynamic, evolving environments...PrincipalCloudFull timePart timeWork at officeLocal areaWork from homeHome officeShift work2 days per week
$180k - $200k
...Principal Ultrasound Engineer Burlington, MA Office; California - Hybrid Company Description Butterfly... ...combines its advanced hardware with cloud software and AI, an enterprise... ...In addition to its medical imaging products, Butterfly Embedded™ is the Company's...PrincipalCloudWork at officeImmediate startWork visa2 days per week$100k - $172.5k
...Description: We are searching for the best talent for a Principal Product Security Engineer to be located in Danvers, MA or Raritan, NJ. Remote work... ...distance to site). Partner with engineering teams (cloud, console, pump, etc.) to drive successful adherence to Abiomed...PrincipalCloudFull timeTemporary workWork at officeLocal areaImmediate startRemote work3 days per week$120k - $190k
...includes MSG Networks, which operates two regional sports and entertainment... ...and authenticated streaming product, MSG+, delivering a wide... ...and Production Technology Engineer works closely with Corporate... ...Resolve, Adobe Creative Cloud, Unreal Engine, Baselight, RV...CloudLocal areaRemote work$25.95 - $28.84 per hour
...bachelor’s degree and has 1-2 years of experience in email marketing or digital communications. Proficiency in Salesforce Marketing Cloud and Google Apps is preferred. The role may allow for remote or hybrid work options and offers a pay range of $25.95 - $28.84 per hour...CloudRemote jobHourly pay$110.04k - $204.36k
...Network Reliabili ty and Automation Engineer function will be responsible for working with the Operations and Engineering Teams around... ...in an enterprise network production environment Additional years... ...Ansible, Puppet, Terraform , Arista Cloud Vision and or GitHub Actions...CloudTemporary workLocal areaFlexible hours$170k - $200k
...Senior Software Engineer – AI & Workflow Automation POSITION TITLE: Senior... ...closely with the Tech Lead, operations, and other engineers to deliver production-ready automation tools that make... ..., labeling, or prompt design. Cloud experience (AWS, GCP, or Azure)....CloudLocal areaRemote workWorldwide$180k - $235k
...re looking for a hands-on AI Engineer who loves to build. You'll design... ...that stretch across product design, marketing, supply chain... ...using APIs, open models, and cloud-native services to validate ideas... ...design, supply chain, retail operations, and engineering teams to...PrincipalCloud- ...currently seeking an Associate Software Engineer - Automation Tester in IT Applications... ...experience. - Introductory knowledge of cloud concepts (AWS fundamentals preferred).... ...participating in interviews,-please contact People Operations at ****@*****.*** ....CloudMinimum wageFull timeContract workTemporary workWork experience placementRemote work
$213.36k - $320.04k
...Principal Software Engineer Paramount Skydance Corp. is seeking a Principal Software Engineer to architect... ...quality, automation, and developer productivity across the Global Quality Engineering... ...distributed systems, microservices, cloud platforms (AWS, GCP, OCI or Azure),...PrincipalCloud
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Principal Cloud and Production Operations Engineer. Be the first to apply!
- senior aws cloud engineer Encino, CA
- senior cloud network engineer Encino, CA
- principal Encino, CA
- senior cloud service delivery manager Encino, CA
- travel operations Encino, CA
- business operations intern Encino, CA
- operations tech Encino, CA
- loan operations Encino, CA
- vice president manufacturing operations Encino, CA
- data center operations technician Encino, CA


