Principal Engineer, Cluster Orchestration
$206k - $303k
CoreWeave
CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. Learn more at [
ABOUT THE ROLE
CoreWeave runs some of the largest GPU clusters in the world. The AI infrastructure behind those clusters determines how workloads are placed, how resources are shared, and how reliably systems perform under constant pressure. As a Principal Engineer in AI Infrastructure, you will lead the design and evolution of the cluster orchestration systems that make this possible. This includes Slurm, Kubernetes, SUNK, and the control planes that support AI training, inference, and model onboarding at scale. You will define long-term architecture, solve hard scaling problems, and set technical direction across teams. Your work will directly affect how quickly customers can run models, how efficiently we use GPUs, and how reliably the platform behaves at scale.
WHAT YOU’LL DO
ARCHITECTURE AND TECHNICAL DIRECTION
- Define the long-term architecture for CoreWeave’s orchestration platforms across Kubernetes, Slurm, SUNK, Kueue, and related systems.
- Act as a technical authority on scheduling, quota enforcement, fairness, pre-emption, and multi-tenant GPU isolation.
- Make design decisions that balance performance, reliability, cost, and operational complexity.
ORCHESTRATION PLATFORM DEVELOPMENT
- Lead the evolution of Kubernetes-native control planes, including SUNK and custom operators.
- Design systems that support workload admission, validation, and rollout, including model onboarding flows.
- Identify and remove scaling limits across schedulers, control planes, registries, networking, and storage.
RELIABILITY AND OPERATIONS
- Set standards for reliability, observability, and operational readiness across orchestration services.
- Define SLOs, alerting, and incident response practices for platform-critical systems.
- Ensure systems behave predictably during failures, peak load, and rapid growth.
HANDS-ON ENGINEERING
- Write and review production code for Kubernetes controllers, schedulers, admission logic, and internal tooling.
- Measure and improve scheduling latency, container startup time, image distribution, and cold-start performance.
- Lead architecture and design reviews across infrastructure teams.
LEADERSHIP AND INFLUENCE
- Mentor senior and staff engineers and help grow technical leaders.
- Influence platform, infrastructure, security, and product teams through clear
WHO YOU ARE
- 15+ years of experience building and operating large-scale distributed systems.
- Deep, practical knowledge of Kubernetes and Slurm internals.
- Experience running GPU-heavy platforms for AI training, inference, or HPC
- Strong background in Go and cloud-native systems development.
- Proven ability to set technical direction across teams without direct
- Comfortable making high-impact technical decisions in complex systems.
- Bachelor’s or Master’s degree in a relevant field, or equivalent experience.
PREFERRED QUALIFICATIONS
- Experience with systems such as Kueue, Kubeflow, Argo Workflows, Ray, Istio, or Knative.
- Background in ML platform engineering, model onboarding, or lifecycle management.
- Strong understanding of scheduling strategies, pre-emption, quota enforcement, and elastic scaling.
- Track record of operating highly reliable systems with clear SLOs and incident processes.
- Contributions to Kubernetes, ML infrastructure, or related open-source projects.
- Experience mentoring senior engineers and raising engineering standards.
IS THIS A GOOD FIT?
You may be a good fit if you enjoy defining long-term architecture, solving deep systems problems, and working close to the hardware layer of AI platforms. This role suits engineers who care about correctness, scale, and operational discipline, and who want their work to directly shape how AI runs in production.
WHY COREWEAVE?
At CoreWeave, AI infrastructure is the product. As a Principal Engineer in cluster orchestration, you will be responsible for systems that directly determine how efficiently GPUs are used, how reliably large models run, and how quickly customers can move from research to production. This role puts you at the center of hard problems in scheduling, resource isolation, and large-scale control planes. You will work on systems where small design choices affect thousands of GPUs and real customer workloads. If you care about building infrastructure that runs under constant pressure, scales without shortcuts, and enables the next generation of AI workloads, CoreWeave is a place where your work will matter.
The base salary range for this role is $206,000 to $303,000. The starting salary will be determined based on job-related knowledge, skills, experience, and market location. We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility).
What We Offer
The range we’ve posted represents the typical compensation range for this role. To determine actual compensation, we review the market rate for each candidate which can include a variety of factors. These include qualifications, experience, interview performance, and location.
In addition to a competitive salary, we offer a variety of benefits to support your needs. The benefits below reflect our US-based offerings; for roles in other locations, benefits vary and are shared during the hiring process. These include:
- Medical, dental, and vision insurance - 100% paid for by CoreWeave
- Company-paid Life Insurance
- Voluntary supplemental life insurance
- Short and long-term disability insurance
- Flexible Spending Account
- Health Savings Account
- Tuition Reimbursement
- Ability to Participate in Employee Stock Purchase Program (ESPP)
- Mental Wellness Benefits through Spring Health
- Family-Forming support provided by Carrot
- Paid Parental Leave
- Flexible, full-service childcare support with Kinside
- 401(k) with a generous employer match
- Flexible PTO
- Catered lunch each day in our office and data center locations
- A casual work environment
- A work culture focused on innovative disruption


