Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Principal Engineer - Observability [Remote]

$206k - $303k

CoreWeave

Sunnyvale, CA
  • Remote job

CoreWeave is the AI Hyperscaler™, delivering a cloud platform of cutting edge services powering the next wave of AI. Our technology provides enterprises and leading AI labs with the most performant, efficient and resilient solutions for accelerated computing. Since 2017, CoreWeave has operated a growing footprint of data centers covering every region of the US and across Europe. CoreWeave was ranked as one of the TIME100 most influential companies of 2024.

As the leader in the industry, we thrive in an environment where adaptability and resilience are key. Our culture offers career-defining opportunities for those who excel amid change and challenge. If you’re someone who thrives in a dynamic environment, enjoys solving complex problems, and is eager to make a significant impact, CoreWeave is the place for you. Join us, and be part of a team solving some of the most exciting challenges in the industry.  

CoreWeave powers the creation and delivery of the intelligence that drives innovation. 

About this Role:

We are seeking a highly experienced and strategic Principal Engineer to lead the architecture, development, and operations of our Observability product. In this role, you will help shape the vision and direction on how customers monitor, troubleshoot and run their AI workloads effectively, at scale. You will have direct access to customers and work closely with engineering stakeholders across multiple teams to advance the development of our unified Observability experience across CoreWeave products.

What You'll Do:

  • Lead the strategy and implementation for Observability, ensuring alignment with business goals and performance objectives.
  • Design and implement advanced solutions, including low-latency, high-scale Observability pipelines across all products.
  • Build solutions that offer insights to customers for rapid troubleshooting of their AI workloads.
  • Champion initiatives to improve the reliability, durability, and self-healing capabilities of Observability metrics, and assume operational responsibilities.
  • Help shape customer experience by promoting unparalleled visibility into our systems’ performance and reliability with customer facing metrics and dashboards.
  • Analyze telemetry for production systems to identify opportunities for improvement in performance and reliability.
  • Develop operational review practices for storage engineering to assess performance against targets and iterating on those targets.
  • Act as a trusted advisor to senior leadership, providing insights on storage industry trends and advocating for investments in storage technologies.
  • Collaborate with engineering, infrastructure, and product teams to ensure storage solutions align with evolving project requirements and technical architecture.
  • Mentor and guide engineering teams on best practices in product engineering, fostering a customer-focused approach to systems design and technical excellence.

Who You Are:

  • Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.
  • 10+ years of experience in distributed systems, with a focus on reliability and scale.
  • Proven experience leading storage product engineering projects and building products to address customer needs.
  • Proficiency in one or more programming languages (e.g. Go, C, Rust).
  • Good understanding of distributed observability systems such as ClickHouse for telemetry at scale.
  • Strong understanding of cloud computing infrastructure using Kubernetes, scalable architectures, and automation.
  • Excellent analytical and problem-solving skills, with the ability to synthesize problem statements from interactions with customers and design solutions given ambiguous requirements.
  • Strong communication and interpersonal skills, able to convey storage engineering strategies and practices to technical and non-technical audiences.
  • Prior experience with building Observability solutions is a plus.

The base pay and target total cash for this position range from $206,000 to $303,000 and $258,000 to $378,000, accordingly. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. This position includes a discretionary bonus, equity, and a comprehensive benefits package.

What We Offer

The range we’ve posted represents the typical compensation range for this role. To determine actual compensation, we review the market rate for each candidate which can include a variety of factors. These include qualifications, experience, interview performance, and location.

In addition to a competitive salary, we offer a variety of benefits to support your needs, including:

  • Medical, dental, and vision insurance - 100% paid for by CoreWeave
  • Company-paid Life Insurance 
  • Voluntary supplemental life insurance 
  • Short and long-term disability insurance 
  • Flexible Spending Account
  • Health Savings Account
  • Tuition Reimbursement 
  • Ability to Participate in Employee Stock Purchase Program (ESPP)
  • Mental Wellness Benefits through Spring Health 
  • Family-Forming support provided by Carrot
  • Paid Parental Leave 
  • Flexible, full-service childcare support with Kinside
  • 401(k) with a generous employer match
  • Flexible PTO
  • Catered lunch each day in our office and data center locations
  • A casual work environment
  • A work culture focused on innovative disruption

Our Workplace

While we prioritize a hybrid work environment, remote work may be considered for candidates located more than 30 miles from an office, based on role requirements for specialized skill sets. New hires will be invited to attend onboarding at one of our hubs within their first month. Teams also gather quarterly to support collaboration

California Consumer Privacy Act - California applicants only

CoreWeave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information.

As part of this commitment and consistent with the Americans with Disabilities Act (ADA) , CoreWeave will ensure that qualified applicants and candidates with disabilities are provided reasonable accommodations for the hiring process, unless such accommodation would cause an undue hardship. If reasonable accommodation is needed, please contact: View email address on swooped.co .

Export Control Compliance

This position requires access to export controlled information.  To conform to U.S. Government export regulations applicable to that information, applicant must either be (A) a U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii) refugee under 8 U.S.C. § 1157, or (iv) asylee under 8 U.S.C. § 1158, (B) eligible to access the export controlled information without a required export authorization, or (C) eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency.  CoreWeave may, for legitimate business reasons, decline to pursue any export licensing process.

Vacancy posted more than 2 months ago
Similar jobs that could be interesting for youBased on the Principal Engineer - Observability [Remote] in Sunnyvale, CA vacancy
  • $218.8k - $335.3k

     ...responsible for delivering and maintaining the tools and services engineers here at GM use every day to do their best work and drive our...  ...engineers. This person will start delivering impact through observability frameworks and will evolve depending on business needs but... 
    Suggested
    Full time
    Local area
    Remote work
    Work from home
    Relocation package
    Flexible hours

    General Motors

    Mountain View, CA
    2 days ago
  • Netflix, Inc. is seeking an experienced Engineering Manager to lead the Client Delivery & Observability (CDO) team. In this role, you will ensure every client release, server canary, and A/B test is safely delivered while building a high-performing team of engineers. Responsibilities... 
    Suggested
    Flexible hours

    Netflix, Inc.

    Los Gatos, CA
    5 days ago
  •  ..., building, and launching spacecraft. They are seeking a Principal Mechanical Engineer to lead the design and development of advanced spacecraft...  ...launching spacecraft, with a mission-driven approach to Earth observation, environmental monitoring, and space-based intelligence.... 
    Suggested
    Relocation package
    Flexible hours

    Australia-Employment

    Mountain View, CA
    4 days ago
  • $272k - $431.25k

     ...Demonstrable experience in implementing left‑shift strategy to de‑risk program execution. BS or MS degree in Computer Science, Electrical Engineering or related field (or equivalent experience). 15+ years in the area of system architecture and design. Ways to stand out from the... 
    Suggested
    Shift work

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  •  ...Serve as the technical authority for enterprise BI and analytics engineering solutions, providing guidance and mentorship to engineers and...  ...for data governance, metadata management, reliability, observability, and operational excellence. Participate in strategic planning... 
    Suggested
    Work at office

    Coherent

    Santa Clara, CA
    4 days ago
  • Overview We are looking for an experienced Engineering Manager to lead the Client Delivery & Observability (CDO) team, a newly formed group that owns the release delivery automation platform and the real‑time observability stack, ensuring every client release, server canary... 
    Flexible hours

    Netflix, Inc.

    Los Gatos, CA
    2 days ago
  • $190.9k - $253.75k

    A leading data and AI company is seeking a Software Engineer for their Observability team in Mountain View, California. The role focuses on developing observability solutions to enhance product monitoring and performance. Candidates should have at least 7 years of experience... 

    Menlo Ventures

    Mountain View, CA
    2 days ago
  • $220.2k - $330.4k

    Company:Qualcomm Technologies, Inc.Job Area:Engineering Group, Engineering Group > Systems...  ...prem, and hybrid cloud scenarios. As a Principal Systems Solutions Architect, you will define...  ...; design for offline/online modes, observability, and device management.Agentic AI:... 
    Work experience placement
    Work at office
    Work from home

    Nutanix

    Santa Clara, CA
    3 days ago
  • $175.8k - $293k

     ...they can seize a competitive advantage. We're looking for a Principal AI Engineer to architect, build, and harden the agentic AI systems that...  ...evaluation, and turn promising prototypes into reliable, observable, governed production systems. Here is how, through this role... 

    BMC Software

    Santa Clara, CA
    2 days ago
  • $147k - $237.5k

     ..., protecting our digital way of life. We are looking for an Engineering Manager to lead the Explicit Proxy team, one of the most technically...  ...cloud‑native microservices. Ensure efficient deployment, observability, and runtime stability in production environments.... 

    Palo Alto Networks

    Santa Clara, CA
    5 days ago
  • $248k - $396.75k

     ...this exciting endeavor! We are seeking a highly skilled Principal AI/ML Engineer to join our dynamic team to build the next generation of...  ...(Arista, Cumulus, Cisco, Palo Alto, load balancers) plus observability integration (Prometheus/Grafana) tied into automation/validation... 

    NVIDIA Gruppe

    Santa Clara, CA
    3 days ago
  • $140k - $215k

     ...starts with you. About the Role At CrowdStrike, Site Reliability Engineering (SRE) is at the forefront of ensuring the reliability and...  ...including multi‑cloud failover strategies. Advanced observability experience including Prometheus, Grafana, distributed tracing... 
    Full time
    Work experience placement
    Work at office
    Local area
    2 days per week

    Koitecc Solutions

    Sunnyvale, CA
    4 days ago
  •  ...organizations that keep the world running. Our Team's Vision Our Engineering team is shaping the future of cybersecurity. We thrive on...  ...deployment, including prompt engineering, data pipelines, observability, evaluation frameworks, and cost/performance/latency trade‑offs... 
    Full time
    Work experience placement

    Illumio

    Sunnyvale, CA
    5 days ago
  •  ...a week in Sunnyvale, CA Headquarters. Our Team's Vision: Our Engineering team is driven by a culture that thrives on visionary leadership...  ...of cloud solutions, ensuring high‑quality automation, observability, and operational excellence. Lead and manage a cloud engineering... 
    Immediate start

    Illumio

    Sunnyvale, CA
    4 days ago
  • $280k - $385k

    A leading data and AI company seeks senior leaders to define the strategy for its security platform, focusing on Authentication. Candidates should have extensive experience in Data Security, leadership skills, and a strong communication background. The role offers a competitive...
    Remote work

    Databricks

    Mountain View, CA
    5 days ago
  • Intuitive in Sunnyvale, California is seeking a Senior Software Engineer to design and develop robust software observability applications for its da Vinci™ Surgical System. The role involves collaboration with cross-functional teams to enhance system introspection and... 

    Intuitive

    Sunnyvale, CA
    1 day ago
  • $233.7k - $336.3k

     ...large matrix of key stakeholders, to ensure seamless system integration. Resource Management: Leverage and direct both internal engineering talent and external Tier 1/2 partners to execute complex hardware roadmaps. Subject Matter Expert: Provide expert‑level review of... 

    42dot

    Sunnyvale, CA
    3 days ago
  •  ...the talent we hire. We’re looking for a Principal SRE to join our InfoSec SRE team that...  ...automate robust deployments and end‑to‑end observability across production environments....  ....S. Citizen. BS/MS in Computer Science/Engineering or equivalent training, education, and... 
    Full time
    Work at office
    Visa sponsorship
    Work visa

    Palo Alto Networks, Inc.

    Santa Clara, CA
    5 days ago
  • $151.6k - $245.3k

     ...infrastructure and is one of the largest GCP customers. As a Principal Site Reliability Engineer for the ADEM (Autonomous Digital Experience Management)...  .... This includes automation, architecture, performance, observability, troubleshooting, security, and reliability. Our... 
    Full time
    Work at office
    Visa sponsorship
    Work visa

    Palo Alto Networks, Inc.

    Santa Clara, CA
    3 days ago
  • $202k - $247k

    Job Category Site Reliability Engineering Posting Date 11/18/2025, 12:24 AM Locations Santa...  .../connectivity, workload management, observability, and storage services. We build tooling...  ...infrastructure. About this role: As a Principal Site Reliability Engineer at FortiCNAPP... 
    Full time
    Worldwide

    Fortinet, Inc.

    Santa Clara, CA
    2 days ago
  • $200k - $260k

     ...rapidly grown into a team of over 200 engineers, scientists, and innovators focused on...  ...helping accelerate the future of Earth observation, environmental monitoring, and space-based...  ...Job Details Our client is seeking a Principal Mechanical Engineer to lead the design... 
    Local area
    Relocation package
    Flexible hours

    Jobot

    San Jose, CA
    3 days ago
  • $157k - $217.4k

     ...Product Development Job Sub Function R&D Electrical/Mechatronic Engineering Job Category Scientific/Technology Locations Santa Clara,...  ...Solutions, part of the Johnson & Johnson family, is recruiting a Principal Electrical Engineer for MONARCH. The position involves leading... 

    Israelvcforum

    Santa Clara, CA
    3 days ago
  • $233.7k - $336.3k

     ...At 42dot, we are building the high‑performance hardware that powers the next generation of autonomous mobility. As the Principal Mechanical Engineer, you will be the technical lead for all mechanical aspects of our compute platforms. Your scope ranges from the precision... 

    42dot Inc.

    Sunnyvale, CA
    4 days ago
  • $157k - $271.4k

     ...Johnson & Johnson Robotics & Digital Solutions is recruiting a Principal Electrical Engineer for the MONARCH platform at Santa Clara, CA. Overview We seek an engineer with deep expertise in PCB design, electrical system architecture, and medical device safety standards... 
    Temporary work
    Local area

    6267-Auris Health Inc. Legal Entity

    Santa Clara, CA
    4 days ago
  •  ...: This role requires US Citizenship. Your Career As a Principal Site Reliability Engineer, you will serve as the technical authority for our cloud...  ...in Python to eliminate manual toil and improve system observability. AI‑Driven Development : Utilize Cursor and Claude to... 
    Visa sponsorship
    Work visa
    Shift work

    Palo Alto Networks, Inc.

    Santa Clara, CA
    2 days ago
  •  ...Intuit is seeking a highly motivated and experienced Principal Machine Learning Engineer to join our Mid Market AI team. In this influential role, you will lead the design, development, and deployment of end-to-end AI/ML solutions that power the next generation of intelligent... 

    Intuit Inc.

    Mountain View, CA
    3 days ago
  • $148.7k - $297.3k

     ...500,000 people with diabetes from routine fingersticks. Principal AI/ML Engineer Location: Santa Clara, CA. What You’ll Work On Lead end-to...  ...versioning, annotation tools, model serving, monitoring and observability. Experience with GenAI and Agentic AI development... 

    Abbott Laboratories company

    Santa Clara, CA
    4 days ago
  • $182.2k - $250k

     ...environments of the industries we serve. Senior Director of Engineering RUCKUS is hiring a Senior Director of Engineering to lead a global...  ...Promote modern practices including automation, testing, observability, and continuous improvement Executive & Cross‑Functional Leadership... 
    Worldwide

    Vistance Networks

    Sunnyvale, CA
    2 days ago
  •  ...NVIDIA Corporation is seeking a Principal Rack Scale Systems Infrastructure Engineer in Santa Clara, California. In this role, you will define software architecture for rack-scale infrastructure products and mentor engineers while collaborating with hardware teams.... 

    Jobleads-US

    Santa Clara, CA
    22 hours ago
  • $103.6k - $155.4k

     ...history, they're making history. Northrop Grumman Mission Systems (NGMS) is looking for you to join our team as a Principal Manufacturing Engineer based out of Sunnyvale, CA. You will support the manufacturing and testing of marine launcher systems, working on... 
    Work experience placement
    Relocation package
    Shift work
    Weekend work

    Northrop Grumman

    Sunnyvale, CA
    4 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Principal Engineer - Observability [Remote]. Be the first to apply!