Senior Software Engineer, Observability
$139k - $220kCoreWeave
Job Description
Job Description
CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. Learn more at
What You'll Do:Join CoreWeave's Observability team, responsible for building the systems that give our customers and internal teams unparalleled visibility into complex AI workloads. Our team empowers engineers to understand, troubleshoot, and optimize high-performance infrastructure at massive scale.
About the role:
As a Senior Software Engineer on the Observability team, you will design, build, and maintain core observability infrastructure spanning metrics, logging, tracing, and telemetry pipelines. Your day-to-day will involve developing highly reliable and scalable systems, collaborating with internal engineering teams to embed observability best practices, and tackling performance and reliability challenges across clusters of thousands of GPUs. You'll also contribute to platform strategy and participate in on-call rotations to ensure critical production systems remain robust and operational.
- 5+ years of experience in software or infrastructure engineering with a focus on designing, building, and operating large-scale distributed systems in production.
- Proficient in Go or Python with experience writing clean, testable, and resilient production code.
- Hands-on experience with Kubernetes, containerization, and microservices architectures in production environments.
- Proven ability to design and deliver scalable, robust systems with high-quality code, automated testing, and progressive release strategies.
- Skilled in decomposing complex problems in distributed architectures into manageable, well-scoped work.
- Familiar with Helm and YAML-based configurations for deploying and managing services, including templating, automation, and infrastructure-as-code practices.
- Experience participating in on-call rotations for critical production systems.
- Bachelor's degree in Computer Science, Electrical Engineering, Mathematics, or related field.
- Experience designing, operating, or scaling logging, metrics, or tracing platforms (e.g., Loki, ClickHouse, Elasticsearch, Prometheus, VictoriaMetrics, Grafana, Thanos).
- Familiarity with data streaming systems for observability pipelines (e.g., Kafka, Kafka Connect).
- Experience automating infrastructure provisioning using tools like Terraform.
- Knowledge of OpenTelemetry for unified telemetry collection and instrumentation.
- Exposure to modern AI workloads and GPU-based infrastructure, including large-scale training and inference.
- You love building systems that provide deep visibility into complex, high-scale environments.
- You're curious about observability, telemetry, and platform performance at massive scale.
- You're an expert in distributed systems and engineering resilient, scalable software.
Why CoreWeave?
At CoreWeave, we work hard, have fun, and move fast! We're in an exciting stage of hyper-growth that you will not want to miss out on. We're not afraid of a little chaos, and we're constantly learning. Our team cares deeply about how we build our product and how we work together, which is represented through our core values:- Be Curious at Your Core
- Act Like an Owner
- Empower Employees
- Deliver Best-in-Class Client Experiences
- Achieve More Together
We support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and enables the development of innovative solutions to complex problems. As we get set for takeoff, the organization's growth opportunities are constantly expanding. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Come join us!
The base salary range for this role is $139,000 to $220,000. The starting salary will be determined based on job-related knowledge, skills, experience, and market location. We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility).
What We Offer
The range we've posted represents the typical compensation range for this role. To determine actual compensation, we review the market rate for each candidate which can include a variety of factors. These include qualifications, experience, interview performance, and location.
In addition to a competitive salary, we offer a variety of benefits to support your needs. The benefits below reflect our US-based offerings; for roles in other locations, benefits vary and are shared during the hiring process. These include:
- Medical, dental, and vision insurance - 100% paid for by CoreWeave
- Company-paid Life Insurance
- Voluntary supplemental life insurance
- Short and long-term disability insurance
- Flexible Spending Account
- Health Savings Account
- Tuition Reimbursement
- Ability to Participate in Employee Stock Purchase Program (ESPP)
- Mental Wellness Benefits through Spring Health
- Family-Forming support provided by Carrot
- Paid Parental Leave
- Flexible, full-service childcare support with Kinside
- 401(k) with a generous employer match
- Flexible PTO
- Catered lunch each day in our office and data center locations
- A casual work environment
- A work culture focused on innovative disruption
California Applicants
California Consumer Privacy Act
Equal Opportunity & Accommodations
CoreWeave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information.
As part of this commitment and consistent with the Americans with Disabilities Act (ADA) , CoreWeave will ensure that qualified applicants and candidates with disabilities are provided reasonable accommodations for the hiring process, unless such accommodation would cause an undue hardship. If reasonable accommodation is needed, please contact: View email address on ziprecruiter.com.
Export Control Compliance
This position requires access to export controlled information. To conform to U.S. Government export regulations applicable to that information, applicant must either be (A) a U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii) refugee under 8 U.S.C. § 1157, or (iv) asylee under 8 U.S.C. § 1158, (B) eligible to access the export controlled information without a required export authorization, or (C) eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency. CoreWeave may, for legitimate business reasons, decline to pursue any export licensing process.
$178.42k - $230.5k
General Motors is seeking a Senior ML/AI Engineer to enhance developer productivity in Sunnyvale, California. This role emphasizes building tools for observability in software engineering and mentoring junior engineers. Candidates should have over 5 years of experience...Senior$235k - $295k
A data and AI company in Mountain View seeks a Software Engineer for the Observability team. This role involves developing solutions for product performance insights and managing cloud infrastructure. The ideal candidate has over 15 years of experience in software development...Senior- Senior Systems Software Engineer (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with... ...support operational and reliability aspects of large scale Observability & Telemetry collection platform with a focus on...Senior
$184k - $287.5k
NVIDIA Gruppe is seeking a Senior System Software Engineer to lead the development of their next-generation Data & Observability Platform in Santa Clara, California. This role focuses on high-performance ingestion, governance systems, and user experience improvements while...Senior$184k - $287.5k
NVIDIA’s Hardware Infrastructure organization is seeking a Senior System Software Engineer to lead the evolution of our next-generation Data & Observability Platform. We serve and collaborate directly with NVIDIA’s rapidly growing AI, HW, and SW engineering and research...Senior- ...Our team is responsible for the real‑time software infrastructure that supports critical performance, safety, observability and user‑facing features of the da Vinci... ...product safety and reliability. As a Senior Software Engineer, you will be collaborating with talented...Senior
$224k - $356.5k
We are seeking a Senior Software Engineer to help define the runtime intelligence and safety architecture behind next-generation autonomous... ...disagreement, and real-world safety constraints. Improve observability, reliability, and debuggability across large-scale autonomy...Senior$152k - $241.5k
...and tools that enable researchers and engineers to develop the next generation of AI/ML... ...computing workloads. We are seeking a Software Engineer to join our MARS team at NVIDIA... ...in system reliability, performance, and observability to meet exascale standards. Partner with...Senior$170.6k - $261.3k
Overview As a Senior Software Engineer on the SimCore team, you will build and deploy applied AI/ML solutions that directly support simulation... ...cloud platforms, Kubernetes, Docker, and production observability. Prior experience mentoring teams, defining architecture...SeniorFlexible hours$152k - $241.5k
...make a lasting impact on the world. We are looking for a Senior Software Engineer to join our mission to continue improving our HPC infrastructure... ..., build reviews, implementation, testing, rollout, observability, and iterative improvement. Hands‑on experience with at...Senior$144.7k - $221.4k
...analyses to introspect autonomous driving software performance at interfaces across the... ...closely with autonomy developers and systems engineers. Design and implement analysis... ...system design, code reviews, testing, observability, and adherence to software‑engineering...SeniorLocal areaRemote workRelocationRelocation packageFlexible hours$122k - $185k
Arlo Technologies, Inc. is seeking a Senior Frontend Engineer to drive Angular upgrades and lead CI/CD ownership. The ideal candidate has 5... ...downtime releases, ensuring top-notch performance through observability tools like Datadog. The expected salary ranges from USD$1...Senior- ...best practices for code quality, testing, deployment, and observability Mentor engineers across the team, raising the bar on design patterns, code... ..., or a related field, required 4+ years of professional software engineering experience, ideally in a high-growth or...Senior
$152k - $241.5k
...partner with OS, container, GPU, and systems engineers, and apply machine learning or deep... ...or prediction) within existing software workflows. Qualifications 5+ years of experience... ...analysis. Hands‑on use of telemetry/observability stacks (e.g., Grafana, Elasticsearch, Splunk...Senior$178.42k - $230.5k
...maintaining the tools and services engineers here at GM use every day to... ...Role We are looking for a Senior Engineer with an extensive... ...delivering impact through observability frameworks and will evolve depending... .... What You’ll Do Using your software and systems engineering...SeniorFull timeWork experience placementLocal areaWork from homeFlexible hours$181.1k - $272.1k
Senior OS Software Engineer, Field Engagement & Analytics Cupertino, California, United States Software and Services How are Apple devices used... ...requirements Experience with telemetry, analytics, or observability systems Experience with performance optimization under constrained...SeniorWorldwideRelocation$160k - $200k
As a Senior Software Engineer - Go (Golang), you will design, develop, and deliver high-performance middleware and application software solutions... ...automotive lifecycle processes Experience with system observability (logging, monitoring, tracing) and production support...SeniorFlexible hours$168k - $270.25k
...modeling and schema design, and expand observability over the factory pipeline and its compute... ...BS or MS in Computer Science, Computer Engineering or related field (or equivalent... ...experience developing microservices, cloud software and/or tooling roles. Desirable Experience...Senior- ...to help the staff boost their practice’s revenue. As a Senior Software Engineer Engineer, you will play a key role in scaling and maintaining... ..., Python, and PostgreSQL Improve system reliability and observability by implementing best practices in monitoring, logging,...SeniorImmediate start
$168k - $270.25k
Senior Software Engineer, Distributed Systems - NIM Factory page is loaded## Senior Software Engineer, Distributed Systems - NIM Factorylocations... ..., data modeling and schema design, and expanding observability over the factory pipeline and its compute infrastructure....SeniorRemote work- We are looking for a Senior Software Engineer to help build NeMo Platform, NVIDIA’s product for developing, evaluating, deploying, and operating... ...real workflows, teams need practical infrastructure for observing behavior, measuring progress, catching regressions, and...Senior
$224k - $431.25k
NVIDIA Gruppe is seeking a Senior System Software Engineer for Cloud in Santa Clara, California. The role involves designing and building scalable cloud solutions for GeForce NOW. Candidates should have extensive experience with Java, Golang, and Kubernetes, along with...Senior$147.4k - $272.1k
Senior Software Engineer - Distributed Systems Cupertino, California, United States Machine Learning and AI Our team is on a mission to build... ...Kubernetes. Strong interest in distributed storage, observability, reliability, and cloud services. Interest in working across...SeniorRelocation- A leading technology company is seeking a Senior System Software Engineer for Cloud in Santa Clara, CA. This role involves designing and deploying scalable cloud-based solutions for a cloud gaming service. The ideal candidate will have extensive experience with programming...Senior
$224k - $356.5k
At NVIDIA, our Financial Systems Engineering team is at the heart of ensuring that our massive... ...Design, deploy, and maintain scalable software services that ensure transactional... ..., including Kubernetes, Docker, CI/CD, observability, and reliability engineering. Your base...Senior- Sanas is looking for a skilled Production Engineer to manage the infrastructure for its high-scale, real-time speech AI platform. The... ...Terraform. The role emphasizes operational excellence, developer velocity, and deep observability across systems. #J-18808-Ljbffr SanasSenior
$152k - $241.5k
.... You will work with a diverse team of engineers in mapping, perception, reconstruction,... ...detection methods that can handle noisy observations, dynamic scenes, imperfect localization... ...experience building production-quality software systems. Solid foundation in 3D...SeniorFull timeWorldwide$170.6k - $261.3k
...systems to intuitive design, intelligent software, and next-generation safety and... ...bring the vehicle to a safe stop. As a Senior Software Engineer on the Secondary Driving System team... ...integration, performance profiling, and observability for on‑road incidents. Analyze and...SeniorRemote workRelocation packageFlexible hours- ...deliver this transformation. About the Role We are seeking a Senior Software Engineer for our R&D team at Commure, the team taking emerging... ...practices for code quality, testing, deployment, and observability Collaborate with product managers, designers, and clinical...SeniorFull timeWork at officeImmediate start
$235k - $295k
...insights to improve their business. Our engineering teams build technical products that fulfill... ...and operate one of the largest-scale software platforms. The fleet consists of millions... ...of data per day. At our scale, we observe cloud hardware, network, and operating system...SeniorLocal areaWorldwide
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Senior Software Engineer, Observability. Be the first to apply!
- software engineer amazon Sunnyvale, CA
- agile software developer Sunnyvale, CA
- rust software engineer Sunnyvale, CA
- software developer positions Sunnyvale, CA
- senior software design engineer Sunnyvale, CA
- software developer Sunnyvale, CA
- ngo software engineer Sunnyvale, CA
- startup software engineer Sunnyvale, CA
- software development engineer (robotics engineer) Sunnyvale, CA
- software data engineer Sunnyvale, CA
