Distinguished Engineer, Data Platform

CloudZero

About the Role

CloudZero is growing fast. Our customer base is expanding, the data challenges we're solving are getting more complex, and the platform is scaling to match. As a Distinguished Engineer on the Data Engineering team, you'll own some of the hardest infrastructure problems at CloudZero: shaping the next-generation streaming data platform, the dimensional cost model underlying every attribution decision, the hot/cold storage architecture serving both real-time and historical queries, and the query engine that powers our entire product.

This is real platform architecture work at real scale, not a consulting role or a review-and-advise job. You'll define the roadmap, drive the foundational decisions, and be a force multiplier for a talented engineering team, evolving CloudZero from batch-oriented pipelines toward a streaming-first architecture where cost attribution reaches engineers within seconds of a resource being used, not the next morning.

This role is ideal for an architect who has built systems like this before, has the scars to prove it, and wants to see their decisions matter in direct and measurable ways for customers and for the business.

What You'll Do

Define the Data Platform Architecture
  • Lead end-to-end technical design for CloudZero's next-generation data platform, from event ingestion and stream processing through hot/cold storage and the query layer to the API surface
  • Document architectural decisions, tradeoffs, and migration strategies with the rigor of an RFC-driven process
  • Shape and drive every layer of the new architecture: event ingestion, stream processing and enrichment, real-time serving, analytical storage, query layer, and API
Drive Streaming Infrastructure to Production
  • Design and deliver CloudZero's real-time data pipeline from ingestion through enrichment to serving
  • Establish SLOs for throughput, latency, and correctness, and build the operational playbooks that make this system trustworthy enough to replace the batch pipelines our entire product currently depends on
  • Tackle real-time streaming at scale across thousands of customers simultaneously, with fault tolerance, backpressure awareness, and correctness as non-negotiables
Tackle the Dimension Cardinality Problem
  • Redesign CloudZero's dimensional cost model to support high-cardinality, multi-dimensional cost attribution without runaway materialization costs
  • Drive incremental, delta-based materialization strategies using modern open table formats, dramatically reducing expensive full-rebuild jobs and unlocking millions in annual infrastructure savings
Evolve the Query Layer
  • Assess CloudZero's current query infrastructure, drive in-flight migrations to completion, and lead the evolution of the query engine layer going forward
  • Own performance optimization across partition pruning, predicate pushdown, and query planning, and set the vision for how the query layer grows as data volumes scale 10x
Extend Cost Attribution to Real-Time
  • Evolve CloudZero's proprietary cost attribution engine from a batch-oriented model to one that assigns complex cost dimensions by team, feature, and customer within seconds of resource usage
  • Rethink enrichment, data lineage, and correctness guarantees in a streaming context
Shape the Data Engineering Roadmap
  • Partner with product, infrastructure, and analytics engineering to define a multi-year data platform roadmap
  • Build consensus across engineering leadership on foundational investments including table formats, streaming frameworks, query engines, and schema management
Elevate the Engineering Team
  • Participate in architecture reviews, contribute to design patterns and best practices, and mentor senior and staff engineers through code review, pairing, and structured feedback
  • Make everyone around you better, not by directing, but by raising the collective craft

What You Bring

Data Platform & Architecture
  • 10+ years in data engineering with a clear trajectory toward principal or staff-level architecture
  • Built and operated large-scale data platforms serving tens of millions of events per day in production
  • Deep experience with streaming systems such as Kafka, Kinesis, Flink, or Spark Streaming at real production throughput
  • Strong hands-on fluency with modern open table formats including Apache Iceberg, Delta Lake, and Hudi, including compaction, partitioning strategy, and time-travel queries
  • Designed hot/cold storage architectures with explicit latency SLOs per tier
  • Proven ability to drive a data platform end to end, not just a single layer
Data Modeling & Dimensional Design
  • Expert in dimensional data modeling including fact/dimension schema design, slowly changing dimensions, and cardinality management
  • Deep understanding of the materialization tradeoff space: full vs. incremental, push vs. pull, pre-aggregate vs. query-time
  • Experience with cost attribution, showback/chargeback, or multi-tenant data partitioning patterns
  • Strong SQL and query optimization background across predicate pushdown, partition pruning, and cost-based query planning
Query Engines & Compute
  • Hands-on with distributed query engines such as Trino, Presto, Spark SQL, or DuckDB including configuration, optimization, and production operations
  • Understands catalog and metadata management and how it couples to query engines
  • Comfortable with cloud data warehouses such as Snowflake, BigQuery, and Redshift and how they integrate with open table formats
  • Experience driving query engine migrations while maintaining production SLAs
Engineering Leadership
  • Track record as a technical anchor for a data platform or data engineering team
  • Writes clear ADRs, RFCs, and technical design docs that bring engineers along
  • Can drive multi-month, multi-team technical initiatives from inception to production without heavy process overhead
  • Communicates complex tradeoffs to non-technical stakeholders including product and business leadership
  • Comfortable in a high-autonomy environment: builds consensus, influences through expertise, and helps teams move forward
Bonus If You Have...
  • FinOps or cloud cost domain experience
  • Multi-cloud data ingestion across AWS, Azure, and GCP
  • Apache Flink at production scale
  • Lakehouse architecture patterns
  • Real-time feature engineering for ML
  • Data mesh or domain-oriented design patterns
  • Prior startup or high-growth SaaS experience
  • Open source contributions to the data ecosystem

About CloudZero

Cloud cost management is one of the biggest challenges organizations face today. As cloud adoption continues to accelerate, so do the complexities and costs associated with it, and macroeconomic conditions only increase pressure to prove cloud efficiency.

CloudZero is a SaaS platform at the intersection of next-generation cloud cost management and FinOps. We ingest billing and usage data from all cloud, SaaS, and PaaS providers, organize it in real time according to our customers' business structures, and empower organizations to make more informed business decisions.

Since our founding in 2016, our mission has been to make efficient innovation a reality for every cloud-driven organization. We believe every engineering decision is a buying decision, and we're applying proven reliability engineering principles to financial efficiency.

We believe the best AI empowers users with clear insights and confident decisions, transforming complex cloud cost data into actionable intelligence that drives meaningful business outcomes.

To date, we've raised over $56 million from leading venture capital firms. We're solving problems of massive scale, business importance, and complexity in a space that needs it more than ever.

Equal Opportunity Employer

CloudZero is an equal opportunity employer and values diversity. We do not discriminate on the basis of race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status or disability status. All job offers are contingent upon the candidate passing background and reference checks.
Vacancy posted 1 day ago