Distinguished Engineer, Data Platform
CloudZero
About the Role CloudZero is growing fast. Our customer base is expanding, the data challenges we're solving are getting more complex, and the platform is scaling to match. As a Distinguished Engineer on the Data Engineering team, you'll own some of the hardest infrastructure problems at CloudZero: shaping the next-generation streaming data platform, the dimensional cost model underlying every attribution decision, the hot/cold storage architecture serving both real-time and historical queries, and the query engine that powers our entire product. This is real platform architecture work at real scale, not a consulting role or a review-and-advise job. You'll define the roadmap, drive the foundational decisions, and be a force multiplier for a talented engineering team - evolving CloudZero from batch-oriented pipelines toward a streaming-first architecture where cost attribution reaches engineers within seconds of a resource being used, not the next morning. This role is ideal for an architect who has built systems like this before, has the scars to prove it, and wants to see their decisions matter in direct and measurable ways for customers and for the business.
What You'll Do Define the Data Platform Architecture
Equal Opportunity Employer CloudZero is an equal opportunity employer and values diversity. We do not discriminate on the basis of race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status or disability status. All job offers are contingent upon the candidate passing background and reference checks.
What You'll Do Define the Data Platform Architecture
- Lead end-to-end technical design for CloudZero's next-generation data platform, from event ingestion and stream processing through hot/cold storage and the query layer to the API surface
- Document architectural decisions, tradeoffs, and migration strategies with the rigor of an RFC-driven process
- Shape and drive every layer of the new architecture: event ingestion, stream processing and enrichment, real-time serving, analytical storage, query layer, and API
- Design and deliver CloudZero's real-time data pipeline from ingestion through enrichment to serving
- Establish SLOs for throughput, latency, and correctness, and build the operational playbooks that make this system trustworthy enough to replace the batch pipelines our entire product currently depends on
- Tackle real-time streaming at scale across thousands of customers simultaneously, with fault tolerance, backpressure awareness, and correctness as non-negotiables
- Redesign CloudZero's dimensional cost model to support high-cardinality, multi-dimensional cost attribution without runaway materialization costs
- Drive incremental, delta-based materialization strategies using modern open table formats, dramatically reducing expensive full-rebuild jobs and unlocking millions in annual infrastructure savings
- Assess CloudZero's current query infrastructure, drive in-flight migrations to completion, and lead the evolution of the query engine layer going forward
- Own performance optimization across partition pruning, predicate pushdown, and query planning, and set the vision for how the query layer grows as data volumes scale 10x
- Evolve CloudZero's proprietary cost attribution engine from a batch-oriented model to one that assigns complex cost dimensions by team, feature, and customer within seconds of resource usage
- Rethink enrichment, data lineage, and correctness guarantees in a streaming context
- Partner with product, infrastructure, and analytics engineering to define a multi-year data platform roadmap
- Build consensus across engineering leadership on foundational investments including table formats, streaming frameworks, query engines, and schema management
- Participate in architecture reviews, contribute to design patterns and best practices, and mentor senior and staff engineers through code review, pairing, and structured feedback
- Make everyone around you better, not by directing, but by raising the collective craft
- 10+ years in data engineering with a clear trajectory toward principal or staff-level architecture
- Built and operated large-scale data platforms serving tens of millions of events per day in production
- Deep experience with streaming systems such as Kafka, Kinesis, Flink, or Spark Streaming at real production throughput
- Strong hands-on fluency with modern open table formats including Apache Iceberg, Delta Lake, and Hudi, including compaction, partitioning strategy, and time-travel queries
- Designed hot/cold storage architectures with explicit latency SLOs per tier
- Proven ability to drive a data platform end to end, not just a single layer
- Expert in dimensional data modeling including fact/dimension schema design, slowly changing dimensions, and cardinality management
- Deep understanding of the materialization tradeoff space: full vs. incremental, push vs. pull, pre-aggregate vs. query-time
- Experience with cost attribution, showback/chargeback, or multi-tenant data partitioning patterns
- Strong SQL and query optimization background across predicate pushdown, partition pruning, and cost-based query planning
- Hands-on with distributed query engines such as Trino, Presto, Spark SQL, or DuckDB including configuration, optimization, and production operations
- Understands catalog and metadata management and how it couples to query engines
- Comfortable with cloud data warehouses such as Snowflake, BigQuery, and Redshift and how they integrate with open table formats
- Experience driving query engine migrations while maintaining production SLAs
- Track record as a technical anchor for a data platform or data engineering team
- Writes clear ADRs, RFCs, and technical design docs that bring engineers along
- Can drive multi-month, multi-team technical initiatives from inception to production without heavy process overhead
- Communicates complex tradeoffs to non-technical stakeholders including product and business leadership
- Comfortable in a high-autonomy environment: builds consensus, influences through expertise, and helps teams move forward
- FinOps or cloud cost domain experience
- Multi-cloud data ingestion across AWS, Azure, and GCP
- Apache Flink at production scale
- Lakehouse architecture patterns
- Real-time feature engineering for ML
- Data mesh or domain-oriented design patterns
- Prior startup or high-growth SaaS experience
- Open source contributions to the data ecosystem
Equal Opportunity Employer CloudZero is an equal opportunity employer and values diversity. We do not discriminate on the basis of race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status or disability status. All job offers are contingent upon the candidate passing background and reference checks.
Vacancy posted 5 hours ago
Similar jobs that could be interesting for youBased on the Distinguished Engineer, Data Platform in San Francisco, CA vacancy
$275k - $330k
...Full time Location Type Remote Department Engineering Data Engineering & AI Compensation ~$275K – $330K •... ...we're solving are getting more complex, and the platform is scaling to match. As a Distinguished Architect on the Data Engineering team, you'll own...SuggestedPermanent employmentFull timeRemote workVisa sponsorshipWork visaDay shift$269.1k - $307.2k
...Distinguished Data Engineer - Enterprise Data Technology Distinguished Data Engineers are individual contributors who strive to be diverse in... ...ensuring the stability and scalability of our core e-data platforms. As a horizontal organization, we manage a diverse tech stack...SuggestedFull timePart timeLocal area- Distinguished Data Engineer - Card Data Distinguished Data Engineers are individual contributors who strive to be diverse in thought so we visualize... ...Operate as a trusted advisor for a specific technology, platform or capability domain, helping to shape use cases and...Suggested
$269.1k - $307.2k
...Distinguished AI Engineer (Agentic AI Platform) At Capital One, we are creating responsible and reliable AI systems, changing banking for good. For years... ...production ~ Deep understanding of Responsible AI, data privacy and multi-tenant security patterns ~ Experience...SuggestedFull timePart timeWork at officeLocal area- Capital One National Association is seeking a Distinguished Data Engineer in San Francisco, California, to enhance data engineering practices within the Credit Card Technology Team. The role demands strong expertise in data engineering and architecture, with a focus on...Suggested
$196k - $245k
...one thing that nearly everyone does on our platform: play video games. Over 90% of our... ...accomplished and experienced Senior Software Engineer to join our dynamic team. In this role,... ...designing, developing, and maintaining our data infrastructure and services. You will...Full timeWork at officeRelocationRelocation package$320k - $405k
...Software Engineer, Research Data Platform San Francisco, CA | New York City, NY About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole....Work at officeVisa sponsorshipFlexible hours$200k - $236k
...Software Engineer, Data Platform Hybrid - SF Bay Area GlossGenius is the AI-powered system behind the world's most meaningful appointments, helping 100,000+ service businesses earn more revenue and free up time for the work they love. Our agentic workforce gets more...Work at officeHome officeFlexible hours3 days per week$200k - $220k
...of experts across energy, manufacturing, data center construction, and cloud services.... ...: Join Crusoe Energy as a Senior Data Engineer, an early and pivotal hire on our growing... ...architect and build the foundational data platform infrastructure that powers Crusoe's AI and...Full timeTemporary work$160k - $180k
...one thing that nearly everyone does on our platform: play video games. Over 90% of our... ...Go Live, every Quest completed, there's data, petabytes of it, telling the story of how... ...lovable products for Discord users and Discord engineers. We’re building the next generation Data...Full timeWorldwideRelocationRelocation package$202.5k - $247.5k
...Software Engineer III/Senior, Data Platform ngrok is an all-in-one cloud networking platform that secures, transforms, and routes traffic to services running anywhere. Instead of cobbling together nginx, NLBs, VPNs, model routers, and oodles of other tools, developers...Permanent employmentFull timeLive inWork at officeLocal areaRemote workHome officeFlexible hours$187k
...About the Role Chime's Data Platform team builds the infrastructure every engineering and analytics team depends on - ingestion, transformation, quality, governance, and self-serve tooling across batch and streaming workloads. You'll own core platform systems...Full timeWork at officeLocal areaRemote workNight shift$147.4k - $272.1k
...Sr Software Engineer, Ai & Data Platforms (Aidp) Imagine what you could do here. At Apple, we believe new insights have a way of becoming excellent products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling...Relocation- ...the only unified payments and financial platform for global businesses. Powered by our unique... ...team is at the heart of our company's data and AI strategy. We are building the foundational... ...Are? As a high level architect (staff engineer), you will oversee the strategy,...Work at officeWorldwide
- ...Posting At Sierra, we're creating a platform to help businesses build better, more human... ...You'll Do You'll join a full-stack data team building the systems making our AI... ...depth in one of these areas: Platform Engineering: You have designed, built, and operated...Full timeFlexible hours
- ...is an AI Storytelling and Visual Creation Platform used by millions worldwide. We're... ...Why Join OpenArt ~ Own the entire data foundation of a fast-scaling AI company... ...We're looking for a Founding Data Engineer to build and own OpenArt's core data platform...Remote workWorldwideVisa sponsorship
$248k - $279k
...one thing that nearly everyone does on our platform: play video games. Over 90% of our... ...someone who gets excited about building data infrastructure at massive scale and cares... ...lovable products for Discord users and Discord engineers. We're building the next generation Data...Full timeWork at officeWorldwideRelocationRelocation package$268k - $368.5k
...Faire Faire is a technology wholesale platform built on the belief that the future is... ...Faire, we're using the power of tech, data, and machine learning to connect this thriving... ...join ours. About this role Our Engineering organization owns the software that...Work experience placementWork at officeLocal areaRemote workMonday to FridayFlexible hours3 days per week$140k - $210k
...them. Actively addresses this at the structural level. Our platform deploys Per-Account AgentsTM across our customers' TAM,... ...more. About the Role We're looking for a Senior/Staff Data Platform Engineer to build and scale the foundation of Actively's data...Work at officeFlexible hoursShift work$252k - $315k
...throughout life. As the world adjusts to this new reality, leading platform companies are scrambling to build LLMs at billion scale,... .... At Scale, our products include the Generative AI Data Engine, SGP, Donovan, and others that power the most advanced LLMs and...Full timeLive in- An educational technology company is seeking a Staff Analytics Engineer to take ownership of analytics data management. You will enhance data quality, create scalable data models, and leverage AI for improved workflows. Ideal candidates will have strong experience in analytics...
- ...AirOps AirOps is the first end-to-end content engineering platform built for the AI era. In a world where discovery is shifting from traditional... ...exactly how they show up across AI search—and that data has to be fast, accurate, and trusted. We've outgrown "data...Flexible hoursShift work
$197.3k - $313.7k
...you are not duplicating efforts. Job Category Software Engineering Job Details About Salesforce Salesforce is the #1... ...The Principal Member of Technical Staff for the Enterprise Data Platform is the primary technical architect responsible for modernizing...- ...Hivemapper is a decentralized global map data network built by 10s of thousands of... ...value to large fleets of vehicles. Data Platform Every day, we process data from millions... ...clusters called Map AI. The Platform Engineering team is responsible for the core...Flexible hours
$207k - $362.25k
...addresses. About the role Rippling's Data Cloud underpins every analytical experience across our platform-from real-time dashboards for HR and IT admins... ...analytics, and applications. As a Senior Staff Engineer on the Query and Data Platform team, you will...Work at officeShift work3 days per week$192k - $240k
...Senior Full Stack Engineer At Snorkel, we believe meaningful AI doesn't start with the model, it starts with the data. We're on a mission to help enterprises transform expert knowledge... ...that power our Expert Data Collection Platform, a critical engine for high-quality ML...Work at officeLocal area3 days per week- About the Role We're looking for a Senior Staff Data Engineer to be the technical backbone of our Data & ML Platform team — the foundation powering analytics, product experiences, and machine learning across Hinge Health. This is a high-ownership IC role for someone who...Work at officeLocal areaImmediate startRemote workWorldwide3 days per week
$194k - $267k
...mission. If you are too, let's talk. About the Team The Data Platform team is responsible for the foundational data services,... ...flexible. We encourage ownership. We expect great things from our engineers and reward them with stimulating new projects, new...Permanent employmentWork at officeLocal areaWorldwideFlexible hours$220k - $250k
...Senior / Staff Software Engineer, Data Platform Title of Role: Senior / Staff Software Engineer, Data Platform Location: San Francisco, hybrid Company Stage of Funding: Venture Round — Software Development, AI Office Type: Hybrid Salary: $220K–$250K Company...Work at office$120k - $170k
...About the Role You’ll join the Data Platform team, responsible for building the backend services and “data products” that power how... ...and help make our data platform more self-serve so product and engineering teams can easily create and operate event-driven...Full timeInternship
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Distinguished Engineer, Data Platform. Be the first to apply!
Related searches
- director data engineering San Francisco, CA
- junior big data engineer San Francisco, CA
- data engineer graduate San Francisco, CA
- senior data engineer San Francisco, CA
- data platform engineer San Francisco, CA
- sr information security engineer San Francisco, CA
- senior data integration developer San Francisco, CA
- data developer San Francisco, CA
- data engineer San Francisco, CA
- data infrastructure engineer San Francisco, CA

