Distinguished Engineer, Data Platform
CloudZero
About the Role CloudZero is growing fast. Our customer base is expanding, the data challenges we're solving are getting more complex, and the platform is scaling to match. As a Distinguished Engineer on the Data Engineering team, you'll own some of the hardest infrastructure problems at CloudZero: shaping the next-generation streaming data platform, the dimensional cost model underlying every attribution decision, the hot/cold storage architecture serving both real-time and historical queries, and the query engine that powers our entire product. This is real platform architecture work at real scale, not a consulting role or a review-and-advise job. You'll define the roadmap, drive the foundational decisions, and be a force multiplier for a talented engineering team - evolving CloudZero from batch-oriented pipelines toward a streaming-first architecture where cost attribution reaches engineers within seconds of a resource being used, not the next morning. This role is ideal for an architect who has built systems like this before, has the scars to prove it, and wants to see their decisions matter in direct and measurable ways for customers and for the business.
What You'll Do Define the Data Platform Architecture
Equal Opportunity Employer CloudZero is an equal opportunity employer and values diversity. We do not discriminate on the basis of race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status or disability status. All job offers are contingent upon the candidate passing background and reference checks.
What You'll Do Define the Data Platform Architecture
- Lead end-to-end technical design for CloudZero's next-generation data platform, from event ingestion and stream processing through hot/cold storage and the query layer to the API surface
- Document architectural decisions, tradeoffs, and migration strategies with the rigor of an RFC-driven process
- Shape and drive every layer of the new architecture: event ingestion, stream processing and enrichment, real-time serving, analytical storage, query layer, and API
- Design and deliver CloudZero's real-time data pipeline from ingestion through enrichment to serving
- Establish SLOs for throughput, latency, and correctness, and build the operational playbooks that make this system trustworthy enough to replace the batch pipelines our entire product currently depends on
- Tackle real-time streaming at scale across thousands of customers simultaneously, with fault tolerance, backpressure awareness, and correctness as non-negotiables
- Redesign CloudZero's dimensional cost model to support high-cardinality, multi-dimensional cost attribution without runaway materialization costs
- Drive incremental, delta-based materialization strategies using modern open table formats, dramatically reducing expensive full-rebuild jobs and unlocking millions in annual infrastructure savings
- Assess CloudZero's current query infrastructure, drive in-flight migrations to completion, and lead the evolution of the query engine layer going forward
- Own performance optimization across partition pruning, predicate pushdown, and query planning, and set the vision for how the query layer grows as data volumes scale 10x
- Evolve CloudZero's proprietary cost attribution engine from a batch-oriented model to one that assigns complex cost dimensions by team, feature, and customer within seconds of resource usage
- Rethink enrichment, data lineage, and correctness guarantees in a streaming context
- Partner with product, infrastructure, and analytics engineering to define a multi-year data platform roadmap
- Build consensus across engineering leadership on foundational investments including table formats, streaming frameworks, query engines, and schema management
- Participate in architecture reviews, contribute to design patterns and best practices, and mentor senior and staff engineers through code review, pairing, and structured feedback
- Make everyone around you better, not by directing, but by raising the collective craft
- 10+ years in data engineering with a clear trajectory toward principal or staff-level architecture
- Built and operated large-scale data platforms serving tens of millions of events per day in production
- Deep experience with streaming systems such as Kafka, Kinesis, Flink, or Spark Streaming at real production throughput
- Strong hands-on fluency with modern open table formats including Apache Iceberg, Delta Lake, and Hudi, including compaction, partitioning strategy, and time-travel queries
- Designed hot/cold storage architectures with explicit latency SLOs per tier
- Proven ability to drive a data platform end to end, not just a single layer
- Expert in dimensional data modeling including fact/dimension schema design, slowly changing dimensions, and cardinality management
- Deep understanding of the materialization tradeoff space: full vs. incremental, push vs. pull, pre-aggregate vs. query-time
- Experience with cost attribution, showback/chargeback, or multi-tenant data partitioning patterns
- Strong SQL and query optimization background across predicate pushdown, partition pruning, and cost-based query planning
- Hands-on with distributed query engines such as Trino, Presto, Spark SQL, or DuckDB including configuration, optimization, and production operations
- Understands catalog and metadata management and how it couples to query engines
- Comfortable with cloud data warehouses such as Snowflake, BigQuery, and Redshift and how they integrate with open table formats
- Experience driving query engine migrations while maintaining production SLAs
- Track record as a technical anchor for a data platform or data engineering team
- Writes clear ADRs, RFCs, and technical design docs that bring engineers along
- Can drive multi-month, multi-team technical initiatives from inception to production without heavy process overhead
- Communicates complex tradeoffs to non-technical stakeholders including product and business leadership
- Comfortable in a high-autonomy environment: builds consensus, influences through expertise, and helps teams move forward
- FinOps or cloud cost domain experience
- Multi-cloud data ingestion across AWS, Azure, and GCP
- Apache Flink at production scale
- Lakehouse architecture patterns
- Real-time feature engineering for ML
- Data mesh or domain-oriented design patterns
- Prior startup or high-growth SaaS experience
- Open source contributions to the data ecosystem
Equal Opportunity Employer CloudZero is an equal opportunity employer and values diversity. We do not discriminate on the basis of race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status or disability status. All job offers are contingent upon the candidate passing background and reference checks.
Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Distinguished Engineer, Data Platform in San Francisco, CA vacancy
$228.4k - $303.55k
...Sr. Staff Software Engineer – Data Platform RDQ126R106 At Databricks, we are passionate about enabling data teams to solve the world’s toughest problems — from making the next mode of transportation a reality to accelerating the development of medical breakthroughs...SuggestedWorldwide$200k - $220k
...of experts across energy, manufacturing, data center construction, and cloud services.... ...: Join Crusoe Energy as a Senior Data Engineer, an early and pivotal hire on our growing... ...architect and build the foundational data platform infrastructure that powers Crusoe's AI and...SuggestedFull timeTemporary workWork at officeRemote work$202.5k - $247.5k
...Software Engineer III/Senior, Data Platform ngrok is an all-in-one cloud networking platform that secures, transforms, and routes traffic to services running anywhere. Instead of cobbling together nginx, NLBs, VPNs, model routers, and oodles of other tools, developers...SuggestedPermanent employmentFull timeLive inWork at officeLocal areaRemote workHome officeFlexible hours$200k - $236k
...Software Engineer, Data Platform Hybrid - SF Bay Area About GlossGenius GlossGenius is the AI-powered system behind the world's most meaningful appointments, helping 100,000+ service businesses earn more revenue and free up time for the work they love. Our agentic...SuggestedWork at officeHome officeFlexible hours3 days per week$320k - $405k
...Software Engineer, Research Data Platform San Francisco, CA | New York City, NY About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole....SuggestedWork at officeVisa sponsorshipFlexible hours- ...Vice President, Software Engineering – Agentic AI and Data Engineering Platform The Vice President, Software Engineering – Agentic AI and Data Engineering Platform is a senior technology executive responsible for defining and executing the engineering strategy for...
- ...Job Title You'll join a full-stack data team building the systems making our AI agents... ...: Architect and build our core platform to handle data at scale with low latency... ...depth in one of these areas: Platform Engineering You have designed, built, and operated...Full timeFlexible hours
$140k - $210k
...them. Actively addresses this at the structural level. Our platform deploys Per-Account AgentsTM across our customers' TAM,... ...more. About the Role We're looking for a Senior/Staff Data Platform Engineer to build and scale the foundation of Actively's data...Work at officeFlexible hoursShift work$268k - $368.5k
...Faire Faire is a technology wholesale platform built on the belief that the future is... ...Faire, we're using the power of tech, data, and machine learning to connect this thriving... ...join ours. About this role Our Engineering organization owns the software that...Work experience placementWork at officeLocal areaRemote workMonday to FridayFlexible hours3 days per week- ...is an AI Storytelling and Visual Creation Platform used by millions worldwide. We're... ...Why Join OpenArt ~ Own the entire data foundation of a fast-scaling AI company... ...We're looking for a Founding Data Engineer to build and own OpenArt's core data platform...Remote workWorldwideVisa sponsorship
$227.9k - $340k
...San Francisco, CA, USA Principal Engineer, Data Platform Location San Francisco, CA, USA Department Engineering Requisition ID JOBREQ-2615777 Role description The opportunity Unity is looking for a Principal Engineer, Data Platform to help...Work at officeWorldwide$190k - $240k
...Senior Full Stack Engineer At Snorkel, we believe meaningful AI doesn't start with the model, it starts with the data. We're on a mission to help enterprises transform expert knowledge into specialized AI at scale. The AI landscape has gone through incredible changes...Work at officeLocal area3 days per week- ...Hivemapper is a decentralized global map data network built by 10s of thousands of... ...value to large fleets of vehicles. Data Platform Every day, we process data from millions... ...clusters called Map AI. The Platform Engineering team is responsible for the core...Flexible hours
$194k - $267k
...mission. If you are too, let's talk. About the Team The Data Platform team is responsible for the foundational data services,... ...flexible. We encourage ownership. We expect great things from our engineers and reward them with stimulating new projects, new...Permanent employmentLocal areaWorldwideFlexible hours$176k - $210k
Komodo Health Inc is seeking a Senior Data Engineer based in San Francisco to develop and maintain AWS cloud infrastructure and enhance data quality. The ideal candidate will have a strong proficiency in Python and expert knowledge of AWS services, along with experience...$66k - $165k
...Diego, CA; San Francisco, CA; Boston, MA; Louisville, CO; Indianapolis, IN Position Summary We are seeking an Engineer - MLOps & Scientific Platforms - Data Foundry to operationalize Data Foundry’s scientific tools and analytical methods into actionable‑prototypes. You...Full timeH1bVisa sponsorshipWork visaFlexible hours- F2 AI is seeking a knowledgeable Infrastructure Engineer to design and secure infrastructure for our AI data platform in San Francisco. The role requires 6+ years of experience in infrastructure engineering, with expertise in AWS, GCP, or Azure. Key responsibilities include...
- A data intelligence platform is seeking a Senior Software Engineer in San Francisco to drive projects from ideation to production while working with cutting-edge technology in a high-autonomy environment. The ideal candidate has over 6 years of full-stack experience, particularly...Remote job
- Sephora USA, Inc is looking for a Senior Engineer, Marketing Technology in San Francisco. This hybrid role involves designing analytical solutions, building data infrastructure, and working cross-functionally with marketing teams. Ideal candidates have 7+ years of experience...
- ...we’re building the category-defining AI agentic operating platform that healthcare teams rely on to operate smarter, faster, and... ...team. About the role We are hiring a Senior Software Engineer to own the data platform that powers Plenful’s automation engine. You will...Work at officeFlexible hours2 days per week
$142.6k - $176k
Octave is hiring a Sr. Data Engineer in San Francisco to evolve their data platform for AI and ML applications. The role requires extensive experience in data engineering and familiarity with cloud platforms like AWS and GCP. Responsibilities include designing scalable...- ...About the Role We're looking for a Senior Staff Data Engineer to be the technical backbone of our Data & ML Platform team - the foundation powering analytics, product experiences, and machine learning across Hinge Health. This is a high-ownership IC role for someone...Work at officeLocal areaImmediate startRemote workWorldwide3 days per week
$120k - $170k
...Backend Software Engineer — Data Platform & AI Data Products San Francisco About the Role You'll join the Data Platform team, responsible for building the backend services and "data products" that power how data moves through the company. We create the core platform...Full timeInternship- ...intelligence - the inference services that serve LLMs at scale and the data pipelines that feed them. One week you're hunting a tail-... ...pager that keeps you honest about both. Researchers and ML engineers will hand you workloads that barely run; you'll hand them back...Flexible hours
$175k - $225k
...She will pick the best candidates from Jack's network The next step is to speak to Jack. Job Title: Founding Engineer - Backend & Data Platform Salary: $175,000 - $225,000 + Equity Company Description: Greenline - Fast-growing fintech startup Job Description...Self employmentRemote work$191k - $225k
Nerdleveltech is looking for a highly skilled individual to join their team in San Francisco. This role focuses on building and managing big data infrastructure, emphasizing strong programming skills, particularly in Java and Scala. You will have the opportunity to contribute...- ...startup in neurotechnology is seeking a Senior Machine Learning Infrastructure Engineer to design and scale critical infrastructure powering ML applications. This role involves creating robust data pipelines and optimizing modeling processes, essential for developing...
$130k - $200k
...enterprises. Since 2018, we've been pioneering data-centric approaches that are fundamental... ...AI development: # Enterprise Platform & Tools : Advanced annotation tools, workflow... ...We're looking for a Full-Stack AI Engineer to join our team, where you'll build the...Work at officeFlexible hours3 days per week$199k
...About the Role As an Engineering Manager on Chime's Data Platform, leading the Data Storage team, you will own the group that manages Chime's online and analytical data stores and low-latency metric serving layer. Your team will be responsible for building and operating...Full timeWork at officeLocal areaRemote workNight shift- ...the only unified payments and financial platform for global businesses. Powered by our unique... ...team is at the heart of our company's data and AI strategy. We are building the... ...your role will also include scaling the engineering teams, defining new roles, and establishing...Worldwide
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Distinguished Engineer, Data Platform. Be the first to apply!
Related searches
- staff data engineer San Francisco, CA
- data engineering intern summer San Francisco, CA
- senior data integration developer San Francisco, CA
- data engineer graduate San Francisco, CA
- data engineer contract San Francisco, CA
- data science developer San Francisco, CA
- senior data center engineer San Francisco, CA
- software data engineer San Francisco, CA
- hadoop big data developer San Francisco, CA
- data developer San Francisco, CA


