Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Data Engineer (Founding Team)

Fabrion

Data/ETL Engineer (Founding Team) Location: San Francisco Bay Area Type: Full-Time Compensation: Competitive salary + early-stage equity Backed by 8VC, we're building a world-class team to tackle one of the industry’s most critical infrastructure problems. About the Role We’re building a multi-tenant, AI-native platform where enterprise data becomes actionable through semantic enrichment, intelligent agents, and governed interoperability. At the heart of this architecture lies our Data Fabric — an intelligent, governed layer that turns fragmented and siloed data into a connected ontology ready for model training, vector search, and insight-to-action workflows. We're looking for engineers who enjoy hard data problems at scale : messy unstructured data, schema drift, multi-source joins, security models, and AI-ready semantic enrichment. You’ll build the backend systems, data pipelines, connector frameworks, and graph-based knowledge models that fuel agentic applications. If you've worked on streaming unstructured pipelines, built connectors into ugly legacy systems, or mapped knowledge graphs that scale — this role will feel like home. Responsibilities Build highly reliable, scalable data ingestion and transformation pipelines across structured, semi-structured, and unstructured data sources Develop and maintain a connector framework for ingesting from enterprise systems (ERPs, PLMs, CRMs, legacy data stores, email, Excel, docs, etc.) Design and maintain the data fabric layer — including a knowledge graph (Neo4j or Puppygraph) enriched with ontologies, metadata, and relationships Normalize and vectorize data for downstream AI/LLM workflows — enabling retrieval-augmented generation (RAG), summarization, and alerting Create and manage data contracts, access layers, lineage, and governance mechanisms Build and expose secure APIs for downstream services, agents, and users to query enriched semantic data Collaborate with ML/LLM teams to feed high-quality enterprise data into model training and tuning pipelines What We’re Looking For Core Experience: 5+ years building large-scale data infrastructure in production environments Deep experience with ingestion frameworks (Kafka, Airbyte, Meltano, Fivetran) and data pipeline orchestration (Airflow, Dagster, Prefect) Comfortable processing unstructured data formats: PDFs, Excel, emails, logs, CSVs, web APIs Experience working with columnar stores, object storage, and lakehouse formats (Iceberg, Delta, Parquet) Strong background in knowledge graphs or semantic modeling (e.g. Neo4j, RDF, Gremlin, Puppygraph) Familiarity with GraphQL, RESTful APIs, and designing developer-friendly data access layers Experience implementing data governance : RBAC, ABAC, data contracts, lineage, data quality checks Mindset & Culture Fit: You’re a system thinker: you want to model the real world, not just process it Comfortable navigating ambiguous data models and building from scratch Passionate about enabling AI systems with real-world, messy enterprise data Pragmatic about scalability, observability, and schema evolution Value autonomy, high trust, and meaningful ownership over infrastructure Bonus Skills Prior work with vector DBs (e.g. Weaviate, Qdrant, Pinecone) and embedding pipelines Experience building or contributing to enterprise connector ecosystems Knowledge of ontology versioning , graph diffing , or semantic schema alignment Familiarity with data fabric patterns (e.g. Palantir Ontology, Linked Data, W3C standards) Familiar with fine-tuning LLMs or enabling RAG pipelines using enterprise knowledge Experience enforcing data access policy with tools like OPA , Keycloak , Snowflake row-level security Why This Role Matters Agents are only as smart as the data they operate on. This role builds the foundation — the semantic, governed, connected substrate — that makes autonomous decision-making and agent action possible. From factory ERP records to geopolitical news alerts, the data fabric unifies it all. If you're excited to tame complexity, unify chaos, and power intelligent systems with trusted data — we’d love to hear from you. #J-18808-Ljbffr

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Data Engineer (Founding Team) in San Francisco, CA vacancy
  • $180k - $270k

     ...Staff Data Engineer (Founding Team) Location: San Francisco (In-office) Compensation: $180,000 - $270,000 + meaningful equity Start Date: Immediate About the Company We have partnered with an elite team in SF building an AI-native platform that acts... 
    Suggested
    Work at office
    Immediate start

    Xpertalent

    San Francisco, CA
    4 days ago
  • $150k - $210k

     ...living with severe, complex diseases. Our data platform is used by drug developers and...  .... We have built a lean, all-star team to help us bring our vision to life, and...  ...About the role We're looking for our Founding Data Engineer who's excited to help shape the future... 
    Suggested
    Work experience placement
    Work at office
    Relocation
    Flexible hours
    3 days per week

    Probably Genetic

    San Francisco, CA
    1 day ago
  • ML/AI Research Engineer — Agentic AI Lab (Founding Team) Location: San Francisco Bay Area Type: Full-Time Compensation: Competitive salary + meaningful equity...  ...intelligence layer that sits on top of our enterprise data fabric. This isn’t a prompt engineer role. It’s full‑... 
    Suggested
    Full time

    Fabrion

    San Francisco, CA
    4 days ago
  • $145k - $215k

     ...Founding Data Engineer Salary: $145,000 - $215,000 + Equity Company Description: VC-backed healthtech AI startup Job Description: You...  ...clinical diagnostic workflows. Join a high-performance founding team led by experienced entrepreneurs with a $100M+ exit. Gain... 
    Suggested

    Jack and Jill AI

    San Francisco, CA
    5 days ago
  • A leading technology firm in San Francisco is seeking an experienced Data/ETL Engineer to join the founding team. You will be responsible for building scalable data ingestion pipelines and developing frameworks to manage enterprise data effectively. The ideal candidate... 
    Suggested

    Fabrion

    San Francisco, CA
    5 days ago
  • Success Matcher Recruitment is seeking a founding analytics engineer to build a robust data layer from scratch. This role offers total ownership over analytics...  ...collaborating closely with product and engineering teams. The ideal candidate should have at least 4 years of... 

    Success Matcher Recruitment

    San Francisco, CA
    1 day ago
  • $160k - $230k

    Open role Founding Data Infrastructure Engineer San Francisco (On-site) About Us Constellation is creating the AI-human translation layer that ensures...  ...of foundation models and we're seeking the founding team of engineers to build the infrastructure that makes this... 
    Work at office
    Local area
    Relocation package

    REACH INDUSTRIES

    San Francisco, CA
    4 days ago
  • $73.8k - $218.8k

     ..., and we're scaling fast. This is a founding team, these first hires will shape the culture...  ...conversation. You might come from engineering, consulting, product, or pre-sales — what...  ...information on how we process your data during the Recruiting and Hiring process... 
    Work experience placement
    Live in
    Work at office
    Local area

    Accenture

    San Francisco, CA
    6 days ago
  • About the Role ML Ops Engineer — Agentic AI Lab (Founding Team) — Location: San Francisco Bay Area — Type: Full-Time — Compensation: Competitive salary +...  ...and observability pipelines that power our agents and AI data fabric. You’ll work across compute orchestration, GPU... 
    Full time

    Fabrion

    San Francisco, CA
    5 days ago
  •  ...is an A-player. You are: A strong SQL and data modeling expert who cares deeply about...  ...clearly explain complex metric logic to engineering, product, ML, growth, and finance. Motivated...  ...conflicting logic across dashboards and teams. Ensure executive- and board-level reporting... 
    Full time
    Local area
    2 days per week

    Menlo Ventures

    San Francisco, CA
    3 days ago
  •  ...future. Why Join OpenArt Own the entire data foundation of a fast-scaling AI company...  .... About the Role We’re looking for a Founding Data Engineer to build and own OpenArt’s core data platform...  ...at source Support downstream teams (analytics, DS) by providing clean, well... 
    Remote work
    Worldwide
    Visa sponsorship

    Embedding VC

    San Francisco, CA
    3 days ago
  •  ...Francisco is seeking a Forward Deployed Engineer to connect customer systems with their grocery...  ...while collaborating with various teams to enhance workflows. Ideal candidates will...  ...solutions engineering and a strong background in data handling. Join us to tackle challenges in... 

    Vori, Inc

    San Francisco, CA
    4 days ago
  •  ...tech startup in the grocery sector is seeking a Forward Deployed Data Engineer to connect customer systems with their platform. The ideal...  ...responsible for managing data integrations, collaborating with teams, and troubleshooting complex customer workflows. Expect a challenging... 

    Vori

    San Francisco, CA
    4 days ago
  • $140k - $180k

     ...Our client, an early-stage AI Startup, is hiring a Founding Engineer to join their team in San Francisco. The successful candidate will play a key role...  ...semantic foundation that allows autonomous agents to interpret data and take action across sources ranging from ERP systems... 

    Alldus International Consulting Ltd

    San Francisco, CA
    2 days ago
  • $150k - $210k

    Probably Genetic in San Francisco is looking for a Founding Data Engineer to develop data infrastructure that supports both internal insights and customer solutions. You will work collaboratively with the Head of Engineering and Head of Product to create scalable data... 
    3 days per week

    Mundi

    San Francisco, CA
    3 days ago
  •  ...leading AI technology firm in San Francisco is seeking an AI/Data Systems Engineer to build critical data pipelines that transform unstructured...  ...The company offers comprehensive health benefits and unique team-building retreats in appealing destinations. #J-18808-Ljbffr... 

    Mercator Inc

    San Francisco, CA
    3 days ago
  •  ...Frontend Engineer (with familiarity of full stack) (Founding Team) Location: San Francisco Bay Area Type: Full-Time Compensation: Competitive salary + meaningful...  ...infrastructure problems. Overview We’re building an AI-native data platform designed to simplify the most complex... 
    Full time
    Temporary work

    Fabrion

    San Francisco, CA
    3 days ago
  •  ...healthcare industry. Role You'll be our first dedicated Data hire — owning everything from pipelines to dashboards to...  ...RevOps, Marketing, Finance and CS, collaborate closely with Engineering and Ops Analyst teams; while building the data foundation of our organization... 
    Work at office

    Assort Health Inc.

    San Francisco, CA
    19 hours ago
  • $160k - $230k

    REACH INDUSTRIES is seeking a Founding Data Infrastructure Engineer in San Francisco. You will build the backend for data collection efforts, designing scalable architectures for multimodal data. Ideal candidates have a strong background in Python, Rust, or Go, and experience... 
    Relocation package

    REACH INDUSTRIES

    San Francisco, CA
    4 days ago
  • A healthcare technology company is seeking a Data Engineer to take ownership of analytics pipelines and dashboards. The ideal candidate has...  .... This position involves working closely with multiple teams to enhance our data systems. Competitive compensation, ongoing... 

    Assort Health Inc.

    San Francisco, CA
    5 days ago
  • Voiceflow in San Francisco is looking for a Senior Software Engineer to join our ambitious team. As a founding engineer, you will work on distributed systems and...  ...a desire to innovate in building a next generation data platform. The ideal candidate has experience in async... 
    Work at office

    Voiceflow

    San Francisco, CA
    2 days ago
  • $150k - $300k

    Compound AI seeks exceptional backend engineers in San Francisco to build the data pipelines that drive its AI agents. You will work directly with the founding team to integrate diverse financial data sources, ensuring precise query performance. The ideal candidate has... 

    Compound AI

    San Francisco, CA
    1 day ago
  • $148k - $185k

     ...About Crunchyroll Founded by fans, Crunchyroll delivers the art and...  ...content we all love. Join our team, and help us shape the future of anime! About The Team The Data Services team is focused on building...  ...around the world. The Data Engineering team provides seamless help to... 
    Flexible hours

    Crunchyroll

    San Francisco, CA
    2 days ago
  • $190k - $250k

     ...that generates alpha and drives upside. Founded in 2020 by George Sivulka and backed by...  ...leadership. The Role We are seeking our first Data Engineer, someone who can refine our data...  ...closely with both engineering and business teams to ensure every data need is met. If you... 

    Hebbia

    San Francisco, CA
    2 days ago
  •  ...bigger—and moving faster. We’re a family-founded company on a mission to create the world...  ...lives along the way. The Role As a Data Engineer at Air Apps, you will be responsible for...  ...driven environments with cross-functional teams. What benefits are we offering? Apple hardware... 
    Temporary work
    Worldwide

    Air Apps

    San Francisco, CA
    3 days ago
  • $195k - $230k

     ...Whatnot updates on our news and engineering blogs and join us as we enable...  ...commerce. Role At Whatnot data engineers build foundational data...  ..., and the Analytics Platform team. You’ll make key architectural...  ...who thrives at Whatnot? We’ve found that embodying a low ego, growth... 
    Full time
    Work at office
    Local area
    Work from home
    Home office

    Whatnot

    San Francisco, CA
    2 days ago
  •  ...Devin, the first AI software engineer, and Windsurf, the AI-native IDE...  ...problems and empower teams to strive for more ambitious goals...  ...small and talent-dense. Among our founding team, we have world‑class competitive...  ...Role We’re hiring a technical Data Engineer to own our full data... 

    Cognition Corp

    San Francisco, CA
    2 days ago
  •  ...About Abridge Abridge was founded in 2018 with the mission of powering...  ...systems. We are a growing team of practicing MDs, AI scientists...  ...creatives, technologists, and engineers working together to empower people...  ...for a highly motivated Data Engineer to join our growing US... 
    Hourly pay
    Full time
    Flexible hours

    Abridge

    San Francisco, CA
    3 days ago
  • $170k - $220k

     ...journey. Please note: We strongly encourage team members to be in the office 1-2 days per...  ...formal in-office requirement. Analytics Engineers at Rocket Money further our mission by...  ...our users and products through the lens of data. We build data models that uncover how... 
    Temporary work
    Work at office
    2 days per week
    1 day per week

    Rocket Money

    San Francisco, CA
    3 days ago
  • A leading AI company is looking for a Founding Data Engineer to build and own the core data platform that supports product and leadership decision-making. This role involves designing data pipelines, maintaining data warehouse architecture, and ensuring data integrity... 

    Embedding VC

    San Francisco, CA
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Data Engineer (Founding Team). Be the first to apply!