Data Engineer (Founding Team)
Fabrion
Data/ETL Engineer (Founding Team)
Location: San Francisco Bay Area
Type: Full-Time
Compensation: Competitive salary + early-stage equity
Backed by 8VC, we're building a world-class team to tackle one of the industry's most critical infrastructure problems.
About the Role
We're building a multi-tenant, AI-native platform where enterprise data becomes actionable through semantic enrichment, intelligent agents, and governed interoperability. At the heart of this architecture lies our Data Fabric — an intelligent, governed layer that turns fragmented and siloed data into a connected ontology ready for model training, vector search, and insight-to-action workflows.
We're looking for engineers who enjoy hard data problems at scale: messy unstructured data, schema drift, multi-source joins, security models, and AI-ready semantic enrichment. You'll build the backend systems, data pipelines, connector frameworks, and graph-based knowledge models that fuel agentic applications.
If you've worked on streaming unstructured pipelines, built connectors into ugly legacy systems, or mapped knowledge graphs that scale — this role will feel like home.
Responsibilities
- Build highly reliable, scalable data ingestion and transformation pipelines across structured, semi-structured, and unstructured data sources
- Develop and maintain a connector framework for ingesting from enterprise systems (ERPs, PLMs, CRMs, legacy data stores, email, Excel, docs, etc.)
- Design and maintain the data fabric layer — including a knowledge graph (Neo4j or Puppygraph) enriched with ontologies, metadata, and relationships
- Normalize and vectorize data for downstream AI/LLM workflows — enabling retrieval-augmented generation (RAG), summarization, and alerting
- Create and manage data contracts, access layers, lineage, and governance mechanisms
- Build and expose secure APIs for downstream services, agents, and users to query enriched semantic data
- Collaborate with ML/LLM teams to feed high-quality enterprise data into model training and tuning pipelines
What We're Looking For
Core Experience:
- 5+ years building large-scale data infrastructure in production environments
- Deep experience with ingestion frameworks (Kafka, Airbyte, Meltano, Fivetran) and data pipeline orchestration (Airflow, Dagster, Prefect)
- Comfortable processing unstructured data formats: PDFs, Excel, emails, logs, CSVs, web APIs
- Experience working with columnar stores, object storage, and lakehouse formats (Iceberg, Delta, Parquet)
- Strong background in knowledge graphs or semantic modeling (e.g. Neo4j, RDF, Gremlin, Puppygraph)
- Familiarity with GraphQL, RESTful APIs, and designing developer-friendly data access layers
- Experience implementing data governance: RBAC, ABAC, data contracts, lineage, data quality checks
Mindset & Culture Fit:
- You're a system thinker: you want to model the real world, not just process it
- Comfortable navigating ambiguous data models and building from scratch
- Passionate about enabling AI systems with real-world, messy enterprise data
- Pragmatic about scalability, observability, and schema evolution
- Value autonomy, high trust, and meaningful ownership over infrastructure
Bonus Skills:
- Prior work with vector DBs (e.g. Weaviate, Qdrant, Pinecone) and embedding pipelines
- Experience building or contributing to enterprise connector ecosystems
- Knowledge of ontology versioning, graph diffing, or semantic schema alignment
- Familiarity with data fabric patterns (e.g. Palantir Ontology, Linked Data, W3C standards)
- Familiar with fine-tuning LLMs or enabling RAG pipelines using enterprise knowledge
- Experience enforcing data access policy with tools like OPA, Keycloak, Snowflake row-level security
Why This Role Matters
Agents are only as smart as the data they operate on. This role builds the foundation — the semantic, governed, connected substrate — that makes autonomous decision-making and agent action possible. From factory ERP records to geopolitical news alerts, the data fabric unifies it all.
If you're excited to tame complexity, unify chaos, and power intelligent systems with trusted data — we'd love to hear from you.
- ...ML/AI Research Engineer — Agentic AI Lab (Founding Team) Location: San Francisco Bay Area Type: Full-Time Compensation: Competitive salary + meaningful... ...intelligence layer that sits on top of our enterprise data fabric. This isn't a prompt engineer role. It's full-...SuggestedFull time
$180k - $270k
...Staff Data Engineer (Founding Team) Location: San Francisco (In-office) Compensation: $180,000 - $270,000 + meaningful equity Start Date: Immediate About the Company We have partnered with an elite team in SF building an AI-native platform that acts...SuggestedWork at officeImmediate start- ...Seeking Founding Data Scientists and Machine Learning Engineers Imagine Multiplying Your Impact You've unlocked major wins in your career - you've shipped... ...the people who rely on it. You can help product teams iterate faster, delight users, and grow revenue, all...Suggested
$145k - $215k
...Founding Data Engineer Salary: $145,000 - $215,000 + Equity Company Description: VC-backed healthtech AI startup Job Description: You... ...clinical diagnostic workflows. Join a high-performance founding team led by experienced entrepreneurs with a $100M+ exit. Gain...Suggested- A leading technology firm in San Francisco is seeking an experienced Data/ETL Engineer to join the founding team. You will be responsible for building scalable data ingestion pipelines and developing frameworks to manage enterprise data effectively. The ideal candidate...Suggested
- ...ML Ops Engineer — Agentic AI Lab (Founding Team) Location: San Francisco Bay Area Type: Full-Time Compensation: Competitive salary + meaningful... ...and observability pipelines that power our agents and AI data fabric. You'll work across compute orchestration, GPU...Full time
- ...world environments into clear, actionable data. While most AI companies focus on... ...consequences are real. The Role As a Founding Data Engineer (AI Infrastructure) , you will build the... ...You’ll work directly with the founding team to ship zero-to-one systems where reliability...
$140k - $210k
...Location Type Hybrid Department Engineering Compensation $140K - $210K •... ...where you are. We’re looking for a Founding Analytics Engineer to own Ambrook's entire data function. You'll take an... ...trustworthy foundation that enables every team and every agent to pull the...Full timeRemote work- A leading AI infrastructure company is seeking a Founding Data Engineer to design and build the foundational data architecture for its innovative... .... This role involves collaborating with the founding team to create robust data pipelines, ensuring data quality and reliability...
- ...Join OpenArt ~ Own the entire data foundation of a fast-scaling AI company... ...About the Role We're looking for a Founding Data Engineer to build and own OpenArt's core data... ...at source Support downstream teams (analytics, DS) by providing clean, well...Remote workWorldwideVisa sponsorship
$73.8k - $218.8k
..., and we're scaling fast. This is a founding team, these first hires will shape the culture... ...conversation. You might come from engineering, consulting, product, or pre-sales - what... ...information on how we process your data during the Recruiting and Hiring process...Work experience placementLive inWork at officeLocal area$158.81k - $198.49k
...Lead Scientific Data Engineer Berkeley Lab's (LBNL) Joint Genome Institute (JGI) has an opening for a Lead Scientific Data Engineer to join the Advanced Analysis Team! JGI has a long history of generating world-class genomic data to address pressing national energy...Full timeWork at officeRemote workRelocation package- ...-player. You are: A strong SQL and data modeling expert who cares deeply about metric... ...clearly explain complex metric logic to engineering, product, ML, growth, and finance.... ...conflicting logic across dashboards and teams. Ensure executive- and board-level reporting...Local area2 days per week
- ...tech startup in the grocery sector is seeking a Forward Deployed Data Engineer to connect customer systems with their platform. The ideal... ...responsible for managing data integrations, collaborating with teams, and troubleshooting complex customer workflows. Expect a challenging...
- ...Francisco is seeking a Forward Deployed Engineer to connect customer systems with their grocery... ...while collaborating with various teams to enhance workflows. Ideal candidates will... ...solutions engineering and a strong background in data handling. Join us to tackle challenges in...
- ...Shadow, we’re building the end-to-end data platform for crypto. Blockchain... ...Uniswap, and Flashbots. We're a small team who primarily works out of a beautiful... ...office space in NYC. We’re hiring a founding smart contract data engineer to join our world-class team. About the...Contract workSummer workWork at officeImmediate startFlexible hours
- A data-driven technology firm is seeking a Founding Analytics Engineer to take full ownership of the data function. You will transform the data warehouse and set up processes for data access across teams, aiming for a trustworthy data foundation. Ideal candidates will have...
$120k - $160k
...Founding Engineer For Airweave's Data And Infrastructure We're looking for a founding engineer to own Airweave's data and infrastructure layer, the... ...keeps it all running. You'll work closely with the product team, but your focus is on the foundation: making sure data...$200k - $250k
Founding Forward Deployed Data Scientist San Francisco, CA (4 days/week onsite) About the Role Our client is building an AI-native product engineer that helps teams understand what to build next. Instead of relying on instinct or fragmented analysis, their platform automatically...H1bRemote work- ...Frontend Engineer (with Familiarity of Full Stack) (Founding Team) Location: San Francisco Bay Area Type: Full-Time Compensation: Competitive salary + meaningful... .... About the Role We're building an AI-native data platform designed to simplify the most complex...Full timeTemporary work
$140k - $180k
Our client, an early-stage AI Startup, is hiring a Founding Engineer to join their team in San Francisco. The successful candidate will play a key role... ...semantic foundation that allows autonomous agents to interpret data and take action across sources ranging from ERP systems...$170k - $220k
...journey. Please note: We strongly encourage team members to be in the office 1-2 days per... ...formal in-office requirement. Analytics Engineers at Rocket Money further our mission by... ...our users and products through the lens of data. We build data models that uncover how...Temporary workWork at office2 days per week1 day per week- A leading AI company is looking for a Founding Data Engineer to build and own the core data platform that supports product and leadership decision-making. This role involves designing data pipelines, maintaining data warehouse architecture, and ensuring data integrity...
$175k - $225k
...network The next step is to speak to Jack. Job Title: Founding Engineer - Backend & Data Platform Salary: $175,000 - $225,000 + Equity... ...innovative AI-driven infrastructure. Receive a founding team title with an aggressive equity package and high architectural...Self employmentRemote work- ...leading AI technology firm in San Francisco is seeking an AI/Data Systems Engineer to build critical data pipelines that transform unstructured... ...The company offers comprehensive health benefits and unique team-building retreats in appealing destinations. #J-18808-Ljbffr...
$130k - $200k
...Software Engineer, Agent Delivery (Founding Team) Title of Role: Software Engineer, Agent Delivery (Founding Team) Location: San Francisco, onsite Company Stage of Funding: Series B - Logistics, AI, Enterprise, B2B Office Type: Onsite Salary: $130K-$...Work at office$200k - $250k
A health technology company in San Francisco is seeking a Data Engineer to design and scale their data infrastructure from scratch. The ideal... ...future of healthcare data while working alongside a talented team committed to delivering free, personalized healthcare for all....$99k - $149k
...themselves. We're looking to grow our teams with more people who share our... ...is to integrate data from a variety of sources into... ...Demonstrated experience in software engineering fundamentals and coding Salary... ...'s user and privacy policy found at , we also want to make you...Work experience placementLocal area- ...Devin, the first AI software engineer, and Windsurf, the AI-native IDE... ...problems and empower teams to strive for more ambitious goals... ...small and talent-dense. Among our founding team, we have world-class competitive... ...We're hiring a technical Data Engineer to own our full data...
- ...bigger-and moving faster. We're a family-founded company on a mission to create the world... ...along the way. The Role As a Data Engineer at Air Apps, you will be responsible for... ...driven environments with cross-functional teams. What benefits are we offering? ~...Temporary workWorldwide
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Data Engineer (Founding Team). Be the first to apply!
- director data engineering San Francisco, CA
- junior big data engineer San Francisco, CA
- data engineer graduate San Francisco, CA
- senior data engineer San Francisco, CA
- data platform engineer San Francisco, CA
- sr information security engineer San Francisco, CA
- senior data integration developer San Francisco, CA
- data developer San Francisco, CA
- data engineer San Francisco, CA
- data infrastructure engineer San Francisco, CA

