Data Engineer, Knowledge Graphs

Mithrl

ABOUT MITHRL

We imagine a world where new medicines reach patients in months, not years, and where scientific breakthroughs happen at the speed of thought.

Mithrl is building the world's first commercially available AI Co-Scientist. It is a discovery engine that transforms messy biological data into insights in minutes. Scientists ask questions in natural language, and Mithrl responds with analysis, novel targets, hypotheses, and patent-ready reports.

Our traction speaks for itself:
  • 12X year-over-year revenue growth
  • Trusted by leading biotechs and big pharma across three continents
  • Driving real breakthroughs from target discovery to patient outcomes

ABOUT THE ROLE

We are hiring a Data Engineer, Knowledge Graphs to build the infrastructure that powers Mithrl's biological knowledge layer. You will partner closely with the Data Scientist, Knowledge Graphs to take curated knowledge sources and transform them into scalable, reliable, production-ready systems that serve the entire platform.

Your work includes building ETL pipelines for large biological datasets, designing schemas and storage models for graph-structured data, and creating the API surfaces that allow ML engineers, application teams, and the AI Co-Scientist to query and use the knowledge graph efficiently. You will also own the reliability, performance, and versioning of knowledge graph infrastructure across releases.

This role is the bridge between biological knowledge ingestion and the high-performance engineering systems that use it. If you enjoy working on data modeling, schema design, graph storage, ETL, and scalable infrastructure, this is an opportunity to have a deep impact on the intelligence layer of Mithrl.

WHAT YOU WILL DO
  • Build and maintain ETL pipelines for large public biological datasets and curated knowledge sources
  • Design, implement, and evolve schemas and storage models for graph-structured biological data
  • Create efficient APIs and query surfaces that allow internal teams and AI systems to retrieve nodes, relationships, pathways, annotations, and graph analytics
  • Partner closely with the Data Scientists to operationalize curated relationships, harmonized variable IDs, metadata standards, and ontology mappings
  • Build data models that support multi-tenant access, versioning, and reproducibility across releases
  • Implement scalable storage and indexing strategies for high-volume graph data
  • Maintain data quality, validate data integrity, and build monitoring around ingestion and usage
  • Work with ML engineers and application teams to ensure the knowledge graph infrastructure supports downstream reasoning, analysis, and discovery applications
  • Support data warehousing, documentation, and API reliability
  • Ensure performance, reliability, and uptime for knowledge graph services

WHAT YOU BRING

Required Qualifications
  • Strong experience as a data engineer or backend engineer working with data-intensive systems
  • Experience building ETL or ELT pipelines for large structured or semi-structured datasets
  • Strong understanding of database design, schema modeling, and data architecture
  • Experience with graph data models or willingness to learn graph storage concepts
  • Proficiency in Python or similar languages for data engineering
  • Experience designing and maintaining APIs for data access
  • Understanding of versioning, provenance, validation, and reproducibility in data systems
  • Experience with cloud infrastructure and modern data stack tools
  • Strong communication skills and ability to work closely with scientific and engineering teams

Nice to Have
  • Experience with graph databases or graph query languages
  • Experience with biological or chemical data sources
  • Familiarity with ontologies, controlled vocabularies, and metadata standards
  • Experience with data warehousing and analytical storage formats
  • Previous work at a techbio company or in a scientific platform environment

WHAT YOU WILL LOVE AT MITHRL
  • You will build the core infrastructure that makes the biological knowledge graph fast, reliable, and usable
  • Team: Join a tight-knit, talent-dense team of engineers, scientists, and builders
  • Culture: We value consistency, clarity, and hard work. We solve hard problems through focused daily execution
  • Speed: We ship fast (2x/week) and improve continuously based on real user feedback
  • Location: Beautiful SF office with a high-energy, in-person culture
  • Benefits: Comprehensive PPO health coverage through Anthem (medical, dental, and vision) + 401(k) with top-tier plans

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.
Vacancy posted 4 days ago