Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Data Engineer, Knowledge Graphs

Mithrl

ABOUT MITHRL We imagine a world where new medicines reach patients in months, not years, and where scientific breakthroughs happen at the speed of thought. Mithrl is building the world’s first commercially available AI Co-Scientist. It is a discovery engine that transforms messy biological data into insights in minutes. Scientists ask questions in natural language, and Mithrl responds with analysis, novel targets, hypotheses, and patent-ready reports. Our traction speaks for itself: 12X year-over-year revenue growth Trusted by leading biotechs and big pharma across three continents Driving real breakthroughs from target discovery to patient outcomes. ABOUT THE ROLE We are hiring a Data Engineer, Knowledge Graphs to build the infrastructure that powers Mithrl’s biological knowledge layer. You will partner closely with the Data Scientist, Knowledge Graphs to take curated knowledge sources and transform them into scalable, reliable, production ready systems that serve the entire platform. Your work includes building ETL pipelines for large biological datasets, designing schemas and storage models for graph structured data, and creating the API surfaces that allow ML engineers, application teams, and the AI Co-Scientist to query and use the knowledge graph efficiently. You will also own the reliability, performance, and versioning of knowledge graph infrastructure across releases. This role is the bridge between biological knowledge ingestion and the high performance engineering systems that use it. If you enjoy working on data modeling, schema design, graph storage, ETL, and scalable infrastructure, this is an opportunity to have deep impact on the intelligence layer of Mithrl. WHAT YOU WILL DO Build and maintain ETL pipelines for large public biological datasets and curated knowledge sources Design, implement, and evolve schemas and storage models for graph structured biological data Create efficient APIs and query surfaces that allow internal teams and AI systems to retrieve nodes, relationships, pathways, annotations, and graph analytics Partner closely with the Data Scientists to operationalize curated relationships, harmonized variable IDs, metadata standards, and ontology mappings Build data models that support multi tenant access, versioning, and reproducibility across releases Implement scalable storage and indexing strategies for high volume graph data Maintain data quality, validate data integrity, and build monitoring around ingestion and usage Work with ML engineers and application teams to ensure the knowledge graph infrastructure supports downstream reasoning, analysis, and discovery applications Support data warehousing, documentation, and API reliability Ensure performance, reliability, and uptime for knowledge graph services WHAT YOU BRING Required Qualifications Strong experience as a data engineer or backend engineer working with data intensive systems Experience building ETL or ELT pipelines for large structured or semi structured datasets Strong understanding of database design, schema modeling, and data architecture Experience with graph data models or willingness to learn graph storage concepts Proficiency in Python or similar languages for data engineering Experience designing and maintaining APIs for data access Understanding of versioning, provenance, validation, and reproducibility in data systems Experience with cloud infrastructure and modern data stack tools Strong communication skills and ability to work closely with scientific and engineering teams Nice to Have Experience with graph databases or graph query languages Experience with biological or chemical data sources Familiarity with ontologies, controlled vocabularies, and metadata standards Experience with data warehousing and analytical storage formats Previous work in a tech bio company or scientific platform environment WHAT YOU WILL LOVE AT MITHRL You will build the core infrastructure that makes the biological knowledge graph fast, reliable, and usable Team: Join a tight-knit, talent-dense team of engineers, scientists, and builders Culture: We value consistency, clarity, and hard work. We solve hard problems through focused daily execution Speed: We ship fast (2x/week) and improve continuously based on real user feedback Location: Beautiful SF office with a high-energy, in-person culture Benefits: Comprehensive PPO health coverage through Anthem (medical, dental, and vision) + 401(k) with top-tier plans We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. #J-18808-Ljbffr

Vacancy posted 5 hours ago
Similar jobs that could be interesting for youBased on the Data Engineer, Knowledge Graphs in San Francisco, CA vacancy
  •  ...Mithrl is seeking a Data Engineer, Knowledge Graphs to build the infrastructure for their biological knowledge layer. In this role, you will partner closely with data scientists to create scalable ETL pipelines and efficient APIs for data access. Your work will have significant... 
    Suggested
    Work at office

    Mithrl

    San Francisco, CA
    4 hours ago
  •  ...commercially available AI Co-Scientist. It is a discovery engine that transforms messy biological data into insights in minutes. Scientists ask questions...  ...ABOUT THE ROLE We are hiring a Data Scientist, Knowledge Graphs to build and scale the biological knowledge layer... 
    Suggested
    Work at office

    Mithrl

    San Francisco, CA
    3 days ago
  •  ...time, at scale, across every screen. Our data exists with the consent of over a...  ...a mid-level Data Scientist on Samba's Knowledge Graph & Identity team in Warsaw, you will own...  ...work closely with peers, product, and engineering, and play an active role in mentoring junior... 
    Suggested

    Samba TV

    San Francisco, CA
    3 hours ago
  • Onyx is seeking an AI/ML engineer based in San Francisco, CA, to enhance its knowledge layer on top of LLMs. You will evaluate LLM knowledge graphs and improve user experience through innovative features. The ideal candidate has over 3 years of experience in AI/ML, strong... 
    Suggested

    Onyx

    San Francisco, CA
    5 days ago
  • A leading biotech company in San Francisco is looking for a Data Engineer with a focus on Knowledge Graphs. The successful candidate will build and maintain ETL pipelines, design schemas for biological data, and create APIs that support data access for internal teams and... 
    Suggested

    Mithrl

    San Francisco, CA
    1 day ago
  •  ...If that sounds like you, let's build what's next. Senior Data Engineer Hiring Location: San Francisco What you'll do Part 1...  ...financial industries, payment systems, or fintech platforms. Knowledge of data governance practices and regulatory requirements in... 
    Work at office
    Worldwide

    Airwallex

    San Francisco, CA
    2 days ago
  • A technology startup in San Francisco is seeking a Founding AI Engineer to develop an AI-powered knowledge platform. You will participate in product development from architecture to deployment, optimize systems, and implement backend services. Candidates should have an... 
    Flexible hours

    Falconer

    San Francisco, CA
    3 days ago
  • $166.5k - $266.2k

     ...scientists to petabyte-scale data through natural language interfaces...  .... As a Scientific Data Engineer, you will close that gap. You...  ...annotations for text-to-SQL Knowledge of data governance practices in...  ...Experience with knowledge graph technologies (Neo4j, Amazon Neptune... 
    Full time
    Flexible hours

    Eli Lilly

    San Francisco, CA
    9 hours ago
  • $148k - $185k

     ...Data Engineer III Los Angeles, California, United States; San Francisco, CA, United States...  ...PostgreSQL, MySQL), NoSQL (DynamoDB) and Graph Database (Neo4j). Collaborate with service...  ...(RAG) or Graph RAG architectures. Knowledge of fine-tuning and optimizing LLMs for... 
    Flexible hours

    Crunchyroll

    San Francisco, CA
    21 hours ago
  • $140k - $170k

     ...About the Role The Big Data R&D team is responsible for building the core identity graph and entity-resolution capabilities that...  ...senior data scientists and engineers while developing your skills in...  ...experience is a plus. Working knowledge of supervised/unsupervised ML... 

    Socure Inc

    San Francisco, CA
    1 day ago
  • $147.4k - $272.1k

     ...Senior Data Engineer Work Locations (3) Submit Resume Imagine what you could do here. At Apple,...  ...reporting, and machine learning use cases ~ Knowledge engineering expertise including semantic models and knowledge graphs ~ Experience working with cloud compute... 
    Relocation

    Apple

    San Francisco, CA
    2 days ago
  • Data/ETL Engineer (Founding Team) Location: San Francisco Bay Area Type: Full-Time Compensation: Competitive salary + early-stage...  ...backend systems, data pipelines, connector frameworks, and graph-based knowledge models that fuel agentic applications. If you\u2019ve... 
    Full time

    Fabrion

    San Francisco, CA
    2 days ago
  • $172.5k - $260.1k

     ...Job Category Software Engineering Job Details About Salesforce...  ...Salesforce. The Enterprise Data & AI Solutions group is the...  ...Governance & Oversight: Use domain knowledge to ensure deployed tools are...  .... Semantic layer. Knowledge Graphs. Cloud Infrastructure:... 
    Shift work

    Salesforce.Com Inc

    San Francisco, CA
    9 hours ago
  •  ...approaches to maximizing the potential of data in AI models Defining creative...  ...field of study Strong ML research and engineering utilizing established and emerging NLP...  ...reinforcement learning Experience with knowledge graphs, knowledge bases and ontologies... 

    NovumTech Partners

    San Francisco, CA
    2 days ago
  • Description The Enterprise Data & AI Solutions group is the organization...  ...to build the autonomous engines that power executive decision‑...  ...& Oversight: Use domain knowledge to ensure deployed tools are well...  ...Architectures. Semantic layer. Knowledge Graphs. Cloud Infrastructure:... 
    Shift work

    B Capital

    San Francisco, CA
    1 day ago
  • $197.3k - $313.7k

     ...efforts. Job Category Data Job Details About...  ...Salesforce orgs, Informatica MDM, and graph databases. A key success...  ...workloads, including feature engineering for ML models and real-time scoring...  .... ~ Expert-level knowledge of dimensional modeling (Star... 
    Work at office

    Salesforce

    San Francisco, CA
    3 days ago
  • A leading technology firm in San Francisco is seeking an experienced Data/ETL Engineer to join the founding team. You will be responsible for building scalable data ingestion pipelines and developing frameworks to manage enterprise data effectively. The ideal candidate... 

    Fabrion

    San Francisco, CA
    2 days ago
  • Airwallex- is seeking a Senior Data Engineer in San Francisco to design robust data models and manage ETL pipelines. The role requires extensive experience in SQL, data engineering tools, and cloud platforms. Candidates should hold a Bachelor's degree and have at least... 

    Airwallex-

    San Francisco, CA
    1 day ago
  • $120k - $145k

     ...AI Operating System unifies data, insights, and workflows into...  ...Powered by the Gong Revenue Graph, AI-powered intelligence, specialized...  .... Senior Analyst, Analytics engineering , you’ll enable our analytics...  ...field Advanced SQL knowledge (experience with dbt is a plus... 
    Remote work
    Work from home
    Flexible hours

    Gong

    San Francisco, CA
    2 days ago
  • $240k - $275k

    Staff Analytics Engineer — Data Warehouse About the Role Together AI is building high-performance...  ..., SLA alerting, and clean dependency graphs. Work in our Cosmos (dbt + Airflow)...  ...by experience, skills, and job‑related knowledge. Together AI is an Equal Opportunity Employer... 
    Full time

    Together AI

    San Francisco, CA
    2 days ago
  • $176k - $179.5k

    Technology & Digital Platform Data Engineer - US Defense Public Sector Job ID: 106488 Boston Chicago New York City...  ...expertise in Python development, including data workflows, knowledge graphs, and/or generative AI will serve you well. You would be working... 
    Hourly pay
    Apprenticeship
    Work at office
    Easy work

    McKinsey & Company

    San Francisco, CA
    9 hours ago
  • $170k - $210k

     ...the role We are seeking an experienced data engineer who has built enterprise-grade, cloud-native...  ...powered features. You will need to have knowledge and working experience with complex...  ...specifically PostgreSQL). Experience with graph and vector databases is a big plus. Generative... 
    Temporary work
    Work experience placement
    Remote work
    Flexible hours

    Sleuth Insights

    San Francisco, CA
    2 days ago
  •  ...years of experience in backend engineering, specializing in...  ...MySQL), NoSQL (DynamoDB) and Graph Database (Neo4j) , Excellent...  ...architectures , (Desirable) Knowledge of fine-tuning and optimizing...  ...implementing, and optimizing data services, ensuring that our data... 

    Crunchyroll

    San Francisco, CA
    3 days ago
  • $192k - $344.85k

     ...global, high-throughput Product Data platform that powers design,...  ...the next generation of knowledge retrieval systems, this may be...  ...three industries - Architecture, Engineering and Construction (AEC),...  ...search, GraphRAG and Context Graphs. Reporting to the VP... 
    For contractors
    Remote work

    Autodesk

    San Francisco, CA
    2 days ago
  •  ...based on user interactions. Visualize data for business teams. Develop and...  ...product managers. Balance between hands-on engineering (50%) and team leadership (50%)....  ...understand larger picture. ~ Sound knowledge to understand Architectural Patterns, best... 
    Local area

    My3Tech Inc

    San Francisco, CA
    1 day ago
  •  ...Lead Data Engineer RADIUMONE IS A GLOBAL PROGRAMMATIC AD BUYING PLATFORM RadiumOne is the 6th largest web property in the U.S. according...  ...as desired Qualifications Leadership experience Knowledge and expertise of No SQL and Big Data technologies Strong... 

    Stepping Up Solutions

    San Francisco, CA
    2 days ago
  •  ...Description POSITON DESCRIPTION We are seeking a  Lead Data Engineer to architect, build, and lead the development of scalable,...  ...experience with distributed data platforms, and strong knowledge of data architecture, governance, and performance optimization... 

    Q-Cells

    San Francisco, CA
    2 days ago
  •  ...other BI tools • Write SQL for processing raw data, kafka ingestions, adf pipelines, data validation and QA • Knowledge working with APIs to collect or ingest data...  ...technologies Work with product and engineering team to understand requirements, evaluate new... 

    BayOne Solutions

    San Francisco, CA
    2 days ago
  •  ...Lead Data Engineer With MarTech We are seeking an experienced Lead Data Engineer with strong MarTech expertise to lead the design and...  ...building event-driven architectures and distributed systems. ~ Knowledge of AI-assisted development tools, Vibe Coding, or generative... 
    Contract work
    2 days per week

    Staffing the Universe

    San Francisco, CA
    4 days ago
  • $191.52k - $212.8k

     ...LE POSTE VOTRE PROFIL Lead Data Engineer Enterprise Reporting & Analytics Publiée le 05.05.2026 Sephora Tech Réfé...  ...and frameworks (LlamaIndex, LangChain Retrieval). Working knowledge of the Model Context Protocol (MCP) for connecting AI models... 
    Permanent employment
    Full time

    LVMH

    San Francisco, CA
    9 hours ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Data Engineer, Knowledge Graphs. Be the first to apply!