Advisor - Data Architect, Data Foundry
$151.5k - $222.2kEli Lilly
At Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare leader headquartered in Indianapolis, Indiana. Our employees around the world work to discover and bring life-changing medicines to those who need them, improve the understanding and management of disease, and give back to our communities through philanthropy and volunteerism. We give our best effort to our work, and we put people first. We're looking for people who are determined to make life better for people around the world.
Location: San Diego, CA; San Francisco, CA; Boston, MA; Louisville, CO; Indianapolis, INReports to: Lead, Data Architecture (R9), Architecture4Insight Overview Lilly Small Molecule Discovery is purpose-built to create molecules that make life better for people. Discovery Technology and Platforms (DTP) accelerates molecule discovery by building optimized foundational platforms, streamlining lab operations through advanced technologies and data connectivity, and investing in novel capabilities. Data Foundry is a multidisciplinary team within DTP that enables AI-native drug discovery through four integrated pillars: Architecture4Insight (data infrastructure and scientific software), Methods4Insight (analytical and computational methods), Automation & Scale4Insight (lab automation and agentic workflows), and Preparedness4Insight (data governance and readiness). These pillars empower every Lilly scientist to make optimal decisions by providing seamless access to data, insights, and AI-driven capabilities-serving both human scientists and autonomous AI agents. Position Summary We are seeking Data Architects at multiple levels to design and build the data infrastructure that makes AI-native drug discovery possible. You will create the schemas, ontologies, data models, knowledge graphs, and platform architectures that transform raw scientific data into machine-actionable, FAIR-compliant, insight-ready assets-serving both discovery scientists and autonomous AI agents. This role is the foundation of Architecture4Insight . Everything the software engineering team builds-pipelines, APIs, prototypes-depends on the data models and platform architecture this team designs. You will work with deep knowledge of scientific data (chemical, biological, HTE, automation-generated) to create custom-fit solutions, then partner with View email address on click.appcast.io to scale and maintain them. The role spans three focus areas depending on expertise: data modeling & ontologies , data platform & lakehouse architecture , and knowledge graph & specialized data systems . You will independently design schemas, select technologies, and make build-vs-buy recommendations for their domain. Responsibilities Data Modeling & Ontologies
- Design and implement data models, schemas, and ontologies for chemical, biological, and automation-generated data that serve discovery workflows across the portfolio.
- Define and maintain controlled vocabularies, metadata standards, and FAIR-compliant data frameworks in partnership with Preparedness4Insight.
- Implement semantic data standards (RDF, OWL, SPARQL) and ontology engineering practices to create interoperable, machine-readable scientific data.
- Design and implement data lakehouse architecture using modern platforms (Databricks, Snowflake, or equivalent), including data storage patterns, partitioning strategies, and query optimization.
- Build and optimize ETL/ELT pipelines using Spark, dbt, or similar tools to transform raw scientific data into analytical and ML-ready formats.
- Implement real-time and streaming data integration (Kafka, Kinesis, event-driven patterns) connecting LIMS, instruments, and lab automation systems to the data infrastructure.
- Design and implement knowledge graphs (Neo4j, Amazon Neptune, TigerGraph) that capture molecular, target, pathway, and experimental relationships across the discovery landscape.
- Architect specialized data solutions: array databases (TileDB) for genomics/imaging, document stores (MongoDB) for experimental records, and vector databases for embedding-based retrieval supporting ML and RAG workflows.
- Build query and traversal patterns that enable scientists and AI agents to ask relational questions across the entire data landscape.
- Partner with scientific software engineers to ensure data architectures are implementable, performant, and well-documented.
- Collaborate with Methods4Insight to design data structures that support analytical model training, deployment, and evaluation.
- Work with View email address on click.appcast.io to define scaling strategies, ensure enterprise compliance, and transition data architectures to production-grade management.
- Contribute to build-versus-buy-versus-adopt decisions by evaluating commercial and open-source data platforms against Data Foundry requirements.
- M.S. or PhD in Computer Science, Data Science, Bioinformatics, Computational Biology, Information Science, or related STEM field
- MS (with 6+ years ) and PhD (with 2+ years) of data architecture, data engineering, or scientific informatics experience.
- Deep expertise in at least one of the focus areas: relational databases, data modeling and ontology engineering, data platform and lakehouse architecture (Databricks, Snowflake, Spark), or knowledge graph and specialized database systems (Neo4j, Neptune, MongoDB, TileDB)
- Working familiarity with multiple database paradigms - relational, graph, document, columnar, key-value - and strong SQL skills.
- Understanding of scientific data types and experimental workflows in life sciences or pharma (chemical, biological, HTE data).
- Strong communication skills with ability to translate data architecture concepts for both technical and scientific audiences.
- Familiarity with cloud platforms (AWS, Azure, or GCP) and modern data integration patterns.
- Pharmaceutical or biotech research industry experience, particularly in discovery data management or research informatics.
- Experience with semantic web technologies: RDF, OWL, SPARQL, Protégé, or equivalent ontology engineering tools.
- Hands-on experience with graph databases (Neo4j, Neptune, TigerGraph) and knowledge graph design patterns for scientific data.
- Data lakehouse architecture experience: Databricks (Delta Lake, Unity Catalog), Snowflake, or equivalent; ETL/ELT with Spark, dbt.
- Experience with streaming/real-time data platforms (Kafka, Kinesis, Flink) and event-driven architectures.
- Familiarity with LIMS, ELN systems (e.g., Benchling), and laboratory instrument data integration.
- Experience with vector databases (Pinecone, Weaviate, pgvector) and embedding-based retrieval for ML/RAG applications.
- Array database experience (TileDB, Zarr) for genomics, imaging, or high-dimensional scientific data.
- FAIR data principles implementation experience and Data Readiness Level frameworks.
- Scientific data standards and controlled vocabularies in chemistry (InChI, SMILES) or biology (Gene Ontology, UniProt).
- Experience with C, C++, or Rust for performance-critical data processing; familiarity with HPC data I/O patterns for large-scale scientific computations.
Lilly is dedicated to helping individuals with disabilities to actively engage in the workforce, ensuring equal opportunities when vying for positions. If you require accommodation to submit a resume for a position at Lilly, please complete the accommodation request form ( for further assistance. Please note this is for individuals to request an accommodation as part of the application process and any other correspondence will not receive a response. Lilly is proud to be an EEO Employer and does not discriminate on the basis of age, race, color, religion, gender identity, sex, gender expression, sexual orientation, genetic information, ancestry, national origin, protected veteran status, disability, or any other legally protected status. Our employee resource groups (ERGs) offer strong support networks for their members and are open to all employees. Our current groups include: Africa, Middle East, Central Asia Network, Black Employees at Lilly, Chinese Culture Network, Japanese International Leadership Network (JILN), Lilly India Network, Organization of Latinx at Lilly (OLA), PRIDE (LGBTQ+ Allies), Veterans Leadership Network (VLN), Women's Initiative for Leading at Lilly (WILL), enAble (for people with disabilities). Learn more about all of our groups. Actual compensation will depend on a candidate's education, experience, skills, and geographic location. The anticipated wage for this position is
$151,500 - $222,200 Full-time equivalent employees also will be eligible for a company bonus (depending, in part, on company and individual performance). In addition, Lilly offers a comprehensive benefit program to eligible employees, including eligibility to participate in a company-sponsored 401(k); pension; vacation benefits; eligibility for medical, dental, vision and prescription drug benefits; flexible benefits (e.g., healthcare and/or dependent day care flexible spending accounts); life insurance and death benefits; certain time off and leave of absence benefits; and well-being benefits (e.g., employee assistance program, fitness benefits, and employee clubs and activities).Lilly reserves the right to amend, modify, or terminate its compensation and benefit programs in its sole discretion and Lilly's compensation practices and guidelines will apply regarding the details of any promotion or transfer of Lilly employees. #WeAreLilly
- ...Overview: Role: Data Architect Location: San Francisco, CA Duration: 6 months Data Architecture Strategy o Define and implement enterprise-wide data architecture standards and best practices. o Develop conceptual, logical, and physical data models...Suggested
- ...The Data Architect is responsible for designing scalable data architectures that support the integration of multiple data sources. This role defines technical standards, ensures data quality and governance, and enables efficient data processing across the platform. Responsibilities...SuggestedRemote work
- ...Data Architect Data Architect Bay Area, CA / Charlotte, NC (1 position for each) Manage multiple projects Design and execute the key initiatives using the subject matter expertise with extensive knowledge of customer complaints & survey and system of records,...Suggested
- ...Lawrence Berkeley National Laboratory is hiring a Data Science Workflows Architect within the NERSC division. The National Energy Research Scientific Computing Center (NERSC) is seeking an engineer with experience in complex scientific workflows to join our team to help...Suggested
$170k - $225k
...Data Architect Atlanta; Boston; Charlotte; Chicago; Dallas; Los Angeles; New York; San Francisco This position is not eligible for immigration sponsorship. Company Overview We are the better way to work in finance. As private equity's value creation partner...SuggestedWork at officeLocal areaRemote work2 days per week- ...GCP Data Architect Location: SFO, CA Rate: Open Experience: 8+ years of total experience with at least 3 years in working with Google Cloud components. GCP Data Experience: Must have good knowledge of Cloud run, Cloud function, Cloud SQL, Pub-sub, Cloud...
- A leading cloud cost management company is seeking a Distinguished Architect to drive the architecture of its data platform. This role involves designing real-time streaming data pipelines and collaborating with various engineering teams to shape the overall engineering...Remote work
- A data solutions company in San Francisco seeks a Data Architect to design scalable data architectures that integrate multiple data sources. The role involves establishing technical standards for data quality, governance, and enabling efficient data processing. Candidates...
- ...Overview: About the job you're considering The Data Governance Architect must have experience in defining and implementing enterprise data governance frameworks that ensure data is trusted, secure, compliant, and fit for business use. This role bridges business...
- ...Microsoft Data Architect Sonsoft, Inc. is a USA based corporation duly organized under the laws of the Commonwealth of Georgia. Sonsoft Inc. is growing at a steady pace specializing in the fields of Software Development, Software Consultancy and Information Technology...Permanent employmentFull time
$226.67k - $236.26k
...Principal Data Architect Berkshire Hathaway Homestate Companies, Workers Compensation Division, is searching for a Principal Data Architect to design and evolve the enterprise data architecture that supports our growing business, AI adoption, and long-term strategy...Work at officeImmediate startWork from homeWork visaFlexible hours- ...Data Architect Location: San Francisco, CA (Onsite) Duration: 6+ Months Job Description: Research and properly evaluate sources of information to determine possible limitations in reliability or usability Apply sampling techniques to effectively determine...Work experience placement
$198k - $290k
...Direct Current Data Center Architect Eaton's Global Data Center Segment is one of the company's most dynamic and rapidly evolving businesses, playing a critical role in powering the digital world. With the rise of big data, edge computing and the cloud, data centers...Work experience placementWork at officeLocal area- ...Job Description: Data Architect, MarTech Data Architect, MarTech our mission is to empower individuals on their journey to better financial health. As part of this mission, the Big Data and Analytics Platform team is building the foundation for a data-driven...Work experience placement
$138k - $210k
...GTM Data Architect Austin | Chicago | New York City | Salt Lake City | San Francisco Gong harnesses the power of AI to transform how revenue teams win. The Gong Revenue AI Operating System unifies data, insights, and workflows into a single, trusted system that...Remote workWork from homeFlexible hours- ...Senior Data Architect Hybrid - San Francisco, California Our mission at Oura is to empower every person to own their inner potential. Our award-winning products help our global community gain a deeper knowledge of their readiness, activity, and sleep quality by...Work at officeFlexible hours
- ...Staff Data Architect At Komodo Health, our mission is to reduce the global burden of disease. And we believe that smarter use of data is essential to this mission. That's why we built the Healthcare Map — the industry's largest, most complete, precise view of the U...
- A leading global professional services firm is seeking a Data Architecture Manager focused on the utilities sector. The role requires extensive experience in data modernization and architecture, alongside leadership capabilities to guide teams and deliver complex technology...
- ...YO IT Consulting is seeking a Senior Data Architect to contribute to how AI systems reason about complex enterprise data. This remote role requires 4+ years of experience in data architecture and proficiency with cloud platforms. Responsibilities include evaluating AI...Remote work
- A forward-thinking company in San Francisco is seeking a Healthcare Data Partnerships Lead to build and manage data partnerships in healthcare. This role requires a strong background in business development or partnerships within the healthcare sector. The candidate will...
- ...We are seeking a Strategic Data Solutions Architect to help enterprise organizations modernize their data ecosystems and build AI-ready solutions within the Salesforce platform. This role combines enterprise architecture, data quality strategy, and client-facing leadership...Work at office2 days per week1 day per week
$190k - $250k
...technical leadership role in realizing Heartflow's vision of a single, comprehensive platform to manage heart disease. As a Staff Data Architect, you will lead the data strategy for our medical device system-defining how clinical, imaging, and operational data is...Work experience placementLocal areaWorldwideRelocation- A technology company is seeking a Data Foundation Account Executive to lead the sales of its integrated data solutions. This role focuses on selling both MuleSoft and Informatica to drive business value for clients, empowering them to leverage data effectively in AI initiatives...
- A leading technology firm is seeking a Senior Consultant for Data Governance in San Francisco. You will develop and implement data governance frameworks and policies, work with data teams, and ensure high data quality standards. The ideal candidate will have over 7 years...
$226.67k - $236.26k
A national insurance group is seeking a Principal Data Architect to shape the enterprise data architecture and support their strategic transition toward AI adoption. The ideal candidate will lead efforts in data integration, real-time analytics, and the establishment of...- A tech-driven marketing firm in San Francisco seeks a motivated individual to lead its Data Partnerships marketing efforts. This role includes developing joint marketing initiatives with numerous data partners, crafting effective messaging, and maintaining engagement programs...
- A leading global consulting firm is seeking a Senior Manager in Data Architecture to oversee technology projects within the automotive and aerospace sector. This role involves crafting and implementing data architectures, leading teams, and managing client relationships...Flexible hours
- Violet Research in San Francisco is seeking a founding Bioinformatician to architect the entire genomics data foundation. This role includes managing data from sequencing ingestion to clinical interpretation, impacting real patient outcomes. Ideal candidates should have...
$233.5k - $350.5k
A dynamic nonprofit organization in San Francisco seeks a Senior Staff Data Platform Architect to modernize their data ecosystem and manage a cloud-native platform. This impactful role requires 10+ years of data architecture experience, hands-on skills with platforms like...$225k - $290k
Circle, a leader in the internet financial platform space, seeks a Senior Data Strategy Lead in San Francisco. The role focuses on driving data quality, reliability, and operational excellence. Candidates should have significant experience in designing scalable data platforms...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Advisor - Data Architect, Data Foundry. Be the first to apply!
- innovation advisor San Francisco, CA
- college advisor San Francisco, CA
- trust advisor San Francisco, CA
- service advisor assistant service manager San Francisco, CA
- work from home advisor San Francisco, CA
- at home advisor San Francisco, CA
- scientific advisor San Francisco, CA
- comfort advisor San Francisco, CA
- international police advisor San Francisco, CA
- cultural advisor San Francisco, CA

