Advisor - Data Architect, Data Foundry
Eli Lilly
Data Architect
At Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare leader headquartered in Indianapolis, Indiana. Our employees around the world work to discover and bring life-changing medicines to those who need them, improve the understanding and management of disease, and give back to our communities through philanthropy and volunteerism. We give our best effort to our work, and we put people first. We're looking for people who are determined to make life better for people around the world.
Location: San Diego, CA; San Francisco, CA; Boston, MA; Louisville, CO; Indianapolis, IN
Reports To: Lead, Data Architecture (R9), Architecture4Insight
Overview
Lilly Small Molecule Discovery is purpose-built to create molecules that make life better for people. Discovery Technology and Platforms (DTP) accelerates molecule discovery by building optimized foundational platforms, streamlining lab operations through advanced technologies and data connectivity, and investing in novel capabilities.
Data Foundry is a multidisciplinary team within DTP that enables AI-native drug discovery through four integrated pillars: Architecture4Insight (data infrastructure and scientific software), Methods4Insight (analytical and computational methods), Automation & Scale4Insight (lab automation and agentic workflows), and Preparedness4Insight (data governance and readiness). These pillars empower every Lilly scientist to make optimal decisions by providing seamless access to data, insights, and AI-driven capabilities—serving both human scientists and autonomous AI agents.
Position Summary
We are seeking Data Architects at multiple levels to design and build the data infrastructure that makes AI-native drug discovery possible. You will create the schemas, ontologies, data models, knowledge graphs, and platform architectures that transform raw scientific data into machine-actionable, FAIR-compliant, insight-ready assets—serving both discovery scientists and autonomous AI agents.
This role is the foundation of Architecture4Insight. Everything the software engineering team builds—pipelines, APIs, prototypes—depends on the data models and platform architecture this team designs. You will work with deep knowledge of scientific data (chemical, biological, HTE, automation-generated) to create custom-fit solutions, then partner with View email address on click.appcast.io to scale and maintain them. The role spans three focus areas depending on expertise: data modeling & ontologies, data platform & lakehouse architecture, and knowledge graph & specialized data systems. You will independently design schemas, select technologies, and make build-vs-buy recommendations for their domain.
Responsibilities
Data Modeling & Ontologies
- Design and implement data models, schemas, and ontologies for chemical, biological, and automation-generated data that serve discovery workflows across the portfolio.
- Define and maintain controlled vocabularies, metadata standards, and FAIR-compliant data frameworks in partnership with Preparedness4Insight.
- Implement semantic data standards (RDF, OWL, SPARQL) and ontology engineering practices to create interoperable, machine-readable scientific data.
Data Platform & Lakehouse Architecture
- Design and implement data lakehouse architecture using modern platforms (Databricks, Snowflake, or equivalent), including data storage patterns, partitioning strategies, and query optimization.
- Build and optimize ETL/ELT pipelines using Spark, dbt, or similar tools to transform raw scientific data into analytical and ML-ready formats.
- Implement real-time and streaming data integration (Kafka, Kinesis, event-driven patterns) connecting LIMS, instruments, and lab automation systems to the data infrastructure.
Knowledge Graph & Specialized Data Systems
- Design and implement knowledge graphs (Neo4j, Amazon Neptune, TigerGraph) that capture molecular, target, pathway, and experimental relationships across the discovery landscape.
- Architect specialized data solutions: array databases (TileDB) for genomics/imaging, document stores (MongoDB) for experimental records, and vector databases for embedding-based retrieval supporting ML and RAG workflows.
- Build query and traversal patterns that enable scientists and AI agents to ask relational questions across the entire data landscape.
Cross-Functional Partnership
- Partner with scientific software engineers to ensure data architectures are implementable, performant, and well-documented.
- Collaborate with Methods4Insight to design data structures that support analytical model training, deployment, and evaluation.
- Work with View email address on click.appcast.io to define scaling strategies, ensure enterprise compliance, and transition data architectures to production-grade management.
- Contribute to build-versus-buy-versus-adopt decisions by evaluating commercial and open-source data platforms against Data Foundry requirements.
Basic Requirements
- M.S. or PhD in Computer Science, Data Science, Bioinformatics, Computational Biology, Information Science, or related STEM field
- MS (with 6+ years) and PhD (with 2+ years) of data architecture, data engineering, or scientific informatics experience.
- Deep expertise in at least one of the focus areas: relational databases, data modeling and ontology engineering, data platform and lakehouse architecture (Databricks, Snowflake, Spark), or knowledge graph and specialized database systems (Neo4j, Neptune, MongoDB, TileDB)
Preferred Qualifications
- Working familiarity with multiple database paradigms — relational, graph, document, columnar, key-value — and strong SQL skills.
- Understanding of scientific data types and experimental workflows in life sciences or pharma (chemical, biological, HTE data).
- Strong communication skills with ability to translate data architecture concepts for both technical and scientific audiences.
- Familiarity with cloud platforms (AWS, Azure, or GCP) and modern data integration patterns.
- Pharmaceutical or biotech research industry experience, particularly in discovery data management or research informatics.
- Experience with semantic web technologies: RDF, OWL, SPARQL, Protégé, or equivalent ontology engineering tools.
- Hands-on experience with graph databases (Neo4j, Neptune, TigerGraph) and knowledge graph design patterns for scientific data.
- Data lakehouse architecture experience: Databricks (Delta Lake, Unity Catalog), Snowflake, or equivalent; ETL/ELT with Spark, dbt.
- Experience with streaming/real-time data platforms (Kafka, Kinesis, Flink) and event-driven architectures.
- Familiarity with LIMS, ELN systems (e.g., Benchling), and laboratory instrument data integration.
- Experience with vector databases (Pinecone, Weaviate, pgvector) and embedding-based retrieval for ML/RAG applications.
- Array database experience (TileDB, Zarr) for genomics, imaging, or high-dimensional scientific data.
- FAIR data principles implementation experience and Data Readiness Level frameworks.
- Scientific data standards and controlled vocabularies in chemistry (InChI, SMILES) or biology (Gene Ontology, UniProt).
- Experience with C, C++, or Rust for performance-critical data processing; familiarity with HPC data I/O patterns for large-scale scientific computations.
Lilly is dedicated to helping individuals with disabilities to actively engage in the workforce, ensuring equal opportunities when vying for positions. If you require accommodation to submit a resume for a position at Lilly, please complete the accommodation request form ( for further assistance. Please note this is for individuals to request an accommodation as part of the application process and any other correspondence will not receive a response.
Lilly is proud to be an EEO Employer and does not discriminate on the basis of age, race, color, religion, gender identity, sex, gender expression, sexual orientation, genetic information, ancestry, national origin, protected veteran status, disability, or any other legally protected status.
Our employee resource groups (ERGs) offer strong support networks for their members and are open to all employees. Our current groups include: Africa, Middle East, Central Asia Network, Black Employees at Lilly, Chinese Culture Network, Japanese International Leadership Network (JILN), Lilly India Network, Organization of Latinx at Lilly (OLA), PRIDE (LGBTQ+ Allies), Veterans Leadership Network (VLN), Women's Initiative for Leading at Lilly (WILL), enAble (for people with disabilities). Learn more about all of our groups.
Actual compensation will depend on a candidate's education, experience, skills, and geographic location.
- A leading technology consulting firm is seeking a Principal Data Platform Architect to drive the migration from Palantir Foundry to Databricks. This role demands extensive experience in Databricks architecture, PySpark, and SQL for effective data processing workflows....Suggested
$64.5k - $167.2k
...Automation Engineering Department at Eli Lilly, Foundry. The engineer will provide automation... ...provide Automation, Process Controls and Data Historian expertise and support historian... ...will also function as the main Data Architect to ensure proper data contextualization and...SuggestedPermanent employmentFull timeWork experience placementH1bRemote workVisa sponsorshipWork visaFlexible hours$126k - $204.6k
...to make life better for people around the world. Lilly recently announced a $4.5 billion investment to create the Lilly Medicine Foundry, a new center for advanced manufacturing and drug development. The first‑ever facility of its kind, combining research and manufacturing...SuggestedFull timeH1bVisa sponsorshipWork visaFlexible hoursNight shift- ...A leading biopharmaceutical company is seeking a Principal Data/AI Engineer to drive the technical strategy and architecture of enterprise... ...engineering, proficiency in Python and SQL, and the ability to architect and maintain data pipelines effectively. This position will be...SuggestedRemote work
$156.64k
...Maximus is currently seeking an IT Principal Data Architect. The IT Principal Data Architect - (Data Migration) is a senior technical leader responsible for the architecture, engineering, and delivery of large-scale, complex data migration programs, transforming data...SuggestedLocal areaRemote work- ...Onsite DM Architect: : Job Description • At least 10+ years overall experience overall and 5+ years in handling ERP data migration as Data migration Architect/Lead capacity . • Implementation experience of data migration projects for PeopleSoft ERP or...
- ...Job Family : Data Engineering & Architecture Consulting Travel Required :... ...Secret What You Will Do As a Data Architect, you will lead the design, implementation... ...architectures, leveraging Databricks and Palantir Foundry as core delivery platforms. Key...Temporary workFlexible hours
- ...Responsibilities Design overall data architecture strategy aligned with business objectives and Salesforce best practices Architect and implement integrations between Salesforce Data Cloud and enterprise systems Ensure data consistency and integrity across...
$165k
...We are seeking a highly experienced Data Architect with deep healthcare and Medicaid domain expertise to design, modernize, and govern our enterprise data ecosystem. This role is critical to supporting Medicaid operations, compliance, reporting, analytics, and population...Remote work- ...This is a hybrid role based in Indianapolis, IN. About the job you're considering We are seeking an accomplished Senior Data & AI Architect to lead the design and delivery of enterprise-scale data engineering and AI-driven solutions. This role requires deep...
- ...Role name: Data Architect Work site: Indianapolis, IN (onsite, local only) Job Description: Local candidates only. 20 years of experience delivering Data & AI-driven projects. 10 Years of experience in Architecture, Providing Solutions especially in Cloud...Local area
- ...Job Title: IT Data Visualization Architect Location: Indianapolis, IN Duration: 12+ Months Description: The IT Data Visualization Architect will craft visualizations and dashboards relevant to agency's different divisions serving both statewide and...Work experience placementRemote work
$100.71k - $157.63k
...is responsible for driving automation, data-driven decision making and software modernization... ...and system availability. As AI Data Architect, you will be responsible for designing... ...we are We are more than a company. We are advisors, consultants, problem solvers, friends,...Local areaRemote workFlexible hours- ...NAVA Software solutions is looking for a Sr Data Architect Details: Sr Data Architect Location: Hybrid - Indianapolis, IN / Remote also ok Duration: 6-12 months As the Senior Data Architect, you will be responsible for architecting...Remote work
- ...About TetraScience TetraScience is the Scientific Data and AI Company building Tetra OS, the operating system for scientific intelligence. We help the world’s leading life sciences firms turn fragmented scientific data into AI-native assets and scientific workflows...Remote workFlexible hours
$79.12k
...Data Architect Date Posted: May 6, 2026 Requisition ID: 477179 Location: Indianapolis, IN, US, 46204 Work for Indiana Begin a fulfilling career with the State of Indiana by joining one of the largest employers in the state, offering a range of opportunities...Full timeWork experience placement$123.1k - $186.3k
...future of AI, and you are the future of Salesforce. Responsibilities: * Be a trusted Agentforce + Data Cloud subject-matter expert for the broader Success Architect organization, including how Data Cloud relates to the success of AI * Engage with our Signature and...Work experience placementWork at officeRemote workFlexible hours3 days per week- ...Data Architect The Data Architect is a builder-architect: the person who writes the code, sets the patterns, and makes the hard tradeoffs that shape the platform the rest of the team builds on. This is a modern full-stack data & analytics engineering role with hands...Work at office
$161.18k - $213.81k
...Data Architect - Databricks Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues around the world, and where you’ll be able...Permanent employmentFull timeContract workLocal area$142k
...Description : JOB SUMMARY We are seeking a hands-on Principal Data Architect to serve as a leader in our ongoing data and digital... ...platforms. We require a natural thought leader and strategic advisor, shaping and driving the evolution of complex architectures across...Temporary workFor contractorsWork at officeLocal areaRemote work$97.5k - $199.5k
...insights via dynamic visualizations, and enabling collaborative, data-driven improvement projects across the enterprise.... ...Federal Security Clearance. Responsibilities As a Senior Data Architect on the Oracle Health Advance research and development team, you...Temporary workWork experience placementFlexible hours- ...EDI (Electronic Data Interchange) Technical Architect Location: Indianapolis, IN Duration: 7+ Months The EDI Technical Architect is responsible for designing, implementing, and maintaining the technical infrastructure and architecture that supports electronic...
- ...Data/ Infrastructure Architect Indianapolis, IN Duration: 1 year Rate: W-2 & 1099 only USC,GC,TN & GC/H4-EAD Preferred 7+ years of relevant experience... ...tradeoff analysis Establish and maintain a trusted advisor and partnership relationship to business users and...
- A leading Scientific Data and AI firm in the United States seeks a data-driven professional to design solutions for scientific AI challenges. The ideal candidate will have a PhD or relevant master's degree with significant experience in drug discovery and a strong background...Remote job
- Gainbridge Fieldhouse in Indianapolis is looking for a Data Architect to lead the design of a modern data and analytics platform. This role encompasses platform architecture, data modeling, and governance standards, with a focus on hands-on technical leadership. Candidates...
$132k - $193.6k
A leading global healthcare company is seeking a Data Architect to design and build data infrastructure critical for AI-native drug discovery. You will develop data models, ontologies, and platform architectures to transform raw scientific data into actionable insights...Full time- A global AI and Data Analytics consulting firm is seeking a skilled Databricks Technical Solution Architect to drive enterprise-scale transformations. You will architect solutions using the Databricks Lakehouse Platform, leading pre-sales efforts and advising executives...
- Dovel Technologies, Inc in Indianapolis, IN is looking for a Data Architect to lead the design and implementation of enterprise-scale data... ..., requiring hands-on experience with Databricks and Palantir Foundry. Ideal candidates will have at least five years in data...
- A global consulting firm seeks a Senior Manager in Data Architecture focused on the Power & Utilities sector. This role offers the chance to lead transformative projects while managing teams, ensuring quality outcomes, and building client relationships. The ideal candidate...
- A leading automotive manufacturer located in Indianapolis is seeking a Senior Data Analytics and Technology Modernization leader. This key role is responsible for defining and executing the enterprise strategy for advanced analytics and modern technologies. The ideal candidate...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Advisor - Data Architect, Data Foundry. Be the first to apply!
- innovation advisor Indianapolis, IN
- college advisor Indianapolis, IN
- trust advisor Indianapolis, IN
- service advisor assistant service manager Indianapolis, IN
- work from home advisor Indianapolis, IN
- at home advisor Indianapolis, IN
- scientific advisor Indianapolis, IN
- comfort advisor Indianapolis, IN
- international police advisor Indianapolis, IN
- cultural advisor Indianapolis, IN

