Associate Director, Data Engineering (Real World Data)
$204.5k - $267k
Formation Bio
About Formation Bio
Formation Bio is a tech- and AI-driven pharma company differentiated by radically more efficient drug development. Advancements in AI and drug discovery are creating more candidate drugs than the industry can progress because of the high cost and long duration of clinical trials. Recognizing that this development bottleneck may ultimately limit the number of new medicines that can reach patients, Formation Bio, founded in 2016 as TrialSpark Inc., has built technology platforms, processes, and capabilities to accelerate all aspects of drug development and clinical trials. Formation Bio partners on, acquires, or in-licenses drugs from pharma companies, research organizations, and biotechs to develop programs through clinical proof of concept and beyond, ultimately helping to bring new medicines to patients. The company is backed by investors across pharma and tech, including a16z, Sequoia, Sanofi, Thrive Capital, John Doerr, Spark Capital, SV Angel Growth, and others.
About The Position
As Associate Director of RWD Intelligence at Formation Bio, you will lead the strategy and execution of our real‑world data (RWD) capabilities, building the data foundations that power drug acquisition, clinical development, and portfolio decision‑making. You will own the end‑to‑end lifecycle of RWD: sourcing, procurement, ingestion, harmonization, quality assurance, and delivery of analysis‑ready datasets to downstream consumers across Product, Data Science, Clinical Development, and Business Development.
This role sits at the intersection of data engineering, data science, and drug development. You will build and maintain scalable data infrastructure (pipelines, data models, lakes/marts) while ensuring semantic interoperability across heterogeneous data sources through ontology‑driven harmonization frameworks such as OMOP. You will also manage vendor relationships and data procurement, evaluating and integrating new data assets as the portfolio evolves. The ideal candidate combines deep RWD domain expertise with strong data fluency, enabling Formation Bio to treat real‑world evidence as a first‑class strategic asset.
Responsibilities
- Lead the RWD Intelligence function within Data Science, owning data strategy, sourcing, and delivery of analysis‑ready datasets
- Architect and maintain the supporting infrastructure (pipelines, ingestion workflows, data models, lakes/marts) across EHR/EMR, claims, registries, and genomics‑linked cohorts
- Drive adoption and extension of harmonization frameworks (e.g., OMOP CDM) across heterogeneous data sources, leveraging AI/ML tools for entity resolution, ontology mapping, data quality monitoring, and automated harmonization
- Manage RWD vendor relationships end‑to‑end: evaluate providers, negotiate data use agreements, broker new partnerships, and integrate acquired datasets into the platform
- Partner with Data Science, Clinical Development, Business Development, and Engineering teams to define RWD use cases (trial feasibility, synthetic control arms, epidemiology, label expansion) and productize ad hoc pipelines into scalable, production‑grade systems
- Foster a culture of data quality rigor, documentation, and reproducibility across all RWD assets
Required Qualifications
- BSc or MSc in biomedical informatics, computational sciences, epidemiology, or a related quantitative field
- 5+ years of industry experience working directly with real‑world data (EHR/EMR, claims, registries, linked biobank data) in pharma, biotech, health tech, or consulting, including at least 2 years in people management
- Strong data engineering proficiency (pipelines, ingestion frameworks, data models, data lakes/marts) combined with deep working knowledge of biological and medical ontologies (ICD, SNOMED CT, MedDRA, RxNorm, ATC) and harmonization standards, particularly OMOP CDM
- Demonstrated experience with RWD procurement and vendor management: evaluating data providers, negotiating agreements, and integrating new data assets
- Proven ability to deliver RWD‑derived insights across multiple drug development use cases (e.g., trial design, epidemiology, comparative effectiveness, label expansion), with familiarity across the development lifecycle from target selection through post‑market
- Proficiency with modern AI/ML tools, including large language models, and their applications in data engineering and harmonization workflows
- Strong communication skills with the ability to translate complex data infrastructure concepts for clinical, scientific, and executive audiences
Preferred Qualifications
- PhD in biomedical informatics, epidemiology, computational biology, or a related field
- Experience with large‑scale biobank and genomics‑linked RWD platforms (UK Biobank, FinnGen, All of Us), with a track record of building RWD infrastructure that directly influenced drug acquisition, licensing, or portfolio decisions
- Familiarity with additional biomedical data modalities (scientific literature mining, -omics datasets, molecular data integration) and with data science/analytics methodologies applied to RWD (causal inference, trial simulation, propensity score methods)
- Background transitioning data infrastructure from research/ad‑hoc to production‑grade systems in regulated environments
- Experience working at the intersection of data engineering, data science, and business strategy in pharma/biotech
Compensation
Total Compensation Range: $204,500 - $267,000. Individual compensation is determined by several factors, including role scope, geographic location, and skills & experience. The offer will reflect where you fall within the range based on these considerations. In addition to base salary, we offer equity, comprehensive benefits, and generous perks. If the posted range doesn’t match your expectations, we still encourage you to apply!
Where We Hire
Formation Bio is prioritizing hiring in key hubs, primarily the New York City and Boston metro areas, with a hybrid model requiring 3 days per week in office. Applicants from the Research Triangle (NC) and San Francisco Bay Area may also be considered. Please apply only if you reside in these locations or are willing to relocate.
Equal Opportunity
Formation Bio is committed to building a diverse and inclusive team. We are an equal opportunity employer and welcome candidates from all backgrounds. All qualified applicants will receive consideration for employment without regard to race, color, creed, religion, national origin, ancestry, sex (including pregnancy, childbirth, breastfeeding, and related medical conditions), gender identity or expression, sexual orientation, age, disability, genetic information, marital status, military or veteran status, or any other characteristic protected by federal, state, or local law.

