Postdoctoral Researcher in AI-Driven Data Curation and Data Integration
University of Pennsylvania
Faculty Mentor: Joost Wagenaar
Department: Informatics
Number of Positions: 2
Open to applications from US Citizens and foreign nationals.

The Wagenaar Lab is seeking a highly motivated Postdoctoral Researcher to conduct research at the intersection of artificial intelligence, common data elements (CDEs), and large-scale biomedical datasets. The Wagenaar Lab is jointly based in the Institute for Biomedical Informatics and the Department of Biostatistics, Epidemiology, and Informatics at the University of Pennsylvania, and leads the academic development of the Pennsieve scientific data platform. The lab's mission is to create scalable, sustainable infrastructure that enables data integration, reuse, and discovery across clinical and scientific research domains.

This postdoctoral position will focus on developing AI-enabled methods to automate and augment data curation, with an emphasis on leveraging CDEs to improve the usability, interoperability, and scientific value of public datasets. The successful candidate will work across disease areas, including Epilepsy, Immune Health, and programs within the NIH HEAL Initiative, to design approaches that harmonize heterogeneous datasets, enrich metadata, and support scalable data exploration.

The Postdoctoral Researcher will work closely with the Pennsieve development team and a broad network of scientific collaborators to translate industry best practices in data engineering and AI into the academic research ecosystem. A central goal of this role is to move beyond manual, project-specific curation toward reproducible, automated, and extensible curation workflows that can be applied across datasets, programs, and institutions.

In addition to platform and method development, the Postdoctoral Researcher is expected to contribute to peer-reviewed publications, open-source software, and community-facing resources that advance AI-enabled data stewardship and reuse.
Responsibilities
- Develop AI-based methods and deploy them at scale to automate and augment data curation using Common Data Elements
- Design workflows to harmonize, validate, and enrich public datasets across Epilepsy, Immune Health, and NIH HEAL programs
- Develop novel mechanisms to interrogate, visualize, and interact with complex scientific datasets and increase the value of these datasets for the scientific community
- Integrate curation methods into scalable, cloud-based scientific data platforms
- Collaborate with the Pennsieve development team and scientific partners to align methods with real research workflows
- Evaluate and validate curation approaches using large, heterogeneous public datasets
- Prepare manuscripts, technical documentation, and presentations describing methods and outcomes

Qualifications
- Ph.D. (preferred) or Master's degree in Biomedical Informatics, Computer Science, Data Science, Bioinformatics, or a related field
- Experience with machine learning, natural language processing, or AI applied to structured and unstructured data
- Familiarity with Common Data Elements, data standards, or ontology-based data representation (preferred)
- Strong skills in Docker, Python, Go, Java, or related languages and tools
- Experience working with large-scale biomedical or clinical datasets
- Experience with cloud-based data processing and scalable analytics environments (AWS preferred)
- Strong written and verbal communication skills and an interest in interdisciplinary collaboration

Please include a cover letter and a CV for consideration.

The University of Pennsylvania is an equal opportunity employer. Candidates are considered for employment without regard to race, color, sex, sexual orientation, religion, creed, national origin (including shared ancestry or ethnic characteristics), citizenship status, age, disability, veteran status, or any class protected under applicable federal, state, or local law.

