Postdoctoral Researcher in AI-Driven Data Curation and Data Integration

University of Pennsylvania

Faculty Mentor: Joost Wagenaar
Department: Informatics
Number of Positions: 2
Open to applications from US Citizens and foreign nationals.

The Wagenaar Lab is seeking a highly motivated Postdoctoral Researcher to conduct research at the intersection of artificial intelligence, common data elements (CDEs), and large-scale biomedical datasets. The Wagenaar Lab is jointly based in the Institute for Biomedical Informatics and the Department of Biostatistics, Epidemiology, and Informatics at the University of Pennsylvania, and leads the academic development of the Pennsieve scientific data platform. The lab's mission is to create scalable, sustainable infrastructure that enables data integration, reuse, and discovery across clinical and scientific research domains.

This postdoctoral position will focus on developing AI-enabled methods to automate and augment data curation, with an emphasis on leveraging CDEs to improve the usability, interoperability, and scientific value of public datasets. The successful candidate will work across disease areas, including Epilepsy, Immune Health, and programs within the NIH HEAL Initiative, to design approaches that harmonize heterogeneous datasets, enrich metadata, and support scalable data exploration.

The Postdoctoral Researcher will work closely with the Pennsieve development team and a broad network of scientific collaborators to translate industry best practices in data engineering and AI into the academic research ecosystem. A central goal of this role is to move beyond manual, project-specific curation toward reproducible, automated, and extensible curation workflows that can be applied across datasets, programs, and institutions. In addition to platform and method development, the Postdoctoral Researcher is expected to contribute to peer-reviewed publications, open-source software, and community-facing resources that advance AI-enabled data stewardship and reuse.

Responsibilities
  • Develop AI-based methods and deploy them at scale to automate and augment data curation using Common Data Elements
  • Design workflows to harmonize, validate, and enrich public datasets across Epilepsy, Immune Health, and NIH HEAL programs
  • Develop novel mechanisms to interrogate, visualize, and interact with complex scientific datasets and increase the value of these datasets for the scientific community
  • Integrate curation methods into scalable, cloud-based scientific data platforms
  • Collaborate with the Pennsieve development team and scientific partners to align methods with real research workflows
  • Evaluate and validate curation approaches using large, heterogeneous public datasets
  • Prepare manuscripts, technical documentation, and presentations describing methods and outcomes

Qualifications
  • Ph.D. (preferred) or Master's degree in Biomedical Informatics, Computer Science, Data Science, Bioinformatics, or a related field
  • Experience with machine learning, natural language processing, or AI applied to structured and unstructured data
  • Familiarity with Common Data Elements, data standards, or ontology-based data representation (preferred)
  • Strong programming skills with Docker, Python, Go, Java, or related languages and tools
  • Experience working with large-scale biomedical or clinical datasets
  • Experience with cloud-based data processing and scalable analytics environments (AWS preferred)
  • Strong written and verbal communication skills and an interest in interdisciplinary collaboration

Please include a cover letter and a CV for consideration.

The University of Pennsylvania is an equal opportunity employer. Candidates are considered for employment without regard to race, color, sex, sexual orientation, religion, creed, national origin (including shared ancestry or ethnic characteristics), citizenship status, age, disability, veteran status or any class protected under applicable federal, state, or local law.

Vacancy posted 3 days ago