Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

AI & NLP Fellowship: Data Engineering for Social Impact

Institute for Development Impact - I4DI

AI & NLP Fellowship: Data Engineering for Social Impact 2 days ago Be among the first 25 applicants Institute for Development Impact (I4DI) | DECipher Project About the Project DECipher is an AI-powered platform developed by the Institute for Development Impact (I4DI) to help global development professionals access and interpret decades of USAID-funded learning. It draws from one of the largest public document archives in international development, transforming raw PDFs into structured insights using modern machine learning techniques. At its core, DECipher is a public infrastructure project. It connects natural language processing with real-world policy and program decisions. The work is technical, but the impact is human. It supports smarter, more accountable development efforts worldwide. The Opportunity We are offering a volunteer summer fellowship for individuals who want to gain real experience working with applied AI systems. Fellows will help us prepare a large, high-value dataset for fine-tuning domain-specific language models. This is not a theoretical exercise. You will be working directly with tens of thousands of documents, contributing to the quality and integrity of training data that powers an open-access AI tool for public benefit. While unpaid, this role offers serious technical learning and the chance to be part of something that is both ambitious and grounded. What You Will Work On Process and clean large volumes of unstructured PDF documents Develop and manage text extraction workflows using Python and NLP tools Review document structure and metadata for consistency and quality Label and classify documents to support supervised and semi-supervised learning Support QA and data validation steps critical for model fine-tuning Work with experienced engineers and researchers on a functioning AI pipeline What You Will Learn How to build structured datasets for training large language models Techniques in OCR, document parsing, tokenization, and quality assurance How NLP systems are adapted to real-world, domain-specific use cases What it takes to make AI systems both reliable and accountable Who You Are Current student, recent graduate, or early-career professional with experience in Python and interest in NLP, machine learning, or data engineering Comfortable working with complex documents, legacy formats, and detailed guidelines Motivated by mission-driven tech and open-access knowledge Looking for more than just a credential, you want meaningful work and real learning What You Will Gain Applied experience with large-scale data preparation A practical, portfolio-worthy contribution to an operational AI system Mentorship from a team experienced in responsible AI and development practice Flexible hours and remote collaboration Possibility for extended work or future opportunities based on performance Fully remote Summer 2025 8 to 12 week commitment 30 to 40 hours per week, flexible scheduling How to Apply Send a brief message describing your interest and experience, along with a resume and link to relevant work, to View email address on click.appcast.io. Seniority level Internship Employment type Internship Job function Research, Analyst, and Information Technology Industries International Trade and Development #J-18808-Ljbffr Institute for Development Impact - I4DI

Vacancy posted 4 days ago
Similar jobs that could be interesting for youBased on the AI & NLP Fellowship: Data Engineering for Social Impact in Washington DC vacancy
  • Data Science / Machine Learning Engineer (Remote, Continental United States) 3 weeks ago Be among...  ...to the advancement of our AI capabilities. About You...  ...that deliver real‑world impact. Responsibilities Identify...  ..., Deep Learning, NLP, Time Series Analysis, etc... 
    Suggested
    Remote job
    Local area
    Flexible hours

    ICA, Inc.

    Arlington, VA
    3 days ago
  • $140k - $200k

    A growing technology company in Arlington, VA, is seeking an AI/ML Engineer to pioneer machine learning solutions for the defense sector. This role involves designing models for tackling complex challenges, deploying production-grade ML models, and driving significant... 
    Suggested
    Flexible hours

    Obviant

    Arlington, VA
    3 days ago
  • $150k - $210k

    Enterprise Knowledge (EK) is hiring for a full-time Semantic Data and AI Engineer to join our growing Knowledge and Data Services Sector. In this...  ...understand and model their domain of knowledge Apply NLP techniques (entity extraction, classification, and document processing... 
    Suggested
    Full time
    Work at office
    Local area
    Remote work

    Enterprise Knowledge, LLC

    Arlington, VA
    2 days ago
  • $150k - $210k

    Enterprise Knowledge, LLC is hiring for a full-time Semantic Data and AI Engineer in Arlington, VA. This role involves designing and deploying...  ...years of experience in data analysis, machine learning, and NLP techniques. The position operates on a hybrid model with a salary... 
    Suggested
    Remote job
    Full time

    Enterprise Knowledge, LLC

    Arlington, VA
    2 days ago
  •  ...actively building a pipeline of Data Professionals to support...  ...This pipeline focuses on Data Engineering and AI Engineering as primary capabilities...  ...orchestration) Develop NLP pipelines and text-based data...  ...company that is making a real impact. #J-18808-Ljbffr Iberia... 
    Suggested
    Contract work

    Iberia Advisory

    Washington DC
    5 days ago
  •  ...Applied Machine Learning Engineer to join our team. This...  ...and maintain innovative AI/ML solutions that enhance...  ...insights. Utilize NLP and machine learning techniques...  ...and unstructured data, including text and images...  ...role where you can make an impact, we want to hear from you... 
    Remote work

    NLP PEOPLE

    Washington DC
    4 days ago
  •  ...-Stack Developer to design and deliver modern applications and APIs. This role emphasizes building complex, data-driven systems, and incorporates cutting-edge AI/ML technologies. The ideal candidate has 7+ years of experience with Microsoft .NET, AI/ML implementation, and... 

    Aristotle

    Washington DC
    3 days ago
  •  ...Security leaders with decision-worthy data and analysis to improve the...  ...//SCI cleared Full Stack Data Engineer with a variety of technical...  ...parsing tools to include regex and NLP Preferred technical and...  ...and CSS Experience in advanced AI techniques - fine-tuning ML models... 
    Full time
    Work at office
    Immediate start

    IBM Computing

    Bethesda, MD
    3 days ago
  • $89.9k - $160.6k

     ...structured and unstructured data from various sources (e...  ...into conversational AI platforms (e.g., Amazon...  ...of experience in data engineering, preferably in an AI...  ...degree Familiarity with NLP concepts and conversational...  ...to mitigating our impact on the environment and... 
    Minimum wage
    Full time
    Work experience placement
    Local area

    UnitedHealth Group

    Washington DC
    4 days ago
  • $87k - $178.1k

     ...Job Description The Data Center Construction organization at...  ...data centers. As a Principal Engineer, you will serve as a technical...  ...order process, validating scope impacts, pricing accuracy, and alignment...  ...to life-saving care. And with AI embedded across our products... 
    Temporary work
    Live in
    Local area
    Worldwide
    Relocation
    Relocation package
    Flexible hours

    Oracle

    Washington DC
    1 day ago
  • Agile Defense, LLC seeks a Senior Data Scientist/Engineer in McLean, VA, to drive intelligence data-analysis initiatives. You will collaborate...  ...of experience in data science and a firm grasp of Python, NLP, and AI frameworks. This full-time position requires a TS/SCI... 
    Full time

    Agile Defense, LLC

    Mc Lean, VA
    3 days ago
  •  ...analytical externship as part of the Hiring Our Heroes Skillbridge Fellowship program. The role involves providing analytical support in the...  ...'s degree in a related STEM field along with experience in data analysis. Candidates with varying levels of experience are encouraged... 

    Arenatechnologies

    Washington DC
    2 days ago
  •  ...to power an abundant electric future. As AI data centers drive a surge in electricity demand...  ...We're looking for a Data / Analytics Engineer to own the data infrastructure that powers...  ...not just tickets to close. This is a high-impact, high-ownership role on a lean team. We move... 
    Flexible hours

    Unchain Data

    Washington DC
    2 days ago
  • $86.8k - $198k

     ...Data Security Engineer The Opportunity: Architect, deploy, and configure data security solutions across various clients for DoD, IC, and...  ...picture to verify your identity and prevent fraud. Candidate AI Usage Policy AI is a part of our daily work at Booz... 
    Full time
    Contract work
    Part time
    Work at office
    Local area
    Remote work

    BOOZ, ALLEN & HAMILTON, INC.

    Arlington, VA
    3 days ago
  • Senior Associate, Data Scientist - NLP Data is at the center of everything we do. As a startup, we...  ...their financial lives. Team Description AI Foundations Specialist Models Data...  ...functional team of data scientists, software engineers, machine learning engineers and product... 
    Local area
    Flexible hours

    SwiftCruit

    Mc Lean, VA
    4 days ago
  •  ...Analytica is seeking a Data Scientist to support long term federal...  ...and tokenization. Feature Engineering and Attribute Evaluation -...  ...must demonstrate experience with NLP feature engineering methods such...  ...accuracy, Analytica may use AI-assisted tools to support certain... 
    Full time
    For contractors
    Local area
    Remote work

    Analytica

    Washington DC
    1 day ago
  •  ...that is Built to Conquer Risk ®. Summary Potomac is continuing to invest in modern data and AI capabilities to support our growing business. We are seeking a Machine Learning Data Engineer to join our team and play a critical role in building and scaling our data... 
    Contract work

    Potomac

    Bethesda, MD
    5 days ago
  • $119.7k - $199.3k

     ...Matters We are seeking a skilled and experienced Senior Data Engineer for our Data and AI team to contribute to the development of innovative AI-powered...  ...about solving complex technical challenges and driving impactful projects that shape the user experience for a news media... 

    Nashville Public Radio

    Washington DC
    5 days ago
  • $89.9k - $160.6k

    UnitedHealth Group is seeking a Data Engineer in Washington, DC, to design scalable ETL/ELT pipelines and ensure data quality for their AI systems. The ideal candidate will have a Bachelor's degree and 8+ years of experience in the field, with strong skills in Python,... 

    UnitedHealth Group

    Washington DC
    4 days ago
  •  ...forefront of this transformation, leveraging AI, automation, and modern data platforms to drive smarter investment...  ...‑edge tools to make a measurable impact on the business. FCP is a leading...  ...are seeking an experienced Sr. Data Engineer to join our Washington, DC team. This... 
    Work experience placement

    Federal Capital Partners

    Chevy Chase, MD
    5 days ago
  • Job Title: Data Engineer Location(s): Arlington, VA & Washington DC (DUE...  ...information systems, social sciences, physics, or decision...  ...stakeholders to design and deploy impactful data applications and visualizations...  ..., started with the help of AI. #J-18808-Ljbffr Elder... 
    Full time
    For contractors
    Work at office
    Immediate start
    Remote work
    Flexible hours

    Elder Research

    Arlington, VA
    1 day ago
  • An established industry player is seeking a skilled data engineer to leverage big data for impactful missions. In this dynamic role, you will design and develop scalable data platforms, ensuring that complex data applications are operationalized effectively. Collaborating... 
    Remote job

    Phase2 Technology

    Arlington, VA
    5 days ago
  • Talascend is currently seeking a Databricks Data Engineer for a contract opportunity with our client in Washington, District of Columbia ....  ...optimization Collaborate with cross-functional teams to enable AI-driven analytics and workflows Integrate with Azure services such... 
    Contract work

    Talascend, LLC

    Washington DC
    1 day ago
  • Black Cape is seeking a skilled Data Engineer to join their team in Arlington, VA. The ideal candidate will have 3+ years of experience and...  ...'ll be responsible for ensuring data availability and building AI applications that support national security missions. The position... 

    BLACK CAPE LLC

    Arlington, VA
    4 days ago
  • A leading AI-driven technology company is seeking a skilled Data Engineer to design and build efficient data pipelines. Responsibilities include developing scalable data architectures and integrating with AWS cloud services for data processing. The ideal candidate will... 

    BigBear Inc

    Washington DC
    4 days ago
  • $95k - $120.65k

    ## Data EngineerApplylocations: US DC Remotetime type: Full timeposted on: Posted Todayjob...  ...measurable results for clients.At Zelis, AI is woven into the fabric of how we work. Every...  ...accelerate innovation, and amplify their impact. This is a place for builders with a... 
    Full time
    Work at office
    Local area
    Visa sponsorship
    Flexible hours

    Zelis Healthcare Inc.

    Washington DC
    2 days ago
  • A leading data analytics firm seeks a Data Engineer to design, implement, and optimize data architectures. The ideal candidate has over 6 years of experience...  ...and SQL, along with a Bachelor’s degree in a related field. Join us to drive impactful change! #J-18808-Ljbffr Indev

    Indev

    Washington DC
    2 days ago
  • McKinsey & Company is seeking a Data Engineer to develop the foundational data infrastructure for innovative AI applications. You will be part of a diverse team, designing scalable data pipelines and collaborating with experts across various industries to solve complex... 

    McKinsey & Company

    Washington DC
    5 days ago
  • $90.7k - $141.78k

    Responsibilities Noblis is seeking a cleared Data Enginee r to support our customer in Bethesda, MD. SETA. The Data Engineer will serve as a subject matter expert supporting...  ...preprocessed, vectorized datasets ready for AI model training Develop and implement data standards... 
    Local area
    Remote work

    Noblis

    Bethesda, MD
    3 days ago
  • SAIC is looking for qualified candidates for a data architecture and engineering position to support a cutting-edge data, analytics, and AI platform in Arlington, Virginia. Responsibilities include managing customer relationships, software engineering, and data pipeline... 

    SAIC

    Arlington, VA
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to AI & NLP Fellowship: Data Engineering for Social Impact. Be the first to apply!