Data Engineer
Code4lib
Start date: Available starting mid-May 2026 End date: 3 months after first date of work. HBS’s Baker Library is seeking a temporary Data Engineer to help launch a faculty citation data project aimed at better understanding how its collections support and influence scholarly research. This initiative involves identifying faculty publications, extracting their cited references, and analyzing the relationships within this data to generate meaningful insights into patterns of use and library collection impact. By analyzing citations, the project seeks to surface evidence of how Baker’s resources contribute to the research ecosystem at HBS. Reporting to Baker Library’s User Needs and Assessment Librarian, this temporary Data Engineer role will focus on the final phase of the project, where a corpus of raw citation data has already been collected and aggregated from multiple sources. At this stage, the data requires careful cleaning, normalization, and transformation to ensure it is accurate, consistent, and suitable for analysis. The individual in this role will work with this messy dataset to standardize fields, resolve inconsistencies, and prepare the data for downstream analytical work. This phase is critical to ensuring the reliability and interpretability of the project’s findings and will directly shape the quality of insights generated about Baker’s impact. This is a temporary, full-time, remote position. Employees in fully remote positions must work all scheduled hours in a Harvard registered state in compliance with the University’s Policy on Employment Outside of Massachusetts . Specific hours and work days will be determined by business needs and are subject to change with appropriate advanced notice. Responsibilities Clean and normalize raw citation data by resolving inconsistencies in author names, publication titles, journal names, and other variables Co‑develop and apply standardized schemas for field names and data structures to ensure consistency across the dataset Design and implement reproducible data cleaning workflows using scripts that can be reused Co‑create or locate unique identifiers (e.g., for authors, works, journals) to enable accurate linking and deduplication across records Perform record linkage and deduplication using techniques such as fuzzy matching and string comparison Assess and improve data quality by identifying missing, inconsistent, or anomalous values and determining appropriate remediation strategies Conduct exploratory analysis to evaluate the completeness and reliability of the dataset, including identifying patterns of data gaps Collaborate with project stakeholders to align data cleaning decisions with project goals Explore connection points for citation data with other HBS administrative datasets Document data transformations, data dictionaries, and workflows to support transparency, reproducibility, and future project phases Qualifications Experience working with messy, real‑world datasets Advanced proficiency in R (preferred), using libraries such as dplyr, tidyr, and tidyverse, or Python, using libraries such as pandas Familiarity with regular expressions (regex), string comparison, and fuzzy matching Proficient understanding of standardization principles and controlled vocabularies Ability to balance precision and pragmatism when making decisions in the absence of perfect information Comfort documenting processes and decisions for both technical and non‑technical audiences Ability to work independently while also seeking input when project ambiguity or edge cases arise Ability to envision how data cleaning and manipulation serve larger project goals Basic understanding of academic publishing and citation formats Proficiency in Microsoft Office tools (Outlook email, Teams sites, folder management, file retrieval) #J-18808-Ljbffr
$130k - $160k
...operations and IT to make real progress in breakthrough therapeutics, food, energy, and more. The Position We are seeking a Senior Data Engineer to join our growing Enterprise Data & Analytics team. Our mission is to empower every team across the organization with timely,...SuggestedRemote workFlexible hours- ...authorized to work in the United States. We do not provide sponsorship for employment visas now or in the future. Welcome to Love's The Data Engineer III will play a pivotal role in building, managing, and optimizing data pipelines for the Enterprise Data & Analytics...Suggested
- ...an alternative application process. Full Time Information Tech Oklahoma City, OK, US 30+ days ago Requisition ID: 1759 Job Title Data Engineer Reports To Manager, Data and Development Position Summary In this role, you will be a key contributor in developing, optimizing,...SuggestedFull time
- ...A leading data solutions company in Oklahoma City is seeking a Data Engineer to develop and maintain data infrastructure supporting analytics and business intelligence. This role involves designing scalable data pipelines, ensuring data quality and integrity, and collaborating...Suggested
$48k - $68k
...US Salary: $48,000 - $68,000 About the Role The Data Engineer I supports the execution of conversions, data hygiene, basic edits and custom data services by performing basic data manipulation, editing, and transformation tasks. This role helps maintain the accuracy and...SuggestedFull timeLocal area- ...Job Description We are offering an exciting opportunity for a Data Engineer in Oxford, Massachusetts. This role is pivotal to our operations, utilizing your expertise in Cloud Technologies, Database, EO/IR systems, Algorithm Implementation, Analytics, API Development,...
$25.48 - $63.65 per hour
...Job Description The Data Migration Engineer II is responsible for designing, building, testing, and executing data migration solutions that support Oracle Health implementation and interface projects. As part of the Data Management and Migration Delivery Team...Hourly payContract workTemporary workLocal areaFlexible hours$99k - $149k
...Day to Day This role’s primary responsibility is to integrate data from a variety of sources into common data domain models, supporting... ...learning new technologies Demonstrated experience in software engineering fundamentals and coding Salary Range Transparency Tier 1 -...- ...A major job search platform in Oklahoma City seeks a Data Engineer to integrate and model data across operations. This role involves building data pipelines and modifying data lakes while ensuring efficient data processing. Successful candidates will have a strong background...
- ...The Data Engineer for the Financial Operations Group is primarily responsible expediting engineering solutions that enable data-driven decision-making across Life.Church's finance and staff operations functions. This role transforms raw financial and operational data from...Full timeTemporary workWork experience placementInternshipFlexible hours
- ...Bonterra is seeking a Data Engineer I to support conversions, data hygiene, and custom data services. The core responsibilities include running data update jobs, performing data editing in SQL, and ensuring data accuracy across nonprofit datasets. The ideal candidate is...
$83.43k - $222.48k
...Position Summary We're seeking a Sr. Data Engineer to design and implement data pipelines that power analytical capabilities. This hands‑on role requires an understanding of data engineering best practices and the ability to translate business requirements into technical...Hourly payFull timeTemporary workWork experience placementLocal area- ...Design and build secure, reliable data pipelines that turn raw operational sources into usable intelligence. Impact Create the data... ...observability workflows. Work closely with data scientists, software engineers, and mission-facing teams. What we're looking for 5+ years of...
- ...The YouVersion Senior Data Engineer is primarily responsible for shaping, implementing, and maintaining data pipelines and systems that provide quality and reliable data. This role is critical in leading to increased engagement and growth through the Bible App globally...Full timeTemporary workWork experience placementCasual workInternshipLocal areaWorldwideFlexible hours
$145k - $170k
...What You’ll Do Design and implement data models in partnership with product, analytics, and engineering teams so they are scalable, well‑documented, and aligned to business needs. Collaborate with other Data Engineers to solve problems that directly impact enterprise customers...Full timeWork experience placementRemote workHome officeFlexible hours- ...YouVersion Senior Data Engineer The YouVersion Senior Data Engineer is primarily responsible for shaping, implementing, and maintaining data pipelines and systems that provide quality and reliable data. This role is critical in leading to increased engagement and growth...Full timeTemporary workCasual workInternshipLocal areaWorldwide
$120k - $160k
...Job Summary The Senior Data Engineer is responsible for software development and data engineering projects. Identifies and executes tasks in the software development life cycle, reviews and debugs codes. They are responsible for ensuring the quality and functionality of...Work experience placement$150k - $190k
...Advanced Solutions Group of SHI that is building new digital experiences for our internal users, customers and partners. The Senior Data Engineer will be responsible for the analysis, design, and development of solutions focused on data engineering and ETL workflows. As a...Work experience placementWorldwideFlexible hoursShift work- ...What you’ll do: Designs, develops, and optimizes data integration processes, ETL workflows, and data platforms, leading the creation... .... Proficiency in programming languages commonly used in data engineering, such as Python or Java. What We Offer: Comprehensive healthcare...Work experience placement
$150k - $190k
...A leading IT solutions provider in Oklahoma City is seeking a Senior Data Engineer to enhance its data engineering practices. The ideal candidate has over 10 years of experience in data applications and ETL, with expertise in data modeling and SQL. You will drive data...$111k - $177k
...team to drive the next wave of digital transformation. Join us to build, scale, and innovate at the edge. Job Summary The Senior Data Engineer will be responsible for owning the AI and data pipeline for the engineering organization, acting as the subject matter expert...Temporary workLocal areaFlexible hours$160k - $200k
...Datavant is the data collaboration platform trusted for healthcare. Guided by our mission to make the world’s health data secure, accessible... ...seeking a detail-oriented and impact-driven Senior Data Engineer to strengthen our capabilities around reporting, advanced analytics...$65k
Position Overview We are seeking a motivated Data Engineer to join our Data Engineering team. The ideal candidate will have exposure in Python, and SQL, and will be responsible for designing, developing, and maintaining robust data pipelines for structured, semi-structured...Hourly payTemporary workWork at officeRelocation- Data Engineer Location: Onsite in OKC - 5 days/week Employment Type: Direct Hire Work Authorization: Must be authorized to work in the U.S. now and in the future without sponsorship We’re partnering with a client on a direct-hire Data Engineering role supporting a growing...
$106.9k - $176.5k
...take your career wherever you want it to go. Join EY and help to build a better working world. Technology – Data and Decision Science – Data Engineering – Senior We are seeking a highly skilled Senior Consultant Data Engineer with expertise in cloud data engineering...Summer holidayFlexible hours- ...ANAUTICS INC is seeking a Data Engineer in Oklahoma City to design and build secure, reliable data pipelines. The role involves creating data foundations for analytics and implementing secure workflows. Candidates should have over 5 years of data engineering experience...
- ...We are seeking a Data Center L1 Support Engineer to provide onsite rack and stack support within a data center environment. The role involves the physical installation, cabling, labeling, and basic validation of IT infrastructure equipment while coordinating with remote...Remote workRelocation
- 6AM City, LLC is seeking a Data Engineer & Senior Data Engineer to manage data modeling and streamline data movement from disparate systems. Candidates can work remotely or from Boston, MA, or San Mateo, CA. Ideal candidates will have strong SQL and Python skills and experience...Full timeRemote work
- ...A global tech ministry is seeking a Senior Data Engineer to shape and maintain data pipelines and systems that enhance the Bible App's reach. You will collaborate with various technical teams to deliver trusted data and advocate for data governance. The ideal candidate...
- ...A leading travel services company in Oklahoma City is seeking a Data Engineer III. This role involves managing and optimizing data pipelines, providing architectural guidance, and leading data assessment activities. Applicants should have 3-5 years of cloud data warehouse...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Data Engineer. Be the first to apply!
- remote data engineer Oklahoma City, OK
- entry level big data engineer Oklahoma City, OK
- big data devops engineer Oklahoma City, OK
- data engineer Oklahoma City, OK
- software data engineer Oklahoma City, OK
- big data cloud engineer Oklahoma City, OK
- junior big data engineer Oklahoma City, OK
- sr information security engineer Oklahoma City, OK
- director data engineering Oklahoma City, OK
- principal data engineer Oklahoma City, OK

