AI Curation Data Scientist
Recruit Group
What You’ll Do Develop and optimize software pipelines for extracting and integrating structured and unstructured healthcare data Build and maintain AI/ML workflows for data classification, normalization, and analysis Train, fine‑tune, and evaluate large language models and embedding‑based systems Curate and validate high‑quality datasets used for LLM training and model improvement Work with complex healthcare data formats including XML, JSON, FHIR, and C‑CDA Implement de‑identification strategies and ensure compliance with PHI/PII handling policies Design and execute data quality assessments, validation frameworks, and automated testing processes Collaborate cross‑functionally with engineering and product teams to improve scalability and system performance Contribute to code repositories, testing infrastructure, and deployment best practices Explore emerging AI methodologies and rapidly prototype innovative solutions in a highly iterative environment What You’ll Need Required Qualifications Master’s degree or equivalent experience in Computer Science, Software Engineering, Statistics, Biology, or a related field 5+ years of hands‑on experience in AI/ML engineering, data science, software development, or predictive analytics Strong experience training and tuning transformer models and LLMs Significant experience curating datasets for AI model training Advanced Python development experience, including building extraction, classification, or NLP tools Hands‑on experience with embeddings models, sentence transformers, and modern LLM tooling Strong experience parsing and processing complex data formats such as XML and JSON Familiarity with healthcare interoperability standards such as FHIR and/or C‑CDA Experience with TensorFlow, PyTorch, scikit‑learn, or similar ML frameworks Proficiency with Git and software development best practices Experience developing unit and integration tests for scientific or healthcare‑focused applications Strong communication skills and ability to collaborate effectively within remote teams A proactive, solutions‑oriented mindset with a passion for building high‑impact products Preferred Qualifications Deep understanding of regex and advanced text‑processing techniques Experience with Unix command‑line tooling such as jq, xq, sed, and bash scripting Strong AWS experience, particularly around data storage and AI training infrastructure tradeoffs Experience working with HIPAA, PHI/PII handling, and healthcare de‑identification strategies Experience extending or customizing open‑source AI tooling Familiarity with AI‑assisted coding workflows and tools such as GitHub Copilot, Claude Code, or similar platforms Experience working across multiple programming languages and distributed technical teams Why This Role Opportunity to build AI systems that directly improve healthcare outcomes Work alongside experienced experts in AI, software systems, molecular biology, and clinical medicine High‑impact role within a fast‑growing and mission‑driven environment Exposure to cutting‑edge challenges in healthcare interoperability, AI model training, and clinical data engineering Collaborative culture that values innovation, ownership, and technical excellence Fully remote flexibility with meaningful opportunities for growth and technical leadership Let’s Talk If you’re excited by the opportunity to apply advanced AI and machine learning techniques to real‑world healthcare challenges — while working with a highly talented and mission‑driven team — we’d love to connect. #J-18808-Ljbffr
- ...care via comprehensive, intelligent access to healthcare data on an AI-assisted platform. Delivered using a Software as a Service... ...About the role: Reporting to the VP of Data Science, the AI Curation Data Scientist will, using traditional computing and custom AI model...SuggestedWork experience placementRemote workFlexible hours
- ...Recruit Group is seeking an AI/ML Engineer to optimize software pipelines for healthcare data. This role involves building AI workflows, curating datasets, and collaborating with engineers and product teams. Candidates should have a Master's degree and 5+ years in AI/ML...SuggestedRemote work
$150k - $200k
...An innovative healthcare tech firm in the United States is seeking an AI Curation Data Scientist. This remote role focuses on enhancing health data processing and analysis through software and AI model development. Ideal candidates will have a Ph.D. and at least 10 years...SuggestedRemote work$185k - $220k
Staff Software Engineer, Reporting Data Curation San Ramon, CA; New York, NY Are you passionate about designing large-scale reporting systems... ...insights. We embrace tools like GitHub Copilot and Cursor AI to accelerate development and reduce friction, allowing engineers...SuggestedH1bWork at officeWork visa$144k - $192k
...hidden within petabytes of multimodal sensor data. Our next-generation autonomous driving... ...training analysis, error diagnosis, and dataset curation. What You’ll Do Build and train machine... ...learning, sensor fusion, or embodied AI. Experience building active learning loops...SuggestedRemote work$180k - $230k
...a fully configurable and content‐rich, AI‐powered platform along with best‐in‐class... ...improvements to be made. We are looking for a Data Scientist to advance our models further. In... ...understanding past outcomes. Model Advancement: Curate labeled data, hypothesize new features,...Flexible hours- ...an exciting opportunity for a Principal Data Scientist to join the USS program, supporting both... ...innovative, secure, and mission focused AI solutions in support of USS objectives.... ...situation. Contribute and lead the creation, curation, and promotion of playbooks, best...Minimum wageContract workTemporary workWork experience placementRemote work
$90k - $115k
...Have Technical/Functional Skills Data Scientist to build a Predictive Benefit... ...datasets sourced from on prem SQL DW and curated in Microsoft Fabric / OneLake • Apply... ...mandatory) • Experience with explainable AI (XAI) techniques • Prior exposure to Microsoft...- ...Products (HCAP) team as a research associate. Members of HCAP are curators of employee data and leverage it to help answer strategic and operational... ...with confidential information. Interest in generative AI and a desire to use it to improve and reimagine tools,...Work experience placement
$96k - $128k
...of the largest clinical trial data sets in the industry. More than... ...About the Team : Medidata AI organization is building... ...You are an experienced Data Scientist who will design, implement, and... ...learning pipelines from data curation, processing, model building, model...Hourly payWork at officeLocal areaFlexible hours$200k - $250k
...About Crosby AI Crosby is an AI-first legal platform reimagining corporate legal... ...we build. The Role As a Data Scientist at Crosby, you'll play a critical role in... ...product teams to define labeling schemas, curate high-quality datasets, and improve data...$98k - $164k
...Forward-Deployed Data Scientist II New York City At Braze, we have found our people. We'... ...platforms, and a yearly learning stipend A curated in-office employee experience, designed... ...marketers to combine and activate AI agents, models, and features at every touchpoint...Work at officeFlexible hours- ...Data Scientist, AI Data Foundations About the Role Reporting into the Data Engineering organization, the Data Scientist is responsible for designing and building the curated data structures that AI and ML applications consume across MeridianLink. You will own the vector...
$150k - $170k
...which makes everything possible.The Senior Data Scientist, Diagnostics Platform supports the... ...of large language models (LLMs), agentic AI, and image AI solutions. In this role, you... ...discovery and development, and support dataset curation across text, tabular, device logs, lab...Remote workWork from homeWorldwideFlexible hours- ...Senior Data Scientist: AI Training Data (2-4 Months Contract) Company: BespokeLabs (VC-backed; founded by IIT & Ivy League alumni) Location:... ...applied statistics to develop the algorithms and logic that curate and evaluate datasets for advanced AI model training. This is...Hourly payFull timeContract workRemote work
$150k - $250k
...Data Scientist Build and optimize the ML pipeline behind Stickerbox, an AI-powered voice-to-sticker printer for kids. Hapiko is a Brooklyn-based company building the... ...day. Model Training & Data - Build and curate large-scale image datasets for training custom...Work at officeWork from homeFlexible hours$158k - $187k
...analytics and artificial intelligence (AI) to transform how a leading law firm delivers... ...business performance? As an Innovation Data Scientist within Innovation Engineering at... ...you'll lead the full solution lifecycle-curating and governing datasets, defining success...Contract workWorldwideFlexible hours$82.35k - $120.48k
...and passionate Junior Medical Data Annotator to join our Data group... ...heart health management with AI. We are seeking someone... ...intensively with product, data scientist and engineers and provide domain... ...and coding procedures for data curation and database modeling Work with...Full timeWork experience placementWork at office2 days per week- ...Description Insight Global is looking for a Data Engineer to join a dedicated team... ...alerting at scale · Collaborate with AI Scientists and MLOps teams to build data pipelines... ...data analysts and product teams to ensure curated, reliable data is available for downstream...Remote work
$120k - $160k
...Tech unicorn hiring Senior Data Engineer / Generous pay + benefits This Jobot Job is hosted... ...the data lifecycle—from ingestion through curated data marts. Why join us? Annual... ...for this job, you agree to receive calls, AI-generated calls, text messages, or emails...Local areaRemote work$140k - $240k
...Overview The Data Engineer, Mortgage Servicing on the Nebula team acts as the mortgage... ...that powers analytics, reporting, AI development, and operational decision-making... ...optimize data models, warehouse schemas, and curated datasets for analytics and BI use cases...Local areaRemote workFlexible hours$150k - $200k
...organization is looking for a Data Engineer to join the team and... ...generation of data infrastructure and AI-enabled workflows. In this... ...pipelines to clean, curate, and prepare data for analytics... ...internal development teams, data scientists, and stakeholders to understand...Work at officeRemote work- ...Hours) | ContractWe are seeking a Senior Data Engineer to join our team, focusing on enterprise... ...Data Sciences & Analytics, Modern AI/ML Model Monetization, Cloud Engineering,... ...in higher qualified project teams, custom curated for our client's specific technical & functional...Remote work
$176k - $238k
...Senior Data Engineer, Knowledge & Information United States We Breathe Life Into Data... ...Map, analytics products, and downstream AI/ML-enabled use cases. This is a hands-on engineering... ...(HITL) pipelines for data extraction and curation. Transform healthcare claims, EHR, non-...For contractorsWork experience placementWork at officeLocal areaRemote workFlexible hours$113k - $160k
...confidence. Job Description Senior Data Engineer AI-First Data Strategy for P&C Insurance... .... Define, develop, and register curated, reusable data products for lines of... ...Collaborate with data analysts, data scientists, actuaries, underwriters, claims...Work at officeLocal area- ...Senior Data Engineer About Us At Tavus, we're building the human layer of AI. Our mission is to make human-AI interaction as natural as face-to-face interaction, enabling... ...our entire data strategy, from sourcing and curating to structuring and optimizing, ensuring our...Flexible hours
- ...% within two days. The Senior Data Engineer designs, builds, and... ...that power BI, data science, and AI capabilities. You will own the... .... Collaborate with Data Scientists and ML/AI Engineers to build and... ...maintain feature pipelines and curated Gold layer datasets. Support new...Weekly payFull timeTemporary workWork at officeLocal areaImmediate startRemote work
- ...The Data Engineer is responsible for supporting the development, maintenance, and optimization... ...views) and SQL transformations to produce curated, analytics-ready datasets. Collaborate... ...implementations. Enable data for AI/ML use cases by preparing feature-rich datasets...Work at officeRemote work
$135k - $145k
...POSITION SUMMARY The Data Engineer will be responsible for designing... ...building, and supporting enterprise-wide AI and automation initiatives that improve and... ...REPORTING AND ANALYTICS ENABLEMENT: Curate and publish high-quality datasets for analytics...Work experience placementSummer workWork at officeRemote workFlexible hours$500 per week
...an opportunity to join our dynamic team of data professionals working together managing... ...frameworks Collaborate with BI teams to prepare curated data models and tables for reporting in... ...Microsoft certification(s) in the Data & AI solutions field are a plus Microsoft Certified...Temporary workRemote workWork from home
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to AI Curation Data Scientist. Be the first to apply!
- ai data scientist New York, NY
- ai scientist New York, NY
- entry level data scientist New York, NY
- associate data scientist New York, NY
- junior data scientist remote New York, NY
- junior data scientist New York, NY
- data scientist machine learning engineer New York, NY
- entry level data scientist remote New York, NY
- data scientist New York, NY
- data scientist (hedge fund) New York, NY

