Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Solutions Applied Data Scientist, Healthcare

Protege

Company Overview We are building Protege to solve the biggest unmet need in AI — getting access to the right training data. The process today is time intensive, incredibly expensive, and often ends in failure. The Protege platform facilitates the secure, efficient, and privacy‑centric exchange of AI training data. Solving AI’s data problem is a generational opportunity. We’re backed by world‑class investors and already powering partnerships with some of the most ambitious teams in AI. The company that succeeds will be one of the largest in AI — and in tech. We’re a lean, fast‑moving, high‑trust team of builders who are obsessed with velocity and impact. Our culture is built for people who thrive on ambiguity, own outcomes, and want to shape the future of data and AI. Role Overview We are hiring a Solutions Applied Data Scientist to help design, construct, and validate complex healthcare data cohorts used for AI model training. This role sits within the delivery organization , working closely with Solutions Leads and delivery engineers to solve complex data challenges that arise during customer projects. Solutions Leads own the customer relationship and overall delivery of projects. The Solutions Applied Data Scientist serves as their technical partner for more complex data problems , including cohort construction, multi‑source dataset assembly, feasibility analysis, and data validation. You will help translate research generated by Protege’s Data Lab and customer requirements into practical dataset definitions, determine whether those requirements can be met with available data, and build the SQL and analysis needed to construct the resulting datasets. You will also collaborate with delivery engineers when solutions require changes to data pipelines, infrastructure, or large‑scale data movement. This is a highly applied role focused on solving real‑world dataset challenges , not research or model development. The ideal candidate is someone who enjoys solving messy real‑world data problems, working directly with large healthcare datasets, writing complex SQL and collaborating closely with cross‑functional teams. Our environment has a lot going on as we grow — so we’re looking for someone energized by and excited by the fast pace of the industry and our company! What You’ll Do Technical Escalation & Delivery Collaboration During delivery projects, Solutions Leads may encounter complex data challenges that require deeper analysis or technical problem‑solving. You will act as a technical partner , helping solve things such as: Complex cohort definitions that require multi‑source joins Linking datasets across different data partners Investigating unexpected gaps or anomalies in delivered data Evaluating whether requested variables or labels exist in available datasets Determining whether a dataset can realistically satisfy model requirements You will work collaboratively with Solutions Leads to unblock delivery challenges while keeping projects moving toward successful completion. When solutions require infrastructure or pipeline changes, you will partner with the Solutions Engineer and internal platform engineering teams to implement the required workflows. Cohort Definition & Dataset Construction Work with Solutions Leads to translate customer requirements into concrete dataset logic. You will help ensure that datasets accurately represent the intended population and meet customer specifications. Responsibilities include: Writing complex SQL queries to construct cohorts Implementing inclusion and exclusion logic Joining datasets across multiple data sources Validating linkage between datasets Identifying and resolving inconsistencies or missing fields Partner with Solutions Leads to resolve complex data questions that arise during project delivery Escalate or collaborate with delivery engineers when dataset construction requires pipeline changes or large‑scale data processing Data Quality Validation & Completeness Analysis Before complex datasets are delivered to customers you will help validate that they meet required standards. You will work closely with Solutions Leads before datasets are delivered to ensure that the datasets meet agreed acceptance criteria. Review bespoke QA methodology and suggest platform improvements to Product and Engineering to decrease custom work across engagements. Responsibilities include: Performing data completeness analysis Investigating missing or anomalous data Verifying cohort logic results Validating row counts and dataset structure Creating summary statistics and validation outputs Data Feasibility Many customer projects involve AI researchers who are defining the healthcare datasets required to train or evaluate models. You will work with these customer teams to translate research goals into practical dataset specifications. Responsibilities include: Reviewing dataset requests from AI researchers and model development teams Helping clarify and refine requirements for model training or evaluation datasets Evaluating whether requested variables or labels exist in available data sources Identifying proxy variables or alternative dataset structures when ideal variables are unavailable Assessing feasibility of requested cohort definitions given real‑world data constraints Explaining data limitations, tradeoffs, and potential biases to technical stakeholders Iterating with researchers to converge on datasets that are both scientifically meaningful and operationally feasible This role requires someone who is comfortable engaging with technically sophisticated stakeholders while grounding conversations in the realities of messy, real‑world data. Data Partner & Source Data Analysis Many datasets originate from external healthcare data partners. You will help analyze partner datasets to: understand schema and field availability assess data quality and completeness identify required transformations evaluate feasibility of cohort logic This work helps ensure that projects are grounded in what data actually exists. Delivery Tooling & Workflow Improvements As delivery patterns emerge, you will help develop tools and reusable workflows that improve efficiency. Examples include: reusable SQL templates for cohort construction automated validation checks scripts for dataset preparation tools that reduce manual delivery work This role is an important bridge between manual dataset delivery and scalable data infrastructure. What Success Looks Like 30 days: Learn the delivery motion and source‑data reality. Build working knowledge of Solutions workflows, healthcare data partners, common cohort patterns, and how complex requests get escalated. Shadow active projects, understand existing QA approaches, and start contributing to scoped feasibility and validation work. 60 days: Own scoped technical escalations and create early leverage Independently support complex cohort‑definition and dataset‑construction work, write and validate SQL / Python workflows, and help Solutions Leads answer hard feasibility questions with clear tradeoffs. 90 days: Become a trusted technical partner across delivery Handle the hardest dataset problems with limited oversight, improve QA and repeatability, and propose workflow or platform improvements that reduce bespoke work across engagements. What You Bring Experience working with large structured healthcare datasets Strong SQL and python skills and experience writing complex queries Experience using Claude Code / Codex Experience joining and transforming large datasets Experience performing data validation and exploratory analysis Strong Python skills for data analysis and scripting Experience working with structured file formats (CSV, Parquet, etc.) Ability to translate ambiguous requirements into concrete data logic Strong communication skills and ability to collaborate with technical and non‑technical stakeholders Protege Values We pass the loved ones' test — integrity isn't negotiable, even when it's costly We always find a way — obstacles are expected, giving up isn't We go fast and grow fast — velocity is a competitive advantage and we treat it that way We practice kindness and candor — hard conversations happen here, and they happen with care We deliver together — no silos, no lone heroes, no passengers We own the outcome — full accountability, continuous improvement, mastery over time #J-18808-Ljbffr

Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Solutions Applied Data Scientist, Healthcare in New York, NY vacancy
  • $124k - $280k

     ...Specialty/Competency: Data, Analytics & AI...  ...and develop robust data solutions for clients. They play...  ...increase in autonomy, you apply sound judgment, recognising...  ...translating ambiguous healthcare challenges into...  ...engineers and other data scientists to deliver efficient,... 
    Suggested
    Full time
    H1b

    PwC

    New York, NY
    1 day ago
  • $158k - $168k

     ...trans-founded with a vision to transform healthcare for every trans life. We hope to make gender...  ...with purpose. About the Role The Senior Data and AI Engineer is a high-impact...  ...transformation models, and BI deliverables. Applied AI (RAG pipelines, MLOps) is a growing area... 
    Suggested
    Full time
    For contractors

    Plume Ltd

    New York, NY
    1 day ago
  •  ...100 countries. With over $14 billion in annual revenue, they are committed to advancing healthcare and empowering patients, providers, and researchers through data-driven solutions. Their mission is to improve health and improve lives by delivering clear and confident... 
    Suggested
    Remote work
    Flexible hours

    Coherent Solutions, Inc.

    New York, NY
    1 day ago
  • $140k - $180k

     ...Senior Solutions Data Engineer (SaaS / E-commerce Integrations) United States (Remote) Salary – $140,000 – $180,000 + Equity + 401K + PTO + Healthcare Insurance Are you a customer-facing Data Engineer or Solutions Engineer with experience delivering SaaS or E-commerce... 
    Suggested
    Remote work

    Rise Technical

    New York, NY
    1 day ago
  • $110k - $140k

     ...Overview Cotiviti is seeking a Data Scientist to lead the development of...  ...and predictive systems for healthcare risk adjustment and ICD-10 code...  ..., driving cutting‑edge AI solutions across the healthcare...  ...to processes and protocols. Applying established protocols in a timely... 
    Suggested
    Work at office

    Cotiviti

    New York, NY
    1 day ago
  • £70k - £90k per year

     ...Job Title: Applied Analytics Engineer Compensation Range: £70,000-£90,000 + 10% bonus...  ...anywhere in the UK. Overview As an Applied Data Scientist at Quid, you will build data and AI-...  ...turning ambiguous needs into scalable solutions. Your work will power key insights and... 
    Remote work

    Quid

    New York, NY
    1 day ago
  • $109.24k - $189.11k

     ...Data Scientist (SME) - Clearance Required Join to apply for the Data Scientist (SME) - Clearance Required role at LMI Overview...  .... LMI is a new breed of digital solutions provider dedicated to...  ...LMI serves the defense, space, healthcare, and energy sectors—helping agencies... 
    Full time
    Contract work
    Remote work

    LMI

    New York, NY
    1 day ago
  •  ...About Solace Healthcare in the U.S. is fundamentally broken...  ...the Role At Solace, data isn't just about...  ...is looking for a Data Scientist who excels at forecasting...  ...Uncover Deep Insights: Apply advanced statistical methods...  .... Productionize Solutions: Work closely with Data... 
    Local area

    SOLACE HEALTH LLC

    New York, NY
    1 day ago
  • $130k - $175k

     ...ophthalmology. Our patient data platform integrates...  ...the intersection of healthcare and technology, including...  ...AMD. Role : Data Scientist, I/II Reports to:...  ...experience focused on applying statistical analyses to...  ...operationalizing your solution ~ Deep knowledge of... 
    Full time

    Character Biosciences

    Jersey City, NJ
    1 day ago
  •  ...Applied Data Scientist-Financial Services We are looking for an Applied Data Scientist for customer-facing projects that combine data science...  ...of machine learning and big data technologies to create solutions for customers' challenges and needs, defining and developing... 
    Work experience placement

    1872 Consulting

    New York, NY
    17 hours ago
  •  ...An established industry player is seeking a Data Scientist I QA to join their innovative Enterprise Data Science Team. This role focuses on applying machine learning solutions to tackle real-world healthcare challenges, utilizing both structured and unstructured data.... 
    Remote work

    Cotiviti

    New York, NY
    1 day ago
  •  ...the forefront of transforming healthcare and enhancing longevity. We...  ...is looking for a Senior Data Scientist, Marketing to join our growing...  ...will focus on building and applying statistical models, experiments...  ...into machine learning solutions, shipping high‑quality work... 
    Hourly pay
    Full time
    Temporary work
    For contractors
    Remote work
    Flexible hours

    Hone Health

    New York, NY
    3 days ago
  •  ..., we are transforming healthcare and improving patient...  ...chronic conditions. The Data Science team is...  ...tremendous—as a Senior Data Scientist, you’ll help build a revolutionary...  ...data-driven solutions that directly improve...  ...enterprise impact. Apply a broad problem-solving... 
    For contractors
    For subcontractor
    Work at office
    Remote work
    Flexible hours

    Clover Health

    New York, NY
    1 day ago
  •  ...John Snow Labs US-Based Healthcare Data Scientist Contract John Snow Labs is an award-winning AI...  ...Snow Labs is the winner of the 2018 AI Solution Provider of the Year Award, the 2019 AI...  ...supportive, collaborative environment. To apply, please include the words 'John Snow... 
    Long term contract
    Full time
    Contract work
    Freelance

    John Snow Labs

    New York, NY
    4 days ago
  • $55 - $72 per hour

     ...analytical and independent Senior Data Scientist to join their team in...  ...modernization initiative within a healthcare environment, utilizing...  ...machine learning concepts or applied modeling. Exposure to A/B testing...  ...independently. Analytical, solution‑oriented mindset.... 
    Hourly pay
    Monday to Friday

    Ledgent Technology

    New York, NY
    47 minutes ago
  • $40 - $70 per hour

     ...Contract Data Scientist Boston or NYC Layer Health was founded in...  ...reduce friction everywhere in healthcare. Our LLM-powered platform is...  ...the Layer Health ML team applying our existing ML workflow to...  ...-accuracy, production-ready solutions across new clinical domains.... 
    Full time
    Contract work
    Work at office
    3 days per week

    Layer Health

    New York, NY
    4 days ago
  • $130k - $196.5k

     ...LiveRamp is the data collaboration platform of choice for the...  ...giants to banks, retailers, and healthcare leaders turn to LiveRamp to...  ...quality and reproducibility. Apply causal inference and bias...  ...end. Partner with product, solutions, and operations teams to translate... 
    Work from home
    Flexible hours
    Night shift

    LiveRamp

    New York, NY
    4 days ago
  •  ...comprehensive, intelligent access to healthcare data on an AI-assisted platform....  ..., the AI Curation Data Scientist will, using traditional...  ...xCures and implementing software solutions compliant with policies and...  ...a list of bullet points. To apply, please send your cover... 
    Work experience placement
    Remote work
    Flexible hours

    xCures

    New York, NY
    1 day ago
  • $190k - $210k

     ...Claritev is seeking a Principal Applied Scientist to lead the research and deployment of advanced AI systems in healthcare. In this hands-on role, you will work closely with...  ...translate cutting-edge research into scalable solutions. The ideal candidate will have a Ph.D. or... 

    RadNet

    New York, NY
    1 day ago
  • $180k - $280k

     ...A leading healthcare technology company is seeking an applied scientist to tackle medical document analysis using advanced techniques in Natural Language Processing...  ...will have a strong background in AI-driven solutions focused on healthcare efficiency and delivering impactful... 
    Remote work

    The Rawlings Group

    New York, NY
    1 day ago
  •  ...About the job Data Engineer - SQL/GCP OUR...  ..., action-oriented solutions to business problems through...  ...of 2,000+ data scientists and analysts who assist...  ...serve the insurance, healthcare, banking, capital markets...  ...institutions are also welcome to apply Strong and in-depth... 

    Inizio Partners

    New York, NY
    2 days ago
  • $160k - $174k

    About Cleerly We’re Cleerly - a healthcare company that’s...  ...-driven precision diagnostic solutions with the goal of helping prevent...  ...objectives. About the Team The BI & Data team at Cleerly provides in-...  ...percent of the qualifications? Apply anyway and help us diversify... 
    Remote work

    Cleerly, LLC

    New York, NY
    2 days ago
  • $100k - $125k

     ...Apply Description About Us: Revive is a dynamic and innovative organization specializing in healthcare delivery and technology. We pride ourselves on delivering...  ...skilled and motivated Data Engineer to join our...  ...with data warehouse/lake solutions (e.g., Azure Synapse). Knowledge... 
    Remote work
    Flexible hours

    CloudDevs

    New York, NY
    1 day ago
  • $120k - $160k

     ...patients, their communities, the healthcare system, families, and society...  ...partners to understand data needs and deliver reliable pipelines...  ...taker, you're part of the solution. A proactive, problem‑solving...  ...financially responsible and applying it fairly, equitably,... 
    Remote work
    Flexible hours

    Empassion Health, Inc.

    New York, NY
    2 days ago
  • $135.6k - $154.8k

     ...Senior Associate, Data Scientist - US Card (Applied GenAI) Data is at the center of everything we do. As a startup, we disrupted the credit card...  ...The Servicing Intelligence team delivers data science solutions to capture value from unstructured, multi-modal data sources... 
    Full time
    Part time
    Local area

    Capital One

    New York, NY
    17 hours ago
  • $146k - $180k

     ...people receive quality mental healthcare? Come join our mission of...  ...We are looking for a Senior Data Engineer to join our Data Science...  ...teams to ensure that data solutions meet business needs and support...  .... We encourage you to apply, even if you don't meet every... 
    Live in
    Work at office
    Local area
    Remote work
    Worldwide
    Flexible hours
    2 days per week
    1 day per week

    Talkspace Remote Therapist Roles

    New York, NY
    17 hours ago
  •  ...Everywhere. The healthcare industry still relies...  ...iteration and getting solutions into the hands of users...  ...Job Title: Senior Data Engineer We're hiring...  ...with engineers, data scientists or operations teams to...  ..., we encourage you to apply. Core Skills & Experience... 
    Work at office

    Verse Medical

    New York, NY
    2 days ago
  •  ...Protege is hiring a Solutions Applied Data Scientist to tackle complex healthcare data challenges for AI model training. This role focuses on creating accurate datasets, collaborating with cross-functional teams, and ensuring data quality. Ideal candidates should have... 

    Protege

    New York, NY
    1 day ago
  • $93k - $112k

     ...Mid-level Applied AI Data Scientist (Tableau Focus) Location: New York, NY (Hybrid - 3 days in-office) | Practice Area: Technology & Engineering...  ...contribute to the development of scalable, data-driven solutions. What You'll Do Apply statistical analysis,... 
    Permanent employment
    Work at office

    Capco

    New York, NY
    17 hours ago
  • $176k - $238k

     ...Senior Data Engineer, Knowledge & Information United States...  ...mission. That's why we built the Healthcare Map — the industry's largest,...  ..., production-grade data solutions. You will accomplish these...  ...volume productization; experience applying AI or agentic workflows to... 
    For contractors
    Work experience placement
    Work at office
    Local area
    Remote work
    Flexible hours

    Komodo Health

    New York, NY
    1 day ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Solutions Applied Data Scientist, Healthcare. Be the first to apply!