Sign up to access all features of our service.
  • Job search
  • Favorites
  • Create a CV
    New
  • Salaries
  • Subscriptions

Sr Staff Data Scientist, Virtual Biology Initiative

$241k - $331.1k

Biohub

Sr Staff Data Scientist, Virtual Biology Initiative

New York, NY (Hybrid); Redwood City, CA (Hybrid)

Biohub is the first large-scale initiative bringing frontier AI models, massive compute, and frontier experimental capabilities under one roof. We're building a general-purpose system to accelerate scientific discovery, integrating frontier AI models, biological foundation models, and lab capabilities, with the ultimate goal of curing disease. Our technology powers scientists around the world, translating AI capabilities into tools that accelerate research everywhere.

The Opportunity

In April 2026, Biohub launched the Virtual Biology Initiative—a $500 million, five-year commitment to galvanize a global effort to build predictive models of the human cell. This initiative will bring together leading institutions to generate the multi-modal biological data, at unprecedented scale, that will power the next generation of AI models for biology while producing datasets of unprecedented size.

Our data science team defines the algorithms and processing approaches that turn raw biological measurements into rich representations models can actually learn from. That includes designing data formats and representations optimized for AI use cases, building cost-aware processing pipelines that balance expressiveness with efficiency, developing scalable QC and validation frameworks across modalities, creating agent-augmented curation tools for metadata extraction and ontology mapping, and building the cross-modal entity resolution and semantic infrastructure that ties it all together.

Both the scale and domain are active research areas. How do you tokenize a cell image? How do you represent a perturbation experiment? How do you combine transcriptomics with imaging in a way that preserves biological meaning? These questions don't have established answers. We need scientific leaders who can work at this frontier: people who understand biological measurement deeply, think creatively about data representations, sampling, and tokenization strategies, and can translate that thinking into data representations that enable novel training architectures.

You'll work directly with scientists, computational biologists, data engineers, and AI researchers to define model input and biological evaluations. You will operate with broad scope and high autonomy, influencing roadmap decisions across teams while mentoring senior individual contributors. Success in this role means creating and implementing data systems that are not only large, but adaptive, interpretable, and scientifically grounded—accelerating progress toward robust biological frontier models and ultimately advancing human health.

What You'll Do
  • Set technical vision and strategy for the design of data representations and tokenization strategies across biological data types—including imaging, sequencing, and multimodal data—that enable novel model architectures
  • Develop, deploy and validate approaches for combining heterogeneous data modalities into unified training frameworks, designing for robustness to noise, bias, and batch effects
  • Evaluate model performance, identifying which biological signals are captured or lost and iterating to improve
  • Partner deeply with ML engineers and AI researchers to co-design datasets and optimize model training, evaluation, and generalization
  • Lead cross-functional initiatives spanning data engineering, infrastructure, science, and product, aligning technical execution with long-term scientific goals
  • Identify and drive new data acquisition and generation opportunities, from consortium partnerships to internal experimental pipelines
  • Serve as a technical mentor and leader, raising the bar for data science and ML rigor across the organization
What You'll Bring
  • 12+ years of experience (or PhD + 7 years) working with large-scale biological datasets, including ownership of end-to-end data products
  • Deep expertise in at least one of: (a) imaging data—microscopy, cell phenotyping, spatial biology, and the data characteristics of image-based biological measurement; or (b) genomics data—bulk and single-cell sequencing, functional genomics, epigenomics, transcriptomics, spatial biology, and/or multi-omics
  • Understanding of how to transform raw biological data into AI-ready datasets, including familiarity with scientific best practices, noise characteristics, batch effects, and quality assessment specific to your domain
  • Experience with tokenization strategies for non-text data (images, sequences, graphs, time series) or with creating data representations and feature engineering for machine learning in scientific or biological contexts
  • Strong expertise in data science and statistical modeling; familiarity with modern ML architectures (transformers, diffusion models, or similar) and how data representation choices affect learning
  • Strong computational skills; demonstrated ability to design robust, extensible data architectures
  • Excellent communication and leadership skills, with the ability to translate between biology, ML, and engineering audiences and align teams to deliver complex projects
  • Creative, first-principles thinking about how to structure data for learning
Compensation

The Redwood City, CA & New York City, NY base pay range for a new hire in this role is $241,000.00 - $331,100.00. New hires are typically hired into the lower portion of the range, enabling employee growth in the range over time. Actual placement in range is based on job-related skills and experience, as evaluated throughout the interview process.

Better Together

As we grow, we're excited to strengthen in-person connections and cultivate a collaborative, team-oriented environment. This role is a hybrid position requiring you to be onsite for at least 60% of the working month, approximately 3 days a week, with specific in-office days determined by the team's manager. The exact schedule will be at the hiring manager's discretion and communicated during the interview process.

Benefits for the Whole You

We're thankful to have an incredible team behind our work. To honor their commitment, we offer a wide range of benefits to support the people who make all we do possible.

  • Provides a generous employer match on employee 401(k) contributions to support planning for the future.
  • Paid time off to volunteer at an organization of your choice.
  • Funding for select family-forming benefits.
  • Relocation support for employees who need assistance moving
Vacancy posted 1 day ago
Similar jobs that could be interesting for youBased on the Sr Staff Data Scientist, Virtual Biology Initiative in New York, NY vacancy
  • $190k - $270k

     ...health treatment. We deliver personalized, virtual care rooted in connection-between...  ...meet you. About the Role As a Staff Data Scientist, you'll be seen as a tech lead thought...  ...reach out to for brainstorming and new initiatives. Serve as a tech lead for the Data... 
    Virtual
    Full time
    Work at office
    Local area

    Charlie Health Outreach

    New York, NY
    2 days ago
  • $190k - $270k

     ...saving behavioral health treatment. We deliver personalized, virtual care rooted in connection-between clients and clinicians,...  ...deserve, we'd love to meet you. About the Role As a Staff Data Scientist - Growth and Marketing, you'll be seen as a thought leader among... 
    Virtual
    Full time
    Temporary work
    Work at office
    Local area

    Charlie Health Outreach

    New York, NY
    7 hours ago
  • $200k - $235k

     ...LiveRamp is the data collaboration platform of choice for the world's most innovative...  ...and privacy requirements. Staff Data Scientist LiveRamp is the data collaboration platform...  ...they do. Fun: We host in-person and virtual events such as game nights, happy hours... 
    Virtual
    Work at office
    Work from home
    Flexible hours
    Night shift

    LiveRamp

    New York, NY
    3 days ago
  •  ...Sr. Data Engineers 1+ year contract Hybrid 2-3 days a week onsite in New York, NY Interview process will be a single round 1-hour long virtual IV on Zoom Requirements: Minimum of 5-7+ years of applicable Data Engineering experience... 
    Virtual
    Senior
    Contract work
    2 days per week
    3 days per week

    Saxon Global

    New York, NY
    4 days ago
  • $190k - $270k

     ...saving behavioral health treatment. We deliver personalized, virtual care rooted in connection—between clients and clinicians,...  ...they deserve, we'd love to meet you. About the Role As a Staff Data Scientist - Growth and Marketing, you'll be seen as a thought leader among... 
    Virtual
    Full time
    Temporary work
    Work at office
    Local area

    Charlie Health

    New York, NY
    2 days ago
  • $124k - $175k

     ...a positive mark on culture. Senior Data Scientist, Causal Science (45539) Overview...  ...Opportunities for both on-site and virtual engagement events. Unique opportunities...  ...benefits/programs and social impact outreach initiatives, we believe that opportunity, access, resources... 
    Virtual
    Senior

    Paramount

    New York, NY
    2 days ago
  •  ...BDIPlus is seeking a Senior Data Engineer to support a Fortune 100 financial services client’s real-time intelligence initiatives. In this role, You’ll work in a virtualization-first architecture using Denodo , building both virtual and physical data products that are... 
    Virtual
    Senior

    VALID8 Financial

    New York, NY
    2 days ago
  • $124k - $186k

     ...Overview: We are seeking a Senior Data Scientist who is excited to build ML products that...  ...Opportunities for both on-site and virtual engagement events. Unique opportunities...  ...benefits/programs and social impact outreach initiatives, we believe that opportunity, access,... 
    Virtual
    Senior
    Worldwide

    Paramount

    New York, NY
    7 hours ago
  • $150k - $200k

     ...Senior Data Engineer New York, NY About The Role Our...  ...executing cross-engineering initiatives and projects, as well as developing...  ...with data analysts, data scientists, engineers, and cross-functional...  ...is K Health's AI-powered virtual care engine. Esteemed health... 
    Virtual
    Senior
    Full time
    Local area

    K Health

    New York, NY
    3 days ago
  • $150k - $180k

     ...Gratitude Hone has been fully virtual from day one and will...  ...leadership—the ability to drive initiatives forward while remaining excited...  ...Role Hone is looking for a Sr Data Engineer to join our team....  ...with Analytics Engineers, Data Scientists, Analysts and Software... 
    Virtual
    Senior
    Full time
    Temporary work
    Part time
    For contractors
    Remote work
    Flexible hours

    Hone Health

    New York, NY
    7 hours ago
  • $180k - $250k

     ...Senior Software Engineer, Data Engineering About Atria Atria is a membership‑based preventive...  ...insights. You’ll own complex data initiatives from design through implementation while...  ...& dependents giving access to 24/7 virtual care Fertility & family planning Company... 
    Virtual
    Senior
    Remote work
    Flexible hours

    Atria Health and Research Institute

    New York, NY
    7 hours ago
  • $156.8k - $235.2k

     ...Overview We are hiring a Senior Lead Data Engineer to build and scale the data...  .... Opportunities for both on-site and virtual engagement events. Unique opportunities...  ...benefits/programs and social impact outreach initiatives, we believe that opportunity, access,... 
    Virtual
    Senior

    Paramount

    New York, NY
    4 days ago
  •  ...as well as in Belfast. For more information, visit DailyPay's Press Center. The Role: DailyPay is seeking a Senior Staff Data Scientist to serve as the technical architect and strategic leader for our Personalization & AI Platform domains. This is one of the most... 
    Senior
    Temporary work
    Local area

    DailyPay

    New York, NY
    4 days ago
  • $132k - $264k

     ...What you'll do... Role summary: Walmart is seeking a Staff Data Scientist to leverage advanced analytics and machine learning techniques...  ...business domains and databases to support analytics initiatives. Perform data quality assessments and ensure data suitability... 
    Full time
    Temporary work
    Part time

    Walmart

    Hoboken, NJ
    7 hours ago
  •  ...Data Scientist Location:  New York On-site | Full-time Compensation: Competitive Our client is a high-performance technology...  ...technical feasibility. Project Ownership: Drive data initiatives from initial problem identification through to solution... 
    Full time
    Work at office
    Immediate start

    MLabs

    New York, NY
    1 day ago
  • $212k - $265k

     ...decision engine for better mental health outcomes. As a Staff Data Scientist, Product Analytics , you will be a senior technical and...  ...build conviction in what "good" looks like. Lead multi-team initiatives. Drive cross-functional work where the right answer is not... 
    Work from home
    Flexible hours

    Headway - Design & Development

    New York, NY
    1 day ago
  • $195.5k - $218.5k

     ...Staff Data Scientist OpenX is focused on unleashing the full economic potential of digital media companies. We do this by making digital...  ...Identify and lead high-impact, cross-team data science initiatives, such as improving bidding strategies, building new prediction... 
    Work experience placement
    Local area

    OpenX

    New York, NY
    3 days ago
  • $238k - $302k

     ...achieving Waymo's ambitious goals. In this role, you will lead key initiatives for measuring the quality and trustworthiness of the behavior...  ...experience, or 7+ years of industry experience solving data science problems Solid statistical background. Expertise using... 
    Full time
    Remote work

    Waymo

    New York, NY
    19 days ago
  •  ...develop platform-level solutions to promote security-related initiatives and improvements. - Review source code for potential security...  ...Platform SSO - Okta Servers - Windows and Linux, VMware Virtual Machines and Cloud Device Management - AzureAD, Carbon Black... 
    Virtual
    Senior
    Full time
    Remote work
    Home office

    Stack Overflow

    New York, NY
    10 days ago
  • $207k - $300k

    Staff Data Scientist, Research, Search Health Mountain View, CA, USA; New York, NY, USA; San Francisco, CA, USA; and additional locations. Advanced...  ...sources, product headroom analysis for key AI features. Initiate projects inside and across the organization and drive to... 
    Full time
    Work experience placement

    Google Inc.

    New York, NY
    2 days ago
  • Google Inc. is seeking a Staff Data Scientist in New York to lead initiatives in data analysis and machine learning. The successful candidate will have a master's degree in a quantitative field and at least 8 years of relevant experience. Responsibilities include leading... 

    Google Inc.

    New York, NY
    2 days ago
  • $212k - $265k

     ...Staff Data Scientist - Finance & Accounting New York, New York, United States; San Francisco, California, United States; Seattle, Washington...  ...loss so they can be translated into cross-functional initiatives. Build forecasts that inform staffing decisions and help... 
    Work from home
    Flexible hours

    Headway - Design & Development

    New York, NY
    3 days ago
  • $124k - $175k

     ...Senior Data Scientist We're on a mission to unleash the power of content… you in? We've got the brands, we've got the stars, we've got...  ...Paramount's most dynamic teams. Opportunities for both on-site and virtual engagement events. Unique opportunities to make meaningful... 
    Virtual
    Senior

    Paramount Global Services

    New York, NY
    4 days ago
  • We are seeking a motivated scientist to join the Antibody Discovery team and support early-stage antibody screening initiatives. This individual will contribute to critical discovery...  ...plates. The role will also support biologics inventory and registration activities within... 
    Local area

    Kaztronix LLC

    New York, NY
    5 days ago
  • $150k - $200k

     ...expertise, and 200+ petabytes of multimodal data linked to patient outcomes, so we can...  ...energized by complexity, and want to apply AI, biology, or both to redefine the future of drug...  ...Medicine We are hiring specialized scientists to accelerate development of our Oncology... 
    Senior
    Work at office
    Shift work
    3 days per week

    Pathos

    New York, NY
    5 days ago
  •  ...unit or location. Position: Principal Data Scientist Location: REMOTE Remote Status: Remote...  ...data science and AI roadmap, sequencing initiatives based on business value, technical feasibility...  ...in person, via phone, and through virtual collaboration tools. Occasional... 
    Virtual
    Weekly pay
    Full time
    Temporary work
    Work at office
    Local area
    Immediate start
    Remote work

    Munchsupply

    New York, NY
    7 hours ago
  •  ...clinicians-because healthcare deserves better software. As a Senior Data Engineer, you'll be hands-on with the backbone of that system:...  ..., and vision plans with nationwide coverage, including 24/7 virtual urgent care. Mental Health Support: Weekly therapy reimbursement... 
    Virtual
    Senior
    Work at office
    Remote work
    Flexible hours

    Camber

    New York, NY
    7 hours ago
  • $142.5k - $220.5k

     ...Sr. Data Platform Engineer - Computer System Validator Your work will change lives. Including...  ...clinical-stage TechBio company decoding biology to industrialize drug discovery. Central...  ...with biologists, chemists, and data scientists to build relatability and query-ability... 
    Senior
    Work at office

    Recursion Pharmaceuticals

    New York, NY
    4 days ago
  •  ...life-saving behavioral health treatment. We deliver personalized, virtual care rooted in connection—between clients and clinicians, care...  ...deserve, we’d love to meet you. About The Role The Analytics & Data Engineering team owns all post-transactional data operations that... 
    Virtual
    Senior
    Local area

    Charlie Health

    New York, NY
    4 days ago
  • $130k - $196.5k

     ...LiveRamp is the data collaboration platform of choice for the world's most innovative companies. A groundbreaking leader in consumer...  ...friendly people who love what they do. Fun: We host in-person and virtual events such as game nights, happy hours, camping trips, and... 
    Virtual
    Senior
    Work from home
    Flexible hours
    Night shift

    LiveRamp

    New York, NY
    3 days ago

Do you want to receive more vacancies?

Subscribe and receive similar vacancies to Sr Staff Data Scientist, Virtual Biology Initiative. Be the first to apply!