Sr Staff Data Scientist, Virtual Biology Initiative
$241k - $331.1kBiohub
Sr Staff Data Scientist, Virtual Biology Initiative
New York, NY (Hybrid); Redwood City, CA (Hybrid)
Biohub is the first large-scale initiative bringing frontier AI models, massive compute, and frontier experimental capabilities under one roof. We're building a general-purpose system to accelerate scientific discovery, integrating frontier AI models, biological foundation models, and lab capabilities, with the ultimate goal of curing disease. Our technology powers scientists around the world, translating AI capabilities into tools that accelerate research everywhere.
The Opportunity
In April 2026, Biohub launched the Virtual Biology Initiative—a $500 million, five-year commitment to galvanize a global effort to build predictive models of the human cell. This initiative will bring together leading institutions to generate the multi-modal biological data, at unprecedented scale, that will power the next generation of AI models for biology while producing datasets of unprecedented size.
Our data science team defines the algorithms and processing approaches that turn raw biological measurements into rich representations models can actually learn from. That includes designing data formats and representations optimized for AI use cases, building cost-aware processing pipelines that balance expressiveness with efficiency, developing scalable QC and validation frameworks across modalities, creating agent-augmented curation tools for metadata extraction and ontology mapping, and building the cross-modal entity resolution and semantic infrastructure that ties it all together.
Both the scale and domain are active research areas. How do you tokenize a cell image? How do you represent a perturbation experiment? How do you combine transcriptomics with imaging in a way that preserves biological meaning? These questions don't have established answers. We need scientific leaders who can work at this frontier: people who understand biological measurement deeply, think creatively about data representations, sampling, and tokenization strategies, and can translate that thinking into data representations that enable novel training architectures.
You'll work directly with scientists, computational biologists, data engineers, and AI researchers to define model input and biological evaluations. You will operate with broad scope and high autonomy, influencing roadmap decisions across teams while mentoring senior individual contributors. Success in this role means creating and implementing data systems that are not only large, but adaptive, interpretable, and scientifically grounded—accelerating progress toward robust biological frontier models and ultimately advancing human health.
What You'll Do
- Set technical vision and strategy for the design of data representations and tokenization strategies across biological data types—including imaging, sequencing, and multimodal data—that enable novel model architectures
- Develop, deploy and validate approaches for combining heterogeneous data modalities into unified training frameworks, designing for robustness to noise, bias, and batch effects
- Evaluate model performance, identifying which biological signals are captured or lost and iterating to improve
- Partner deeply with ML engineers and AI researchers to co-design datasets and optimize model training, evaluation, and generalization
- Lead cross-functional initiatives spanning data engineering, infrastructure, science, and product, aligning technical execution with long-term scientific goals
- Identify and drive new data acquisition and generation opportunities, from consortium partnerships to internal experimental pipelines
- Serve as a technical mentor and leader, raising the bar for data science and ML rigor across the organization
What You'll Bring
- 12+ years of experience (or PhD + 7 years) working with large-scale biological datasets, including ownership of end-to-end data products
- Deep expertise in at least one of: (a) imaging data—microscopy, cell phenotyping, spatial biology, and the data characteristics of image-based biological measurement; or (b) genomics data—bulk and single-cell sequencing, functional genomics, epigenomics, transcriptomics, spatial biology, and/or multi-omics
- Understanding of how to transform raw biological data into AI-ready datasets, including familiarity with scientific best practices, noise characteristics, batch effects, and quality assessment specific to your domain
- Experience with tokenization strategies for non-text data (images, sequences, graphs, time series) or with creating data representations and feature engineering for machine learning in scientific or biological contexts
- Strong expertise in data science and statistical modeling; familiarity with modern ML architectures (transformers, diffusion models, or similar) and how data representation choices affect learning
- Strong computational skills; demonstrated ability to design robust, extensible data architectures
- Excellent communication and leadership skills, with the ability to translate between biology, ML, and engineering audiences and align teams to deliver complex projects
- Creative, first-principles thinking about how to structure data for learning
Compensation
The Redwood City, CA & New York City, NY base pay range for a new hire in this role is $241,000.00 - $331,100.00. New hires are typically hired into the lower portion of the range, enabling employee growth in the range over time. Actual placement in range is based on job-related skills and experience, as evaluated throughout the interview process.
Better Together
As we grow, we're excited to strengthen in-person connections and cultivate a collaborative, team-oriented environment. This role is a hybrid position requiring you to be onsite for at least 60% of the working month, approximately 3 days a week, with specific in-office days determined by the team's manager. The exact schedule will be at the hiring manager's discretion and communicated during the interview process.
Benefits for the Whole You
We're thankful to have an incredible team behind our work. To honor their commitment, we offer a wide range of benefits to support the people who make all we do possible.
- Provides a generous employer match on employee 401(k) contributions to support planning for the future.
- Paid time off to volunteer at an organization of your choice.
- Funding for select family-forming benefits.
- Relocation support for employees who need assistance moving
$190k - $270k
...health treatment. We deliver personalized, virtual care rooted in connection-between... ...meet you. About the Role As a Staff Data Scientist, you'll be seen as a tech lead thought... ...reach out to for brainstorming and new initiatives. Serve as a tech lead for the Data...VirtualFull timeWork at officeLocal area$146.16k - $219.24k
...with high-volume, multi-source data ecosystem and builds trusted,... ...closely with analysts, data scientists, ad ops, product, and source-... ...insights o develop new initiatives to improve business KPIs such... ...Opportunities for both on-site and virtual engagement events. Unique...VirtualSeniorPermanent employment$190k - $270k
...saving behavioral health treatment. We deliver personalized, virtual care rooted in connection-between clients and clinicians,... ...deserve, we'd love to meet you. About the Role As a Staff Data Scientist - Growth and Marketing, you'll be seen as a thought leader among...VirtualFull timeTemporary workWork at officeLocal area$200k - $235k
...LiveRamp is the data collaboration platform of choice for the world's most innovative... ...and privacy requirements. Staff Data Scientist LiveRamp is the data collaboration platform... ...they do. Fun: We host in-person and virtual events such as game nights, happy hours...VirtualWork at officeWork from homeFlexible hoursNight shift$190k - $270k
...saving behavioral health treatment. We deliver personalized, virtual care rooted in connection—between clients and clinicians,... ...they deserve, we'd love to meet you. About the Role As a Staff Data Scientist - Growth and Marketing, you'll be seen as a thought leader among...VirtualFull timeTemporary workWork at officeLocal area$130.2k - $195.3k
...a positive mark on culture. Senior Data Scientist, Causal Science (45539) Overview... ...Opportunities for both on-site and virtual engagement events. Unique opportunities... ...benefits/programs and social impact outreach initiatives, we believe that opportunity, access, resources...VirtualSenior$124k - $186k
...Overview: We are seeking a Senior Data Scientist who is excited to build ML products that... ...Opportunities for both on-site and virtual engagement events. Unique opportunities... ...benefits/programs and social impact outreach initiatives, we believe that opportunity, access,...VirtualSeniorWorldwide$150k - $200k
...About the role: Our data engineering team is looking for... ...executing cross-engineering initiatives and projects, as well as developing... ...with data analysts, data scientists, engineers, and cross-functional... ...is K Health's AI-powered virtual care engine. Esteemed...VirtualSeniorFull timeWork at officeLocal area$130k - $196.5k
...LiveRamp is the data collaboration platform of choice for the world's most innovative... ...mentoring engineers, leading technical initiatives, or coordinating work with vendors and external... ...they do. Fun: We host in-person and virtual events such as game nights, happy hours,...VirtualSeniorWork from homeFlexible hoursNight shift$142.6k - $153.1k
...quality and accessible. With in-person and virtual clinics in multiple states, the company... .... About the Role We’re looking for a Sr. Data Engineer with strong data platform experience... ...use. You will partner closely with data scientists, analysts, and product managers to...VirtualSeniorHourly pay$190k - $250k
...seeking a Senior Software Engineer for our Data Engineering team to support engineering... ...insights. You'll own complex data initiatives from design through implementation while... ...employees & dependents giving access to 24/7 virtual care ~ Fertility & family planning ~...VirtualSeniorWork at officeFlexible hours3 days per week$164.64k - $246.96k
...Overview We are hiring a Senior Lead Data Engineer to build and scale the data... .... Opportunities for both on-site and virtual engagement events. Unique opportunities... ...benefits/programs and social impact outreach initiatives, we believe that opportunity, access,...VirtualSenior$241k - $338k
...the first large-scale initiative bringing frontier AI models... ...frontier AI models, biological foundation models, and... ...Our technology powers scientists around the world,... ...The role is part of the Data Engineering team,... ...biological AI. As a senior / staff data engineer at...SeniorWork at officeWorldwideRelocation package3 days per week- ...as well as in Belfast. For more information, visit DailyPay's Press Center. The Role: DailyPay is seeking a Senior Staff Data Scientist to serve as the technical architect and strategic leader for our Personalization & AI Platform domains. This is one of the most...SeniorTemporary workLocal area
$240k - $249.5k
...Senior Staff Data Scientist At Wonder Data Science, our mission is to build data science and machine learning systems that improve how our marketplace operates, how customers experience the platform, and how the business makes high-quality decisions. As a Senior Staff...SeniorTemporary workWork at office3 days per week- ...About the Role We are seeking a skilled and motivated Staff Data Scientist to join our Fraud & Risk Data Science team. As an advanced-... ...detection and risk management solutions. You will lead technical initiatives, mentor peers, and drive functional productivity and...Remote work
- ...Data Scientist Location: New York On-site | Full-time Compensation: Competitive Our client is a high-performance technology... ...technical feasibility. Project Ownership: Drive data initiatives from initial problem identification through to solution...Full timeWork at officeImmediate start
$195.5k - $218.5k
...Staff Data Scientist OpenX is focused on unleashing the full economic potential of digital media companies. We do this by making digital... ...Identify and lead high-impact, cross-team data science initiatives, such as improving bidding strategies, building new prediction...Work experience placementLocal area- ...Senior Data Scientist, Analytics New York, Hybrid About the Position As a Senior Data Scientist focusing on Analytics, you'll serve as the statistical backbone of Bluefish's Data Science team. You'll own experimentation and causal inference frameworks, produce...SeniorWork at officeImmediate start
- ...stop-shop for all your physical assets. Why We’re Hiring Our Data Science & Analytics team has been a force multiplier and accelerant... ...principles problem solvers who push forward our most important initiatives across product, growth, tech, and operations. Every business...Contract workPart timeFreelanceLocal areaFlexible hours
- Join to apply for the Staff Data Scientist role at CINC Systems Location: Worldwide (Remote/Hybrid) Reports to: TBD Staff Data Scientist... ...exploration and selection to inform downstream AI and analytics initiatives Design experiments and analyses to validate assumptions...Full timeRemote workWorldwide
$212k - $265k
...the decision engine for better mental health outcomes. As a Staff Data Scientist, Product Analytics, you will be a senior technical and... ...build conviction in what “good” looks like. Lead multi-team initiatives. Drive cross-functional work where the right answer is not...Work from homeFlexible hours- Jobgether is seeking a Staff Data Scientist, Marketing in Canada. This key role involves shaping marketing intelligence and strategic decisions... ..., develop Media Mix Models, and lead advanced data science initiatives in partnership with Marketing and Finance teams....
$217k - $303.9k
...content and behaviors across the site. We are looking for a Staff Data Scientist to join our Safety Insights Data Science team. In this role... ...rigorous analyses to size the potential impact of new initiatives and help the organization focus on the highest ROI areas to...Work at officeRemote work- ...work approach, see below and visit . About the Role: As a Staff Data Scientist at Hims & Hers, you are a technical leader and a "force multiplier... ...Take accountability for the full model lifecycle, from the initial data design through to the long-term performance and...Remote jobFull timeLocal areaImmediate startFlexible hours
- ...life-saving behavioral health treatment. We deliver personalized, virtual care rooted in connection—between clients and clinicians, care... ...deserve, we’d love to meet you. About The Role The Analytics & Data Engineering team owns all post-transactional data operations that...VirtualSeniorLocal area
- ...clinicians-because healthcare deserves better software. As a Senior Data Engineer, you'll be hands-on with the backbone of that system:... ..., and vision plans with nationwide coverage, including 24/7 virtual urgent care. Mental Health Support: Weekly therapy reimbursement...VirtualSeniorWork at officeRemote workFlexible hours
$10k
...books. The problems are high-stakes, data-dense, and unforgiving. We hire people... ...5 years of industry experience as a Data Scientist Strong python experience (numpy, pandas... ...pay • Employee Assistance Program and virtual care through Lumino Health United Kingdom...VirtualSeniorFull timeWork at officeHome officeRelocation packageFlexible hours$142.5k - $220.5k
...Sr. Data Platform Engineer - Computer System Validator Your work will change lives. Including... ...clinical-stage TechBio company decoding biology to industrialize drug discovery. Central... ...with biologists, chemists, and data scientists to build relatability and query-ability...SeniorWork at office$169.54k
...Anticipated End Date: 2026-07-15 Position Title: Data Scientist Senior Job Description: Data Scientist Senior Location... ...combines structured office engagement with the autonomy of virtual work, promoting a dynamic and adaptable workplace. Please...VirtualSeniorTemporary workWork experience placementWork at officeLocal areaMonday to Friday2 days per week1 day per week
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Sr Staff Data Scientist, Virtual Biology Initiative. Be the first to apply!
- assistant scientist New York, NY
- python data scientist (contract) New York, NY
- senior data scientist New York, NY
- energy data scientist New York, NY
- part time data scientist New York, NY
- python data scientist New York, NY
- data scientist New York, NY
- principal data scientist New York, NY
- junior data scientist New York, NY
- entry level data scientist remote New York, NY


