Sr Staff Data Scientist, Virtual Biology Initiative
$241k - $331.1kBiohub
Sr Staff Data Scientist, Virtual Biology Initiative, AI Research
New York, NY (Hybrid); Redwood City, CA (Hybrid)
Biohub is the first large-scale initiative bringing frontier AI models, massive compute, and frontier experimental capabilities under one roof. We're building a general-purpose system to accelerate scientific discovery, integrating frontier AI models, biological foundation models, and lab capabilities, with the ultimate goal of curing disease. Our technology powers scientists around the world, translating AI capabilities into tools that accelerate research everywhere.
The Opportunity
In April 2026, Biohub launched the Virtual Biology Initiative—a $500 million, five-year commitment to galvanize a global effort to build predictive models of the human cell. This initiative will bring together leading institutions to generate the multi-modal biological data, at unprecedented scale, that will power the next generation of AI models for biology while producing datasets of unprecedented size.
Our data science team defines the algorithms and processing approaches that turn raw biological measurements into rich representations models can actually learn from. That includes designing data formats and representations optimized for AI use cases, building cost-aware processing pipelines that balance expressiveness with efficiency, developing scalable QC and validation frameworks across modalities, creating agent-augmented curation tools for metadata extraction and ontology mapping, and building the cross-modal entity resolution and semantic infrastructure that ties it all together.
Both the scale and domain are active research areas. How do you tokenize a cell image? How do you represent a perturbation experiment? How do you combine transcriptomics with imaging in a way that preserves biological meaning? These questions don't have established answers. We need scientific leaders who can work at this frontier: people who understand biological measurement deeply, think creatively about data representations, sampling, and tokenization strategies, and can translate that thinking into data representations that enable novel training architectures.
You'll work directly with scientists, computational biologists, data engineers, and AI researchers to define model input and biological evaluations. You will operate with broad scope and high autonomy, influencing roadmap decisions across teams while mentoring senior individual contributors. Success in this role means creating and implementing data systems that are not only large, but adaptive, interpretable, and scientifically grounded—accelerating progress toward robust biological frontier models and ultimately advancing human health.
What You'll Do
- Set technical vision and strategy for the design of data representations and tokenization strategies across biological data types—including imaging, sequencing, and multimodal data—that enable novel model architectures
- Develop, deploy and validate approaches for combining heterogeneous data modalities into unified training frameworks, designing for robustness to noise, bias, and batch effects
- Evaluate model performance, identifying which biological signals are captured or lost and iterating to improve
- Partner deeply with ML engineers and AI researchers to co-design datasets and optimize model training, evaluation, and generalization
- Lead cross-functional initiatives spanning data engineering, infrastructure, science, and product, aligning technical execution with long-term scientific goals
- Identify and drive new data acquisition and generation opportunities, from consortium partnerships to internal experimental pipelines
- Serve as a technical mentor and leader, raising the bar for data science and ML rigor across the organization
What You'll Bring
- 12+ years of experience (or PhD + 7 years) working with large-scale biological datasets, including ownership of end-to-end data products
- Deep expertise in at least one of: (a) imaging data—microscopy, cell phenotyping, spatial biology, and the data characteristics of image-based biological measurement; or (b) genomics data—bulk and single-cell sequencing, functional genomics, epigenomics, transcriptomics, spatial biology, and/or multi-omics
- Understanding of how to transform raw biological data into AI-ready datasets, including familiarity with scientific best practices, noise characteristics, batch effects, and quality assessment specific to your domain
- Experience with tokenization strategies for non-text data (images, sequences, graphs, time series) or with creating data representations and feature engineering for machine learning in scientific or biological contexts
- Strong expertise in data science and statistical modeling; familiarity with modern ML architectures (transformers, diffusion models, or similar) and how data representation choices affect learning
- Strong computational skills; demonstrated ability to design robust, extensible data architectures
- Excellent communication and leadership skills, with the ability to translate between biology, ML, and engineering audiences and align teams to deliver complex projects
- Creative, first-principles thinking about how to structure data for learning
Compensation
The Redwood City, CA & New York City, NY base pay range for a new hire in this role is $241,000.00 - $331,100.00. New hires are typically hired into the lower portion of the range, enabling employee growth in the range over time. Actual placement in range is based on job-related skills and experience, as evaluated throughout the interview process.
Better Together
As we grow, we're excited to strengthen in-person connections and cultivate a collaborative, team-oriented environment. This role is a hybrid position requiring you to be onsite for at least 60% of the working month, approximately 3 days a week, with specific in-office days determined by the team's manager. The exact schedule will be at the hiring manager's discretion and communicated during the interview process.
Benefits for the Whole You
We're thankful to have an incredible team behind our work. To honor their commitment, we offer a wide range of benefits to support the people who make all we do possible.
- Provides a generous employer match on employee 401(k) contributions to support planning for the future.
- Paid time off to volunteer at an organization of your choice.
- Funding for select family-forming benefits.
- Relocation support for employees who need assistance moving
If you're interested in a role but your previous experience doesn't perfectly align with each qualification in the job description, we still encourage you to apply as you may be the perfect fit for this or another role.
$200k - $270k
...it easy to find and book in-person or virtual care in all 50 states, across +200 specialties... ...important asset is our people. As a Staff Data Scientist, Marketplace, you'll play a meaningful... ...the impact of product and commercial initiatives Partnering with Engineering to...VirtualFlexible hours$190k - $270k
...health treatment. We deliver personalized, virtual care rooted in connection-between... ...meet you. About the Role As a Staff Data Scientist, you'll be seen as a tech lead thought... ...reach out to for brainstorming and new initiatives. Serve as a tech lead for the Data...VirtualFull timeWork at officeLocal area$190k - $270k
...saving behavioral health treatment. We deliver personalized, virtual care rooted in connection-between clients and clinicians,... ...deserve, we'd love to meet you. About the Role As a Staff Data Scientist - Growth and Marketing, you'll be seen as a thought leader among...VirtualFull timeTemporary workWork at officeLocal area$200k - $235k
...LiveRamp is the data collaboration platform of choice for the world's most innovative... ...and privacy requirements. Staff Data Scientist LiveRamp is the data collaboration platform... ...they do. Fun: We host in-person and virtual events such as game nights, happy hours...VirtualWork at officeWork from homeFlexible hoursNight shift- ...Sr. Data Engineers 1+ year contract Hybrid 2-3 days a week onsite in New York, NY Interview process will be a single round 1-hour long virtual IV on Zoom Requirements: Minimum of 5-7+ years of applicable Data Engineering experience...VirtualSeniorContract work2 days per week3 days per week
$190k - $270k
...saving behavioral health treatment. We deliver personalized, virtual care rooted in connection—between clients and clinicians,... ...they deserve, we'd love to meet you. About the Role As a Staff Data Scientist - Growth and Marketing, you'll be seen as a thought leader among...VirtualFull timeTemporary workWork at officeLocal area$124k - $175k
...a positive mark on culture. Senior Data Scientist, Causal Science (45539) Overview... ...Opportunities for both on-site and virtual engagement events. Unique opportunities... ...benefits/programs and social impact outreach initiatives, we believe that opportunity, access, resources...VirtualSenior- ...BDIPlus is seeking a Senior Data Engineer to support a Fortune 100 financial services client’s real-time intelligence initiatives. In this role, You’ll work in a virtualization-first architecture using Denodo , building both virtual and physical data products that are...VirtualSenior
$124k - $186k
...Overview: We are seeking a Senior Data Scientist who is excited to build ML products that... ...Opportunities for both on-site and virtual engagement events. Unique opportunities... ...benefits/programs and social impact outreach initiatives, we believe that opportunity, access,...VirtualSeniorWorldwide$150k - $200k
...Senior Data Engineer New York, NY About The Role Our... ...executing cross-engineering initiatives and projects, as well as developing... ...with data analysts, data scientists, engineers, and cross-functional... ...is K Health's AI-powered virtual care engine. Esteemed health...VirtualSeniorFull timeLocal area$142.6k - $153.1k
...quality and accessible. With in-person and virtual clinics in multiple states, the company... .... About the Role We’re looking for a Sr. Data Engineer with strong data platform experience... ...use. You will partner closely with data scientists, analysts, and product managers to...VirtualSeniorHourly pay$150k - $180k
...Gratitude Hone has been fully virtual from day one and will... ...leadership—the ability to drive initiatives forward while remaining excited... ...Role Hone is looking for a Sr Data Engineer to join our team.... ...with Analytics Engineers, Data Scientists, Analysts and Software...VirtualSeniorFull timeTemporary workPart timeFor contractorsRemote workFlexible hours$180k - $250k
...Senior Software Engineer, Data Engineering About Atria Atria is a membership‑based preventive... ...insights. You’ll own complex data initiatives from design through implementation while... ...& dependents giving access to 24/7 virtual care Fertility & family planning Company...VirtualSeniorRemote workFlexible hours$156.8k - $235.2k
...Overview We are hiring a Senior Lead Data Engineer to build and scale the data foundations... .... Opportunities for both on-site and virtual engagement events. Unique... ...benefits/programs and social impact outreach initiatives, we believe that opportunity, access,...VirtualSenior- ...as well as in Belfast. For more information, visit DailyPay's Press Center. The Role: DailyPay is seeking a Senior Staff Data Scientist to serve as the technical architect and strategic leader for our Personalization & AI Platform domains. This is one of the most...SeniorTemporary workLocal area
- ...Data Scientist Location: New York On-site | Full-time Compensation: Competitive Our client is a high-performance technology... ...technical feasibility. Project Ownership: Drive data initiatives from initial problem identification through to solution...Full timeWork at officeImmediate start
$132k - $264k
...What you'll do... Role summary: Walmart is seeking a Staff Data Scientist to leverage advanced analytics and machine learning techniques... ...business domains and databases to support analytics initiatives. Perform data quality assessments and ensure data suitability...Full timeTemporary workPart time$212k - $265k
...decision engine for better mental health outcomes. As a Staff Data Scientist, Product Analytics , you will be a senior technical and... ...build conviction in what "good" looks like. Lead multi-team initiatives. Drive cross-functional work where the right answer is not...Work from homeFlexible hours$238k - $302k
...achieving Waymo's ambitious goals. In this role, you will lead key initiatives for measuring the quality and trustworthiness of the behavior... ...experience, or 7+ years of industry experience solving data science problems Solid statistical background. Expertise using...Full timeRemote work$195.5k - $218.5k
...Staff Data Scientist OpenX is focused on unleashing the full economic potential of digital media companies. We do this by making digital... ...Identify and lead high-impact, cross-team data science initiatives, such as improving bidding strategies, building new prediction...Work experience placementLocal area$150k - $220k
...health treatment. We deliver personalized, virtual care rooted in connection—between clients... ...As a Senior Platform Engineer on the Data Platform team, you will be responsible for... ...intersection of every major engineering initiative, working across Product Engineering, Analytics...VirtualSeniorFull timeWork at officeLocal areaRemote work$135.9k - $153k
...on benefit programs to government agency staff. Through human-centered design and modern... ..., empathy, and accessibility. The Senior Data Engineer role will be responsible for developing... ...health resources & support tools. Virtual care – See doctors online with no copay through...VirtualSeniorContract workTemporary workWork at officeLocal areaRemote workHome officeVisa sponsorshipFlexible hours- Google Inc. is seeking a Staff Data Scientist in New York to lead initiatives in data analysis and machine learning. The successful candidate will have a master's degree in a quantitative field and at least 8 years of relevant experience. Responsibilities include leading...
$207k - $300k
Staff Data Scientist, Research, Search Health Mountain View, CA, USA; New York, NY, USA; San Francisco, CA, USA; and additional locations. Advanced... ...sources, product headroom analysis for key AI features. Initiate projects inside and across the organization and drive to...Full timeWork experience placement$124k - $175k
...Senior Data Scientist We're on a mission to unleash the power of content… you in? We've got the brands, we've got the stars, we've got... ...Paramount's most dynamic teams. Opportunities for both on-site and virtual engagement events. Unique opportunities to make meaningful...VirtualSenior$62k - $78k
...The Staff Scientist will prepare grant applications and other scientific... ...experiments, and integrate team data. Assists in the design... ...genetics, genomics, cancer biology, neuroscience or related discipline... ...the status quo; takes initiative and is solution-oriented;...For contractorsVisa sponsorshipRelocation package$150k - $200k
...expertise, and 200+ petabytes of multimodal data linked to patient outcomes, so we can... ...energized by complexity, and want to apply AI, biology, or both to redefine the future of drug... ...Medicine We are hiring specialized scientists to accelerate development of our Oncology...SeniorWork at officeShift work3 days per week- ...unit or location. Position: Principal Data Scientist Location: REMOTE Remote Status: Remote... ...data science and AI roadmap, sequencing initiatives based on business value, technical feasibility... ...in person, via phone, and through virtual collaboration tools. Occasional...VirtualWeekly payFull timeTemporary workWork at officeLocal areaImmediate startRemote work
- ...clinicians-because healthcare deserves better software. As a Senior Data Engineer, you'll be hands-on with the backbone of that system:... ..., and vision plans with nationwide coverage, including 24/7 virtual urgent care. Mental Health Support: Weekly therapy reimbursement...VirtualSeniorWork at officeRemote workFlexible hours
- ...life-saving behavioral health treatment. We deliver personalized, virtual care rooted in connection—between clients and clinicians, care... ...deserve, we’d love to meet you. About The Role The Analytics & Data Engineering team owns all post-transactional data operations that...VirtualSeniorLocal area
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Sr Staff Data Scientist, Virtual Biology Initiative. Be the first to apply!
- assistant scientist New York, NY
- python data scientist New York, NY
- data scientist no experience New York, NY
- healthcare data scientist New York, NY
- junior data scientist remote New York, NY
- data scientist New York, NY
- ai data scientist New York, NY
- data scientist (hedge fund) New York, NY
- entry level data scientist remote New York, NY
- junior data scientist New York, NY



