Sr Staff Data Scientist, Virtual Biology Initiative
$241k - $331.1kBiohub
Sr Staff Data Scientist, Virtual Biology Initiative New York, NY (Hybrid); Redwood City, CA (Hybrid) Biohub is the first large-scale initiative bringing frontier AI models, massive compute, and frontier experimental capabilities under one roof. We're building a general-purpose system to accelerate scientific discovery, integrating frontier AI models, biological foundation models, and lab capabilities, with the ultimate goal of curing disease. Our technology powers scientists around the world, translating AI capabilities into tools that accelerate research everywhere. The Team Biohub's data organization is responsible for producing biologically informative, petabyte-scale, AI-ready datasets for frontier models of cell biology. Our work spans genomics, imaging, and proteomics, and we're building the data systems that will enable a new generation of biological AI. The team consists of data engineering, data science, and technical program management. We operate with a flat structure that emphasizes strong IC ownership. We're solving hard problems at the intersection of scientific strategy, large-scale data infrastructure, and foundation model training. The Opportunity In April 2026, Biohub launched the Virtual Biology Initiative—a $500 million, five-year commitment to galvanize a global effort to build predictive models of the human cell. This initiative will bring together leading institutions to generate the multi-modal biological data, at unprecedented scale, that will power the next generation of AI models for biology while producing datasets of unprecedented size. Our data science team defines the algorithms and processing approaches that turn raw biological measurements into rich representations models can actually learn from. That includes designing data formats and representations optimized for AI use cases, building cost‑aware processing pipelines that balance expressiveness with efficiency, developing scalable QC and validation frameworks across modalities, creating agent‑augmented curation tools for metadata extraction and ontology mapping, and building the cross‑modal entity resolution and semantic infrastructure that ties it all together. Both the scale and domain are active research areas. How do you tokenize a cell image? How do you represent a perturbation experiment? How do you combine transcriptomics with imaging in a way that preserves biological meaning? These questions don't have established answers. We need scientific leaders who can work at this frontier: people who understand biological measurement deeply, think creatively about data representations, sampling, and tokenization strategies, and can translate that thinking into data representations that enable novel training architectures. You’ll work directly with scientists, computational biologists, data engineers, and AI researchers to define model input and biological evaluations. You will operate with broad scope and high autonomy, influencing roadmap decisions across teams while mentoring senior individual contributors. Success in this role means creating and implementing data systems that are not only large, but adaptive, interpretable, and scientifically grounded—accelerating progress toward robust biological frontier models and ultimately advancing human health. What You’ll Do Set technical vision and strategy for the design of data representations and tokenization strategies across biological data types—including imaging, sequencing, and multimodal data that enable novel model architectures Develop, deploy and validate approaches for combining heterogeneous data modalities into unified training frameworks, designing for robustness to noise, bias, and batch effects Evaluate model performance, identifying which biological signals are captured or lost and iterating to improve Partner deeply with ML engineers and AI researchers to co‑design datasets and optimize model training, evaluation, and generalization Lead cross‑functional initiatives spanning data engineering, infrastructure, science, and product, aligning technical execution with long‑term scientific goals Identify and drive new data acquisition and generation opportunities, from consortium partnerships to internal experimental pipelines Serve as a technical mentor and leader, raising the bar for data science and ML rigor across the organization What You’ll Bring 12+ years of experience (or PhD + 7 years) working with large‑scale biological datasets, including ownership of end‑to‑end data products Deep expertise in at least one of: (a) imaging data—microscopy, cell phenotyping, spatial biology, and the data characteristics of image‑based biological measurement; or (b) genomics data—bulk and single‑cell sequencing, functional genomics, epigenomics, transcriptomics, spatial biology, and/or multi‑omics Understanding of how to transform raw biological data into AI‑ready datasets, including familiarity with scientific best practices, noise characteristics, batch effects, and quality assessment specific to your domain Experience with tokenization strategies for non‑text data (images, sequences, graphs, time series) or with creating data representations and feature engineering for machine learning in scientific or biological contexts Strong expertise in data science and statistical modeling; familiarity with modern ML architectures (transformers, diffusion models, or similar) and how data representation choices affect learning Strong computational skills; demonstrated ability to design robust, extensible data architectures Excellent communication and leadership skills, with the ability to translate between biology, ML, and engineering audiences and align teams to deliver complex projects Creative, first‑principles thinking about how to structure data for learning Compensation The Redwood City, CA & New York City, NY base pay range for a new hire in this role is $241,000.00 - $331,100.00. New hires are typically hired into the lower portion of the range, enabling employee growth in the range over time. Actual placement in range is based on job‑related skills and experience, as evaluated throughout the interview process. Better Together As we grow, we’re excited to strengthen in‑person connections and cultivate a collaborative, team‑oriented environment. This role is a hybrid position requiring you to be onsite for at least 60% of the working month, approximately 3 days a week, with specific in‑office days determined by the team’s manager. The exact schedule will be at the hiring manager’s discretion and communicated during the interview process. Benefits for the Whole You We’re thankful to have an incredible team behind our work. To honor their commitment, we offer a wide range of benefits to support the people who make all we do possible. Provides a generous employer match on employee 401(k) contributions to support planning for the future. Paid time off to volunteer at an organization of your choice. Relocation support for employees who need assistance moving Voluntary Self Identification For reporting purposes, we ask candidates to respond to the below self‑identification survey. Completion of the form is entirely voluntary. Whatever your decision, it will not be considered in the hiring process or thereafter. Any information that you do provide will be recorded and maintained in a confidential file. As set forth in the organization’s Equal Employment Opportunity policy, we do not discriminate on the basis of any protected group status under any applicable law. If you believe you belong to any of the categories of protected veterans listed below, please indicate by making the appropriate selection. Classification of protected categories is as follows: A "disabled veteran" is one of the following: a veteran of the U.S. military, ground, naval or air service who is entitled to compensation (or who but for the receipt of military retired pay would be entitled to compensation) under laws administered by the Secretary of Veterans Affairs; or a person who was discharged or released from active duty because of a service‑connected disability. A "recently separated veteran" means any veteran during the three‑year period beginning on the date of such veteran’s discharge or release from active duty in the U.S. military, ground, naval, or air service. An "active duty wartime or campaign badge veteran" means a veteran who served on active duty in the U.S. military, ground, naval or air service during a war, or in a campaign or expedition for which a campaign badge has been authorized under the laws administered by the Department of Defense. An "Armed forces service medal veteran" means a veteran who, while serving on active duty in the U.S. military, ground, naval or air service, participated in a United States military operation for which an Armed Forces service medal was awarded pursuant to Executive Order 12985. Reasonable Accommodation Notice The organization provides (and state and federal law requires) reasonable accommodations to be provided to qualified applicants with disabilities. Your recruiter will work with you during the interview process should you require any such accommodations. Examples of reasonable accommodation include making a change to the application process or work procedures, providing documents in an alternate format, using a sign language interpreter, or using specialized equipment. Background Check Notice As part of our hiring process, all offers of employment are contingent upon the successful completion of a background check. By submitting your application, you acknowledge that you will be required to undergo a background check prior to employment. Artificial Intelligence Usage Notice We use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications and analyzing resumes. These tools assist our recruitment team but do not replace human judgment. Hiring decisions are ultimately made by humans. If you have questions about this once your are in our hiring process, please contact your Recruiter. #J-18808-Ljbffr Biohub
$214k - $294.8k
...to unlock new dimensions of biological understanding. You will leverage... ..., high-quality biological data, powerful computing infrastructure... ...AI. We’re looking for a data scientist with deep expertise in... ...generalization. Lead cross-functional initiatives spanning data engineering,...SuggestedWork at officeWorldwideRelocation package3 days per week- Biohub in Redwood City, CA is looking for a Sr Staff Data Scientist to lead the design of data representations and tokenization strategies for biological data. This role offers the chance to work directly with scientists and engineers, fostering collaboration and innovation...VirtualSenior
$192k - $260k
...we are obsessed with enabling data teams to solve the world's... ...fleet consists of millions of virtual machines, generating terabytes... ...of the above. Role As a Data Scientist on the Data Team, you will help... ...and key Engineering initiatives (reliability and efficiency)....VirtualWork at officeLocal areaWorldwide- ...Data Analyst Silicon Valley R&D Center is looking for an experienced Data Analyst to join our Ads Technology team. The ideal candidate... ...and with the multi-teams including PM/Engineering/Data Scientist Job Specifications (Education, Knowledge, Skills, and Abilities...SeniorWork experience placement
$241k - $338k
...Biohub is the first large-scale initiative bringing frontier AI models,... ...frontier AI models, biological foundation models, and lab capabilities... .... Our technology powers scientists around the world, translating... ...Opportunity The role is part of the Data Engineering team, which...SeniorWork at officeWorldwideRelocation package3 days per week- ...The Chan Zuckerberg Initiative was founded by Priscilla... ...make it possible to help scientists cure, prevent, or... ...strides in understanding biological systems, advancing... ...Building an AI-based virtual cell model to predict... ...companies that fill critical data, research or...VirtualLocal areaRelocation package
$174k
...surrounded by opportunities to drive new initiatives and innovations. At our core, we are... ...is massive! What You Will Do Own the Data Science function within a core pillar of... ...closely related field 8 years of Data Scientist experience in customer-facing technology...SeniorFull timeTemporary workFlexible hours$160k - $225k
...long-term vision is to use the data we collect to diagnose health conditions... ...Details Position Title: Data Scientist / Senior Data Scientist... ...the forefront of our AI-driven initiatives, working with a diverse team of experts in biology, bioinformatics, and machine learning...Senior$163.2k - $220k
...: Natera is seeking a Staff Machine Learning Scientist – Agentic AI to join our... ...Leveraging a proprietary data moat of over 250,000 oncology... ...capable of multi-step biological reasoning, converting complex... ...foundation models, and simulating virtual patient trajectories....VirtualWork at officeImmediate startRemote workWorldwide$238k - $302k
...Staff Data Scientist Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its... ...achieving Waymo's ambitious goals. In this role, you will lead key initiatives for measuring the quality and trustworthiness of the...Full timeRemote work$92.5k - $114.25k
...artificial intelligence and biology, working to accelerate... .... Unlike academia, our scientists have long-term funding... ...alone. Our two Institute Initiatives reflect this model in action: Virtual Cell Initiative :... ...generating high quality data and maintaining good documentation...VirtualSeniorFlexible hoursWeekend work$179k - $246k
...Manager to lead highly cross-functional initiatives that substantiate the safety case for Zoox... ...Your work will span both real-world and virtual environments, ensuring that the autonomy... ..., software pipelines, and/or data-driven system modeling and analysis....VirtualSeniorTemporary workRelocation package$123.2k - $218k
...Data Scientist At Databricks, we are obsessed with enabling data teams to solve the world's toughest problems, from security threat detection... ...scale software platforms. The fleet consists of millions of virtual machines, generating terabytes of logs and processing...VirtualSeniorWork at officeLocal areaWorldwide$146.29k - $304.75k
...industry-leading solutions-including Unified Endpoint Management, Virtual Apps and Desktops, Digital Employee Experience, and Security... ...across the Omnissa product ecosystem. As a Senior Data Scientist, you will lead and innovate within the data science team to drive...VirtualSeniorWork experience placementLocal areaVisa sponsorshipFlexible hours- ...less work, and complete confidence. Responsibilities Drive the initiation and design of complex recommendation and agent model solutions.... ...Have extensive prior experience building end-to-end, reusable data and model pipelines — from data acquisition through to complex...Senior
$179k - $246k
...Manager to lead highly cross-functional initiatives that substantiate the safety case for Zoox... ...Your work will span both real-world and virtual environments, ensuring that the autonomy... ..., software pipelines, and/or data-driven system modeling and analysis....VirtualSeniorTemporary workRelocation package$161.4k - $258.5k
...and internal business units. The Internal Data & Analytics group supports these... ...groups at Visa. This Senior Data Scientist will partner with our North America FP&A... ...supplier flows, pricing, yields and growth initiatives within VCS. Visa Direct: ~ Support...SeniorWork experience placementWork at officeLocal areaImmediate startFree visa- ...design, and augment high-quality datasets to fuel cutting-edge AI initiatives. Optimize LLM operations to maximize efficiency and... ...maintain a deeply collaborative and fun-loving atmosphere—from virtual lunches and draw battles to cooking classes that bring our distributed...VirtualSeniorWork at officeRemote work
$84.25k - $114.25k
...artificial intelligence and biology, working to accelerate... .... Unlike academia, our scientists have long-term funding... ...alone. Our two Institute Initiatives reflect this model in action: Virtual Cell Initiative:... ...can run and whether its data is interpretable. You will...VirtualSeniorTemporary workFor contractorsFixed term contractCasual workWork at office$214k - $294.8k
A leading biomedical research organization in Redwood City is seeking a Data Scientist with expertise in genomics to enhance biological research through AI. You will set the technical vision for data representations and collaborate with teams to design scalable solutions...$300k
...Staff Data Scientist Grindr is an AI-native platform powering how millions of gay people connect globally. With 15M+ monthly users, 130B+ annual messages, and a team of fewer than 200, we move fast, stay lean, and tackle technical problems at a scale few companies...Work at officeImmediate startWorldwideFlexible hours$169.1k - $270.8k
...Senior Consultant Data Engineer As a Senior Consultant Data Engineer, you'll join our Value Added Services – Digital Marketing... ...scalable solutions. You'll drive internal proof of concept initiatives. When needed, quickly design and implement a prototype of a system...SeniorWork experience placementWork at officeLocal areaImmediate start- ...Position: Sr Network Engineer /Architect Location: Redwood city, California (Day 1 onsite) Position Summary: Monitor... ...on all supported devices and for application load balancer virtual IPs (VIPs). Implement and validate firewall security policies...VirtualSenior
- ...Job Application Privacy Notice When you apply to a job on this site, the personal data contained in your application will be collected by SentinelOne, Inc., located at 444 Castro Street, Suite 400, Mountain View, CA 94041, ("Controller") and/or its affiliates ("Controller...
- Hybrid, Palo Alto (Hybrid - 2 days/week in office) Reports to: Director Data Science + Analytics About the Role We are looking for a highly strategic Senior or Staff Data Scientist to design, build, and own the end-to-end data framework that defines our business health...Work at officeImmediate startShift work2 days per week
- ...Overview Come join the Intuit Customer Success (ICS) Data Science team as a Staff Data Scientist. This role will be pivotal in shaping how we measure, grow, and optimize the externalization of Intuit's expert capabilities across ICS and various business segments. You...Work experience placementLocal area
- ...and driving the workshops with 30+ participants - in-person and virtual Coordinate research and usability studies, collaborating... ...delivery of a large-scale corporate website implementation initiative Deep understanding of strategizing; Omnichannel strategy...VirtualSenior
- ...artificial intelligence and biology, working to accelerate... .... Unlike academia, our scientists have long-term funding... ...alone. Our two Institute Initiatives reflect this model in action: Virtual Cell Initiative :... ...analysis, troubleshooting, and data interpretation with a...Virtual
- Chan Zuckerberg Initiative is seeking a Senior Manager for Data Science & Engineering in Redwood City, CA. You will lead and manage a team of data scientists and engineers, driving strategic decision-making and building data-driven products for the education sector. The...SeniorFlexible hours
- Overview Client is a global Bio-Pharmaceutical Company. Responsibilities The candidate will collaborate with scientists and other informaticians to help discover lead antibodies from NGS workflows. The candidate is expected to develop new tools and analytic methods...
Do you want to receive more vacancies?
Subscribe and receive similar vacancies to Sr Staff Data Scientist, Virtual Biology Initiative. Be the first to apply!
- energy data scientist Redwood City, CA
- data scientist (hedge fund) Redwood City, CA
- python data scientist (contract) Redwood City, CA
- healthcare data scientist Redwood City, CA
- python data scientist Redwood City, CA
- senior data scientist Redwood City, CA
- entry level data scientist remote Redwood City, CA
- data scientist Redwood City, CA
- senior game producer Redwood City, CA
- senior manager quality engineering Redwood City, CA

