Senior Bioinformatician - Genomics Data Infrastructure
$175k - $225k
Violet Research
About Violet Research Institute
Violet Research Institute (VRI) is building the future of personalized medicine for patients with genetic diseases. We combine the urgency and execution mindset of a startup with the mission‑driven openness of a nonprofit, bringing together leading researchers, engineers and organizations across omics, therapeutic design, manufacturing, clinical care and AI to move from insight to action as quickly as possible.

Location: San Francisco Bay Area (or Remote, US‑based preferred)
Type: Full‑Time
Compensation: $175k – $225k

Role Overview
As the founding Bioinformatician you will architect and own VRI’s entire genomics data foundation, from sequencing data ingestion to quality control, storage, and analysis readiness. You will design scalable, reproducible pipelines and data infrastructure that enable rapid, reliable clinical interpretation. Your work will directly impact real‑patient outcomes.

What You’ll Own

Genomics Data Management & Stewardship
- Own the full lifecycle of VRI’s genomics data, from raw sequencer output (FASTQ, BAM/CRAM, VCF) through QC, storage, versioning, and retrieval.
- Define and enforce data standards, naming conventions, metadata schemas and ontologies for all data types.
- Build and maintain a centralized, queryable genomics data lake that unifies heterogeneous inputs from internal labs and CRO partners.
- Establish sample tracking, data lineage documentation, and versioning protocols so every result is traceable back to its source.
- Manage cloud storage strategy (AWS S3 or GCP) across hot, warm, and cold tiers, balancing cost, accessibility and HIPAA‑compliant security.
- Create and maintain an internal data catalog documenting all datasets, pipeline versions, and transformation logic.
- Design and build production‑grade, reusable pipelines for ingesting and processing PacBio long‑read WGS data, including phased genome assembly, structural variant calling and SNP/indel detection.
- Build ETL workflows that clean, normalize and integrate diverse data modalities into unified, analysis‑ready formats.
- Automate QC steps to surface data anomalies early and monitor data quality continuously across sequencing batches and CRO handoffs.
- Establish code quality standards, testing protocols, and deployment practices (version control, containerization) that will scale as the team grows.
- Maintain and develop internal database systems, including our proprietary VRI OS platform used for experiment tracking.
- Integrate physics‑based thermodynamic models and predictive algorithms to forecast therapeutic performance and guide design decisions.
- Develop and apply design criteria and ranking systems to evaluate therapeutic candidates computationally before advancing to wet‑lab testing.
- Build and maintain algorithms that bridge computational predictions with experimental validation, optimizing the design‑to‑testing pipeline.

Multiomics Integration
- Integrate genomics data (DNA, RNA‑seq, long‑read RNA, splicing) with proteomics, metabolomics and mass‑spectrometry data into coherent, patient‑centric multiomics datasets.
- Query and harmonise large‑scale population cohorts (UK Biobank, Mount Sinai Million and similar) to contextualise patient findings.
- Partner with computational biologists and clinical scientists to surface analysis‑ready datasets, enabling and supporting their interpretation work.

Insight Delivery & Reporting
- Build automated reporting pipelines that push structured summaries of data quality, pipeline status and batch results to scientific stakeholders.
- Develop QC dashboards to surface data quality metrics, pipeline status and anomaly alerts in real time.
- Support IND filings through preparation of relevant datasets and written reports.
- Monitor the bioinformatics landscape, identifying emerging algorithms and platforms that can sharpen VRI’s data infrastructure.
- Lay the foundation for future bioinformatics hires by embedding well‑documented, reproducible data practices from day one.

Qualifications

Must Have
- 4+ years of hands‑on bioinformatics experience in a research or biotech environment, focused on genomics data management and pipeline engineering.
- Experience owning genomics data end‑to‑end – building systems and standards that make data trustworthy and reusable.
- Strong fluency in genomics file formats and toolchains (FASTQ, BAM/CRAM, VCF, BED; GATK, DeepVariant, PBSV; hifiasm or equivalent).
- Demonstrated experience with PacBio long‑read WGS data and associated tooling.
- Proficiency in Python and production‑grade pipeline development with workflow managers (Nextflow, Snakemake, or WDL).
- Hands‑on experience with cloud data infrastructure (AWS S3 or GCP), data lake design, pipeline orchestration and HIPAA‑compliant storage.
- Experience querying and integrating biobank‑scale datasets (UK Biobank or similar).
- Strong organisational skills – you document and build systems others can use, taking ownership of data quality without being asked.

Preferred
- Experience with RNA‑seq and long‑read RNA analysis, including pre‑mRNA processing and splicing characterisation.
- Familiarity with LIMS systems (Benchling, LabVantage or similar) and data governance / FAIR data frameworks.
- Experience with containerisation tools (Docker, Singularity) and CI/CD practices.
- Exposure to siRNA, ASO or other therapeutic modality‑specific bioinformatics.
- Experience in a seed or early‑stage biotech; comfort building infrastructure from scratch.
- Ability to execute independently from loosely specified tasks; you are self‑directing.
- Clear communication of challenges and progress when seeking help.
- Thrives in early‑stage, ambiguous, high‑pace environments.
- Mission‑driven with genuine, active care for patient impact.

Benefits
We offer competitive compensation, a full suite of benefits, and a culture that embraces AI across all functions to accelerate discovery and impact.



