NHLBI’s BioData Catalyst is a shared virtual space where scientists can access and work with the digital objects of biomedical research, such as data and software. It is a cloud-based platform for tools, applications, and workflows, and it provides secure workspaces to share, store, cross-link, and analyze large sets of data generated from biomedical and behavioral research, while also ensuring patient privacy.
BioData Catalyst will meet the needs of the NHLBI and our research community by enhancing access to NHLBI data, like NHLBI's Trans-Omics for Precision Medicine (TOPMed) Program, which is among the first available datasets in BioData Catalyst. The platform will also provide access to tools that can be used to analyze various data types, including phenotypic, genomic, other omics, and imaging data.
BioData Catalyst directly addresses the NHLBI Strategic Vision objective of leveraging emerging opportunities in data science to open new frontiers in heart, lung, blood, and sleep (HLBS) research.
BioData Catalyst will offer specialized search functions, controlled access to data, and analytic tools via widely available programming interfaces. With these capabilities, NHLBI researchers and other scientists will be able to use NHLBI datasets for scientific discovery.
BioData Catalyst will use HLBS research to test and expand the platform. In the long term, BioData Catalyst will integrate massive datasets from NHLBI-supported clinical, population-based, and genomic studies to support NHLBI efforts toward precision medicine.
BioData Catalyst leverages the NIH Science and Technology Research Infrastructure for Discovery, Experimentation, and Sustainability (STRIDES) Initiative to enhance efficiency of data storage and computation by partnering with commercial providers. It is also collaborating with the Office of Data Science Strategy to support its efforts to connect data systems across NIH.
BioData Catalyst is a joint effort of the NHLBI and data science experts in academic institutions, research organizations, and industry. Harvard Medical School, Seven Bridges Genomics, the Renaissance Computing Institute, University of California Santa Cruz, the Broad Institute, and University of Chicago are working closely with the NHLBI to develop the platform.
BioData Catalyst is governed by a steering committee that includes the development teams, NLHBI staff, and data producers and consumers. An external panel of experts provides guidance to the NHLBI during the development and implementation of BioData Catalyst.
The BioData Catalyst development team is building this platform by engaging in the following activities.
WHAT: In a bold step for precision medicine, researchers at the National, Heart, Lung, and Blood Institute (NHLBI) today announced they are releasing for study nearly 9,000 whole genomes, courtesy of participants in the Institute’s Trans-Omics for Precision Medicine Program (TOPMed). The genomes—an organism’s complete set of DNA—are the first ever...