I wrote an article with Josh Denny (Vanderbilt), David Glazer (Verily Life Sciences), Benedict Paten (University of California at Santa Cruz), and Anthony Philippakis (Broad Institute) about what we are calling a data biosphere for biomedical research.
In this article, we introduce four governing principles for data biospheres. A data biosphere should be:
- modular, composed of functional components with well-specified interfaces;
- community-driven, created by many groups to foster a diversity of ideas;
- open, developed under open-source licenses that enable extensibility and reuse, with users able to add custom, proprietary modules as needed;
- and standards-based, consistent with standards developed by coalitions such as the Global Alliance for Genomics and Health (GA4GH).