Abstract

BackgroundTo drive translational medicine, modern day biobanks need to integrate with other sources of data (clinical, genomics) to support novel data-intensive research. Currently, vast amounts of research and clinical data remain in silos, held and managed by individual researchers, operating under different standards and governance structures; a framework that impedes sharing and effective use of data. In this article, we describe the journey of British Columbia’s Gynecological Cancer Research Program (OVCARE) in moving a traditional tumour biobank, outcomes unit, and a collection of data silos, into an integrated data commons to support data standardization and resource sharing under collaborative governance, as a means of providing the gynecologic cancer research community in British Columbia access to tissue samples and associated clinical and molecular data from thousands of patients.ResultsThrough several engagements with stakeholders from various research institutions within our research community, we identified priorities and assessed infrastructure needs required to optimize and support data collections, storage and sharing, under three main research domains: (1) biospecimen collections, (2) molecular and genomics data, and (3) clinical data. We further built a governance model and a resource portal to implement protocols and standard operating procedures for seamless collections, management and governance of interoperable data, making genomic, and clinical data available to the broader research community.ConclusionsProper infrastructures for data collection, sharing and governance is a translational research imperative. We have consolidated our data holdings into a data commons, along with standardized operating procedures to meet research and ethics requirements of the gynecologic cancer community in British Columbia. The developed infrastructure brings together, diverse data, computing frameworks, as well as tools and applications for managing, analyzing, and sharing data. Our data commons bridges data access gaps and barriers to precision medicine and approaches for diagnostics, treatment and prevention of gynecological cancers, by providing access to large datasets required for data-intensive science.

Highlights

  • To drive translational medicine, modern day biobanks need to integrate with other sources of data to support novel data-intensive research

  • We have described the journey followed towards implementing a data commons to benefit the gynecologic cancer community in British Columbia

  • The appropriate technological solution suitable for each type of data needs to be in place; there is no single solution that can be adapted to all data collections but multiple solutions should be integrated

Read more

Summary

Introduction

Modern day biobanks need to integrate with other sources of data (clinical, genomics) to support novel data-intensive research. The last decade has seen advances in biotechnology such as generation sequencing (NGS), and the emergence of “omics” techniques for precision medicine (e.g., genomics, transcriptomics, proteomics, metabolomics, and epigenomics) These innovations coincided with breakthroughs in computing, artificial intelligence (AI) and analytics, enabling discrimination between disease with greater precision [12]. Present day research environments and needs have led to the development and implementation of data commons [16, 17], bringing together, within a research community, diverse data, computing infrastructure, as well as tools and applications for managing, analyzing, and sharing interoperable data This has created an opportunity to maximize collaborations and to extend the value generated from primary data collection [18]

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call