Abstract
Purpose: Lack of reproducibility in scientific research, particularly in healthcare, has become a growing problem in recent years. The problem is especially acute in the emerging field of radiomics/radiogenomics, where large data sets and huge numbers of feature variables increase the risk of spurious correlations that are not driven by biology. To address it, we have developed an open-access database, The Cancer Imaging Archive (TCIA), which allows researchers to share the original image data needed to accurately compare and validate research methods.
Methods: Scalable processes for data de-identification were developed that leverage the DICOM standard (PS 3.15 Annex E) to ensure compliance with HIPAA regulations. Submission software is customized for each new data set, and a team of trained experts assists submitters with the upload process. Data are organized into collections related to a particular cancer type, modality, or research question. Collections may be open-access or restricted to specific users. Users may browse and download the data through a web browser or through programmatic interfaces. Digital object identifiers (DOIs) can be created to make data easy to reuse and to cite in related publications. A helpdesk is available to answer questions from TCIA users.
Results: TCIA contains over 52 data collections (46 publicly accessible) for a total of 26.5 million images and associated data. Sixteen of the TCIA data collections support radiogenomics research by linking imaging from subjects with genomic, clinical, and pathology data available on The Cancer Genome Atlas data portal or Gene Expression Omnibus. Approximately 3,000 users visit the site monthly. More than 90 manuscripts and 12 DOIs have been published relating to TCIA data.
Conclusion: TCIA provides a wealth of high-value imaging data to the imaging research community, as well as the comprehensive services required to support its user community and facilitate continued growth. Funded by NCI Contract No. HHSN261200800001E.