Preclinical MRI studies have been utilized for the discovery of biomarkers that predict post-traumatic epilepsy (PTE). However, these single site studies often lack statistical power due to limited and homogeneous datasets. Therefore, multisite studies, such as the Epilepsy Bioinformatics Study for Antiepileptogenic Therapy (EpiBioS4Rx), are developed to create large, heterogeneous datasets that can lead to more statistically significant results. EpiBioS4Rx collects preclinical data internationally across sites, including the United States, Finland, and Australia. However, in doing so, there are robust normalization and harmonization processes that are required to obtain statistically significant and generalizable results. This work describes the tools and procedures used to harmonize multisite, multimodal preclinical imaging data acquired by EpiBioS4Rx. There were four main harmonization processes that were utilized, including file format harmonization, naming convention harmonization, image coordinate system harmonization, and diffusion tensor imaging (DTI) metrics harmonization. By using Python tools and bash scripts, the file formats, file names, and image coordinate systems are harmonized across all the sites. To harmonize DTI metrics, values are estimated for each voxel in an image to generate a histogram representing the whole image. Then, the Quantitative Imaging Toolkit (QIT) modules are utilized to scale the mode to a value of one and depict the subsequent harmonized histogram. The standardization of file formats, naming conventions, coordinate systems, and DTI metrics are qualitatively assessed. The histograms of the DTI metrics were generated for all the individual rodents per site. For inter-site analysis, an average of the individual scans was calculated to create a histogram that represents each site. In order to ensure the analysis can be run at the level of individual animals, the sham and TBI cohort were analyzed separately, which depicted the same harmonization factor. The results demonstrate that these processes qualitatively standardize the file formats, naming conventions, coordinate systems, and DTI metrics of the data. This assists in the ability to share data across the study, as well as disseminate tools that can help other researchers to strengthen the statistical power of their studies and analyze data more cohesively.
Read full abstract