Non-imaging Data Research Articles

BackgroundMultimodal data, especially imaging and non-imaging data, is being routinely acquired in the context of disease diagnostics; however, computational challenges have limited the ability to quantitatively integrate imaging and non-imaging data channels with different dimensionalities and scales. To the best of our knowledge relatively few attempts have been made to quantitatively fuse such data to construct classifiers and none have attempted to quantitatively combine histology (imaging) and proteomic (non-imaging) measurements for making diagnostic and prognostic predictions. The objective of this work is to create a common subspace to simultaneously accommodate both the imaging and non-imaging data (and hence data corresponding to different scales and dimensionalities), called a metaspace. This metaspace can be used to build a meta-classifier that produces better classification results than a classifier that is based on a single modality alone. Canonical Correlation Analysis (CCA) and Regularized CCA (RCCA) are statistical techniques that extract correlations between two modes of data to construct a homogeneous, uniform representation of heterogeneous data channels. In this paper, we present a novel modification to CCA and RCCA, Supervised Regularized Canonical Correlation Analysis (SRCCA), that (1) enables the quantitative integration of data from multiple modalities using a feature selection scheme, (2) is regularized, and (3) is computationally cheap. We leverage this SRCCA framework towards the fusion of proteomic and histologic image signatures for identifying prostate cancer patients at the risk of 5 year biochemical recurrence following radical prostatectomy.ResultsA cohort of 19 grade, stage matched prostate cancer patients, all of whom had radical prostatectomy, including 10 of whom had biochemical recurrence within 5 years of surgery and 9 of whom did not, were considered in this study. The aim was to construct a lower fused dimensional metaspace comprising both the histological and proteomic measurements obtained from the site of the dominant nodule on the surgical specimen. In conjunction with SRCCA, a random forest classifier was able to identify prostate cancer patients, who developed biochemical recurrence within 5 years, with a maximum classification accuracy of 93%.ConclusionsThe classifier performance in the SRCCA space was found to be statistically significantly higher compared to the fused data representations obtained, not only from CCA and RCCA, but also two other statistical techniques called Principal Component Analysis and Partial Least Squares Regression. These results suggest that SRCCA is a computationally efficient and a highly accurate scheme for representing multimodal (histologic and proteomic) data in a metaspace and that it could be used to construct fused biomarkers for predicting disease recurrence and prognosis.

Read full abstract

Finding the Meaning in Images:Annotation and Image Markup Daniel L. Rubin (bio) Keywords ontologies, semantic annotation, imaging, knowledge representation Biomedical images and ontologies are closely related conceptually, yet currently they are studied in isolation. Biomedical ontologies provide a representation of the canonical entities considered in biomedical research and clinical observations, and the relations among them. Images reveal instances of those entities and, taken in aggregate, inform the construction of ontologies describing the pertinent domain content revealed in the images. The article by Fielding and Marwede (2011) notes the differences between the ontology of the body and the ontology of the image, developing toward an application of ontology of the psychiatric domain. Although such ontology development is important for knowledge representation, it is also important to relate and integrate such ontologies with the actual images to which they relate. In this commentary, we describe ongoing work to accomplish this linkage. Connecting biomedical ontologies to images is an important activity. Biomedical images provide rich information, but the contents of images, such as the modality used to acquire them, the anatomy they contain, and visual observations made about images, are not explicit or computable. Image data are accumulating in a variety of online databases at an explosive pace, similar to nonimage data. But whereas nonimage data, such as genetic data, are easily processed by machines, image data are generally not exploited directly—images typically are stored in archives, and only particular data needed for the study in which the images were originally acquired are generally available for subsequent analysis. Consequently, informatics methods are in development to enable the community to leverage the vast amounts of images accumulating as products of biomedical research. The Challenges of Using Images in e-Science There is growing interest in applying semantic web technologies to biomedicine, because these methods can make biomedical data explicit and computable. An "e-Science" paradigm is emerging, and the biomedical community is looking for tools to help them access, query, and analyze the myriad of data available online. Specifically, they are beginning to embrace technologies for semantic scientific knowledge integration, such as ontologies (Bodenreider and Stevens 2006), standard syntaxes and semantics to make biomedical [End Page 311] knowledge explicit, and the Semantic Web (Ruttenberg et al. 2007). These technologies are enabling the community to access large amounts of data, and to interoperate among diverse data archives. Such technologies are showing promise in tackling the information challenges in biomedicine, and a variety of applications are quickly appearing (Ruttenberg et al. 2007). Although researchers can now access a broad diversity of biomedical data, a critical type of data—images—remains difficult to leverage. Those wanting to access and use imaging in their work face similar difficulties as the rest of the e-Science community, namely to manage, find, and use the voluminous amounts of imaging data accruing at an explosive pace. However, imaging poses unique challenges hindering direct translation of the informatics methods that are currently being applied to nonimaging biomedical data. Image Content Is Not Explicit and Machine Accessible Images contain rich information about anatomy and abnormal structures contained in the images; however, this is implicit knowledge that is deduced by the person viewing the image. For example, a researcher viewing an image may want to indicate where in the image particular areas of interest lie, and whether they are abnormal (Figure 1). This information, the semantic image content, is often considered "image metadata," including observations about images, interpretations, and conclusions, and it is generally not recorded in a structured manner nor directly linked to the image. Thus, images cannot be easily searched for their semantic content (e.g., find all images containing particular anatomy or representing particular abnormalities). No Controlled Image Terminology or Standard Syntax for Image Information There are no standard terminologies specifically for describing medical image contents—the imaging observations, the anatomy, and the pathology—and the syntax in which the information is recorded varies, with no widely adopted standards, resulting in limited interoperability. Descriptions of medical images are most frequently recorded in free text in an unstructured manner, limiting the ability of computers to analyze and access this information. Schemes for annotating images have been proposed in nonmedical domains...

Read full abstract

Non-imaging Data Research Articles

Related Topics

Articles published on Non-imaging Data

Hierarchical Segmentations with Graphs: Quasi-flat Zones, Minimum Spanning Trees, and Saliency Maps

Fusion of fMRI and non-imaging data for ADHD classification

SP-0597: Tissue classification models for prostate based on imaging and non-imaging data

SP-0596: Machine learning and bioinformatics approaches to combine imaging with non-imaging data for outcome prediction

A concept for holistic whole body MRI data analysis, Imiomics.

Devising an interpretable calibrated scale to quantitatively assess the dementia stage of subjects with alzheimer's disease: A machine learning approach

Advances in Classification of Crops using Remote Sensing Data

Label-aligned multi-task feature learning for multimodal classification of Alzheimer's disease and mild cognitive impairment.

A positive-negative mode of population covariation links brain connectivity, demographics and behavior.

Noninvasive Multimodal Imaging to Predict Recovery of Locomotion after Extended Limb Ischemia

Efficient field-programmable gate array implementation of CCSDS 121.0-B-2 lossless data compression algorithm for image compression

Interactive Visual Analysis of Image-Centric Cohort Study Data.

A kernel-based sparsity preserving method for semi-supervised classification

Content-based medical image retrieval: a survey of applications to multidimensional and multimodality data.

Designing user interfaces to enhance human interpretation of medical content-based image retrieval: application to PET-CT images

Development of spectral indices for detecting and identifying plant diseases

Consensus embedding: theory, algorithms and application to segmentation and classification of biomedical data

Supervised regularized canonical correlation analysis: integrating histologic and proteomic measurements for predicting biochemical recurrence following prostate surgery.

Finding the Meaning in Images: Annotation and Image Markup

Optimizing Analysis, Visualization, and Navigation of Large Image Data Sets: One 5000-Section CT Scan Can Ruin Your Whole Day

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Non-imaging Data Research Articles

Related Topics

Articles published on Non-imaging Data

Hierarchical Segmentations with Graphs: Quasi-flat Zones, Minimum Spanning Trees, and Saliency Maps

Fusion of fMRI and non-imaging data for ADHD classification

SP-0597: Tissue classification models for prostate based on imaging and non-imaging data

SP-0596: Machine learning and bioinformatics approaches to combine imaging with non-imaging data for outcome prediction

A concept for holistic whole body MRI data analysis, Imiomics.

Devising an interpretable calibrated scale to quantitatively assess the dementia stage of subjects with alzheimer's disease: A machine learning approach

Advances in Classification of Crops using Remote Sensing Data

Label-aligned multi-task feature learning for multimodal classification of Alzheimer's disease and mild cognitive impairment.

A positive-negative mode of population covariation links brain connectivity, demographics and behavior.

Noninvasive Multimodal Imaging to Predict Recovery of Locomotion after Extended Limb Ischemia

Efficient field-programmable gate array implementation of CCSDS 121.0-B-2 lossless data compression algorithm for image compression

Interactive Visual Analysis of Image-Centric Cohort Study Data.

A kernel-based sparsity preserving method for semi-supervised classification

Content-based medical image retrieval: a survey of applications to multidimensional and multimodality data.

Designing user interfaces to enhance human interpretation of medical content-based image retrieval: application to PET-CT images

Development of spectral indices for detecting and identifying plant diseases

Consensus embedding: theory, algorithms and application to segmentation and classification of biomedical data

Supervised regularized canonical correlation analysis: integrating histologic and proteomic measurements for predicting biochemical recurrence following prostate surgery.

Finding the Meaning in Images: Annotation and Image Markup

Optimizing Analysis, Visualization, and Navigation of Large Image Data Sets: One 5000-Section CT Scan Can Ruin Your Whole Day