A kernel-based integration of genome-wide data for clinical decision support

Anneleen Daemen,Fabian Ojeda,Johan Ak Suykens,Annelies Debucquoy,Olivier Gevaert,Jean-Pascal Machiels,Bart De Moor,Christine Sempoux,Karin Haustermans

doi:10.1186/gm39

Anneleen Daemen, Fabian Ojeda + Show 7 more

Open Access

https://doi.org/10.1186/gm39

Copy DOI

Journal: Genome Medicine	Publication Date: Jan 1, 2009
Citations: 149	License type: CC BY 2.0

Affiliation: KU Leuven, Université Catholique de Louvain

Abstract

BackgroundAlthough microarray technology allows the investigation of the transcriptomic make-up of a tumor in one experiment, the transcriptome does not completely reflect the underlying biology due to alternative splicing, post-translational modifications, as well as the influence of pathological conditions (for example, cancer) on transcription and translation. This increases the importance of fusing more than one source of genome-wide data, such as the genome, transcriptome, proteome, and epigenome. The current increase in the amount of available omics data emphasizes the need for a methodological integration framework.MethodsWe propose a kernel-based approach for clinical decision support in which many genome-wide data sources are combined. Integration occurs within the patient domain at the level of kernel matrices before building the classifier. As supervised classification algorithm, a weighted least squares support vector machine is used. We apply this framework to two cancer cases, namely, a rectal cancer data set containing microarray and proteomics data and a prostate cancer data set containing microarray and genomics data. For both cases, multiple outcomes are predicted.ResultsFor the rectal cancer outcomes, the highest leave-one-out (LOO) areas under the receiver operating characteristic curves (AUC) were obtained when combining microarray and proteomics data gathered during therapy and ranged from 0.927 to 0.987. For prostate cancer, all four outcomes had a better LOO AUC when combining microarray and genomics data, ranging from 0.786 for recurrence to 0.987 for metastasis.ConclusionsFor both cancer sites the prediction of all outcomes improved when more than one genome-wide data set was considered. This suggests that integrating multiple genome-wide data sources increases the predictive performance of clinical decision support models. This emphasizes the need for comprehensive multi-modal data. We acknowledge that, in a first phase, this will substantially increase costs; however, this is a necessary investment to ultimately obtain cost-efficient models usable in patient tailored therapy.

Highlights

Microarray technology allows the investigation of the transcriptomic make-up of a tumor in one experiment, the transcriptome does not completely reflect the underlying biology due to alternative splicing, post-translational modifications, as well as the influence of pathological conditions on transcription and translation
Study I: rectal cancer Using the methodologies shown in Figure 1, models were built using microarray and proteomics data of 36 rectal cancer patients at two time points during therapy for the prediction of three outcomes registered at the moment of surgery: a tumor regression grading system (WHEELER) and two prognostic factors, pathologic N stage at surgery and the circumferential margin involvement (CRM)
The step A models are model based on microarray data at time point before treatment (T0) (MT0), model based on microarray data at T1 (MT1), model based on proteomics data at T0 (PT0), and model based on proteomics data at T1 (PT1)

Summary

Introduction

Microarray technology allows the investigation of the transcriptomic make-up of a tumor in one experiment, the transcriptome does not completely reflect the underlying biology due to alternative splicing, post-translational modifications, as well as the influence of pathological conditions (for example, cancer) on transcription and translation This increases the importance of fusing more than one source of genome-wide data, such as the genome, transcriptome, proteome, and epigenome. Based on a strong theoretical framework, their rapid uptake in applications such as bioinformatics [2], chemoinformatics, and even computational linguistics is due to their reliability, accuracy, and computational efficiency They have the capability to handle a very wide range of data types (for example, kernel methods have been used to analyze sequences, vectors, networks, phylogenetic trees, and so on). To account for the unbalancedness in many twoclass problems, this linear problem is extended with weights that are different for the positive and negative classes

Objectives

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A kernel-based integration of genome-wide data for clinical decision support

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Genome Medicine

Lead the way for us

Similar Papers

A Predictive Model for Personalized Therapeutic Interventions in Non-small Cell Lung Cancer.
Nelofar Kureshi ... Syed Sibte Raza Abidi
IEEE Journal of Biomedical and Health Informatics | VOL. 20
Nelofar Kureshi, et. al.Nelofar Kureshi ... Syed Sibte Raza Abidi
04 Dec 2014
IEEE Journal of Biomedical and Health Informatics | VOL. 20

Towards an Intelligent Integrated Approach for Clinical Decision Support
Rajendra Akerkar
-
Rajendra AkerkarRajendra Akerkar
01 Jan 2015
01 Jan 2015

GAN-Based Semi-Supervised Learning Approach for Clinical Decision Support in Health-IoT Platform
Yun Yang ... Dehai Zhang
IEEE Access | VOL. 7
Yun Yang, et. al.Yun Yang ... Dehai Zhang
01 Jan 2019
IEEE Access | VOL. 7

Automated Diagnosis of Prostate Cancer Using mpMRI Images: A Deep Learning Approach for Clinical Decision Support
Anil B Gavade ... Shridhar Sunilkumar Pol
Computers | VOL. 12
Anil B Gavade, et. al.Anil B Gavade ... Shridhar Sunilkumar Pol
28 Jul 2023
Computers | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A kernel-based integration of genome-wide data for clinical decision support

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Genome Medicine