Prediction of Survival and Risk Assessment Using Joint Analysis of Microarray Gene Expression Data

Haleh Yasrebi

doi:10.5075/epfl-thesis-4494

Abstract

Gene expression profiles have been widely used in molecular classification, diagnosis and prediction, particularly in the area of oncology where accurate and early diagnosis is needed for appropriate treatment. Avoiding under-/over-treatment when it is not necessary can extend a patient's survival and prevent disease recurrence. These high-throughput assay technologies have generated terabytes of data exploited extensively to provide insights on cancer biology and the underlying mechanism of disease progression. The ultimate goal is to identify possibly tailored treatment and therapy for personalized medicine. Analysis of microarray data is constrained by the following characteristics: (i) noisy due to missing or erroneous values; (ii) high dimensional due to a large number of genes versus a few number of samples in which their expression levels are measured; (iii) costly due to expensive microarray experiments. Abundant microarray gene expression data should be processed by appropriate computational and statistical learning methodologies such as machine learning techniques. These methods are robust to noisy data and have a great capacity to analyze high dimensional data. Their computational power is nevertheless limited to sample size based on which these methods are built. These algorithms have been widely applied to microarray gene expression data to identify a set of genes known as a gene signature whose expressions are highly correlated to a target value or outcome such as disease status, tumor subtype, a patient's survival time, risk of mortality or cancer relapse. Prediction of survival time and a patient's risk which is unknown at diagnosis presents a more challenging task for machine learning methods than tumor subtype or disease classification, which is already established by oncologists. The properties of microarray data cited above, the limitation of the number of samples in cancer patients and dependency of the machine learning methods' performance on sample size justify joint analysis of microarray data to increase the number of samples. We applied joint analysis methods to breast and lung cancer data sets to improve survival prediction and risk assessment. In overall, no significant improvement or deterioration of the performance accuracy was obtained with joint analysis. However, increasing sample size helped to identify robust or stable gene signatures predictive of survival time and risk assessment. Our achievements and learned-lessons from joint analysis of microarray gene expression data can be used as a guideline for future research studies in classification and prediction.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Prediction of Survival and Risk Assessment Using Joint Analysis of Microarray Gene Expression Data

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Building interpretable fuzzy models for high dimensional data analysis in cancer diagnosis.
Zhenyu Wang ... Vasile Palade
BMC Genomics | VOL. Suppl 12 2
Zhenyu Wang, et. al.Zhenyu Wang ... Vasile Palade
01 Jan 2010
BMC Genomics | VOL. Suppl 12 2

Gene expression analysis in clear cell renal cell carcinoma using gene set enrichment analysis for biostatistical management
Matthias Maruschke ... D Koczan
BJU International | VOL. 108
Matthias Maruschke, et. al.Matthias Maruschke ... D Koczan
16 Mar 2011
BJU International | VOL. 108

Compressed Sensing for Image Compression Using Wavelet Packet Analysis
Kanike Vijay Kumar ... K Suresh Reddy
International Journal of Computer Science and Informatics | VOL. -
Kanike Vijay Kumar, et. al.Kanike Vijay Kumar ... K Suresh Reddy
01 Jul 2013
International Journal of Computer Science and Informatics | VOL. -

Using Siamese Networks with Transfer Learning for Face Recognition on Small-Samples Datasets
Mohsen Heidari ... Kazim Fouladi-Ghaleh
-
Mohsen Heidari, et. al.Mohsen Heidari ... Kazim Fouladi-Ghaleh
01 Feb 2020
01 Feb 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Prediction of Survival and Risk Assessment Using Joint Analysis of Microarray Gene Expression Data

Abstract

Talk to us

Similar Papers