Systematic Comparison of Machine Learning Methods for Identification of miRNA Species as Disease Biomarkers

Chihiro Higuchi,Toshihiro Tanaka,Yukinori Okada

doi:10.1007/978-3-319-16480-9_38

Abstract

Micro RNA (miRNA) plays important roles in a variety of biological processes and can act as disease biomarkers. Thus, establishment of discovery methods to detect disease-related miRNAs is warranted. Human omics data including miRNA expression profiles have orders of magnitude with much more number of descriptors (p) than that of samples (n), which is so called “p > > n problem”. Since traditional statistical methods mislead to localized solutions, application of machine learning (ML) methods that handle sparse selection of the variables are expected to solve this problem. Among many ML methods, least absolute shrinkage and selection operator (LASSO) and multivariate adaptive regression splines (MARS) give a few variables from the result of supervised learning with endpoints such as human disease statuses. Here, we performed systematic comparison of LASSO and MARS to discover biomarkers, using six miRNA expression data sets of human disease samples, which were obtained from NCBI Gene Expression Omnibus (GEO). We additionally conducted partial least square method discriminant analysis (PLS-DA), as a control traditional method to evaluate baseline performance of discriminant methods. We observed that LASSO and MARS showed relatively higher performance compared to that of PLS-DA, as the number of the samples increases. Also, some of the identified miRNA species by ML methods have already been reported as candidate disease biomarkers in the previous biological studies. These findings should contribute to the extension of our knowledge on ML method performances in empirical utilization of clinical data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Systematic Comparison of Machine Learning Methods for Identification of miRNA Species as Disease Biomarkers

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Replicability of Machine Learning Models in the Social Sciences
Ranjith Vijayakumar ... Mike W.-L Cheung
Zeitschrift für Psychologie | VOL. 226
Ranjith Vijayakumar, et. al.Ranjith Vijayakumar ... Mike W.-L Cheung
01 Oct 2018
Zeitschrift für Psychologie | VOL. 226

Machine Learning Methods Based on CT Features Differentiate G1/G2 From G3 Pancreatic Neuroendocrine Tumors
Hai-Yan Chen ... Guo-Liang Shao
Academic radiology | VOL. 31
Hai-Yan Chen, et. al.Hai-Yan Chen ... Guo-Liang Shao
04 Dec 2023
Academic radiology | VOL. 31

Integrated multiple microarray studies by robust rank aggregation to identify immune-associated biomarkers in Crohn's disease based on three machine learning methods
Zi-An Chen ... Dong-Mei Yao
Scientific Reports | VOL. 13
Zi-An Chen, et. al.Zi-An Chen ... Dong-Mei Yao
15 Feb 2023
Scientific Reports | VOL. 13

Lasso algorithm and support vector machine strategy to screen pulmonary arterial hypertension gene diagnostic markers.
Chenyang Jiang ... Weidong Jiang
Scottish Medical Journal | VOL. 68
Chenyang Jiang, et. al.Chenyang Jiang ... Weidong Jiang
17 Oct 2022
Scottish Medical Journal | VOL. 68

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Systematic Comparison of Machine Learning Methods for Identification of miRNA Species as Disease Biomarkers

Abstract

Talk to us

Similar Papers