Improved microarray data analysis using feature selection methods with machine learning methods

Jing Sun Jing Sun,Kalpdrum Passi,Chakresh Kumar Jain

doi:10.1109/bibm.2016.7822748

Abstract

Microarray data analysis directly relates with the state of disease through gene expression profile, and is based upon several feature extractions to classification methodologies. This paper focuses on the study of 8 different ways of feature selection preprocess methods from 4 different feature selection methods. They are Minimum Redundancy-Maximum Relevance (mRMR), Max Relevance (MaxRel), Quadratic Programming Feature Selection (QPFS) and Partial Least Squared (PLS) methods. In this study, microarray datasets of colon cancer and leukemia cancer were used for implementing and testing four different classifiers i.e. K-Nearest-Neighbor (KNN), Random Forest (RF), Support Vector Machine (SVM) and Neural Network (NN). The performance was measured by accuracy and AUC (area under the curve) value. The experimental results show that discretization can somehow improve performance of microarray data analysis, and mRMR gives the best performance of microarray data analysis on the colon and leukemia datasets. We also list some results on comparative performance of methods for the specific (data-ratio) number of features.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Improved microarray data analysis using feature selection methods with machine learning methods

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Mass Classification in Mammograms Using Selected Geometry and Texture Features, and a New SVM-Based Feature Selection Method
Xiaoming Liu ... Jinshan Tang
IEEE Systems Journal | VOL. 8
Xiaoming Liu, et. al.Xiaoming Liu ... Jinshan Tang
01 Sep 2014
IEEE Systems Journal | VOL. 8

Application of information theoretic feature selection and machine learning methods for the development of genetic risk prediction models
... John Bowes
Scientific reports | VOL. 11
, et. al. ... John Bowes
01 Dec 2021
Scientific reports | VOL. 11

Analysis of Cross-Combinations of Feature Selection and Machine-Learning Classification Methods Based on [18F]F-FDG PET/CT Radiomic Features for Metabolic Response Prediction of Metastatic Breast Cancer Lesions.
Ober Van Gómez ... Alexander Haug
Cancers | VOL. 14
Ober Van Gómez, et. al.Ober Van Gómez ... Alexander Haug
14 Jun 2022
Cancers | VOL. 14

An Empirical Study of Several Information Theoretic Based Feature Extraction Methods for Classifying High Dimensional Low Sample Size Data
Sheena Leeza Verghese ... Tomas H Maul
IEEE Access | VOL. 9
Sheena Leeza Verghese, et. al.Sheena Leeza Verghese ... Tomas H Maul
01 Jan 2020
IEEE Access | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improved microarray data analysis using feature selection methods with machine learning methods

Abstract

Talk to us

Similar Papers