A Hybrid Machine Learning Approach to Screen Optimal Predictors for the Classification of Primary Breast Tumors from Gene Expression Microarray Data.

Nashwan Alromema,Asif Hassan Syed,Tabrej Khan

doi:10.3390/diagnostics13040708

Nashwan Alromema, Asif Hassan Syed + Show 1 more

Open Access

https://doi.org/10.3390/diagnostics13040708

Copy DOI

Journal: Diagnostics	Publication Date: Feb 13, 2023
Citations: 8	License type: CC BY 4.0

Affiliation: King Abdulaziz University

Abstract

The high dimensionality and sparsity of the microarray gene expression data make it challenging to analyze and screen the optimal subset of genes as predictors of breast cancer (BC). The authors in the present study propose a novel hybrid Feature Selection (FS) sequential framework involving minimum Redundancy-Maximum Relevance (mRMR), a two-tailed unpaired t-test, and meta-heuristics to screen the most optimal set of gene biomarkers as predictors for BC. The proposed framework identified a set of three most optimal gene biomarkers, namely, MAPK 1, APOBEC3B, and ENAH. In addition, the state-of-the-art supervised Machine Learning (ML) algorithms, namely Support Vector Machine (SVM), K-Nearest Neighbors (KNN), Neural Net (NN), Naïve Bayes (NB), Decision Tree (DT), eXtreme Gradient Boosting (XGBoost), and Logistic Regression (LR) were used to test the predictive capability of the selected gene biomarkers and select the most effective breast cancer diagnostic model with higher values of performance matrices. Our study found that the XGBoost-based model was the superior performer with an accuracy of 0.976 ± 0.027, an F1-Score of 0.974 ± 0.030, and an AUC value of 0.961 ± 0.035 when tested on an independent test dataset. The screened gene biomarkers-based classification system efficiently detects primary breast tumors from normal breast samples.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Hybrid Machine Learning Approach to Screen Optimal Predictors for the Classification of Primary Breast Tumors from Gene Expression Microarray Data.

Abstract

Talk to us

Similar Papers

More From: Diagnostics

Lead the way for us

Similar Papers

A Hybrid Feature Selection Approach to Screen a Novel Set of Blood Biomarkers for Early COVID-19 Mortality Prediction.
Asif Hassan Syed ... Nashwan Alromema
Diagnostics (Basel, Switzerland) | VOL. 12
Asif Hassan Syed, et. al.Asif Hassan Syed ... Nashwan Alromema
30 Jun 2022
Diagnostics (Basel, Switzerland) | VOL. 12

Pediatric Patient Traumatic Brain Injury Prediction1
Franklin Fuchs ... Omar Kamal
-
Franklin Fuchs, et. al.Franklin Fuchs ... Omar Kamal
16 Dec 2020
16 Dec 2020

Autonomous Detection of Mouse-Ear Hawkweed Using Drones, Multispectral Imagery and Supervised Machine Learning
Narmilan Amarasingam ... Mark Hamilton
Remote Sensing | VOL. 15
Narmilan Amarasingam, et. al.Narmilan Amarasingam ... Mark Hamilton
17 Mar 2023
Remote Sensing | VOL. 15

Analysis and result of classification algorithm on email classification
...
-
, et. al. ...
31 Jul 2019
31 Jul 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Hybrid Machine Learning Approach to Screen Optimal Predictors for the Classification of Primary Breast Tumors from Gene Expression Microarray Data.

Abstract

Talk to us

Similar Papers

More From: Diagnostics