An optimal variable importance for machine learning classification models using modified simulated annealing algorithm

A Rusyana,A H Wigena,I M Sumertajaya,B Sartono

doi:10.1088/1755-1315/1356/1/012089

A Rusyana, A H Wigena + Show 2 more

Open Access

https://doi.org/10.1088/1755-1315/1356/1/012089

Copy DOI

Export

Save

Cite

Abstract
Full-Text
Similar Papers

Abstract

Listen

Abstract Each machine learning model will generate a different importance variable even though the method used is the same. Interpreting the variable significance is confusing. This study proposes combining several variable importance measures using a simulated annealing algorithm with an initial solution of mean and mode. The study uses simulation and empirical data. The simulation data are divided into three scenarios: no correlation, moderate correlation, and high correlation among predictor variables. The empirical data consist of 24 predictor variables. The machine learning models are classification models of random forest, extreme gradient boosting, neural network, and support vector machine. Based on the simulation data study, the combined variable importance will be optimal when predictor variables have low correlation. The simulated annealing algorithms show convergent objective values around the 25th iteration in empirical data. The more predictor variables, the higher the accuracy of this variable importance. Accuracy is optimal when the number of predictors exceeds ten. The five most important variables in explaining family food insecurity are the education of the family head, the floor type of the house, the number of family members who have a savings account, ownership of land, and decent drinking water.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IOP Conference Series: Earth and Environmental Science	Publication Date: Jun 1, 2024
Citations: 1	License type: cc-by

R Discovery Prime

An optimal variable importance for machine learning classification models using modified simulated annealing algorithm

Abstract

Published Version

Talk to us

Similar Papers

More From: IOP Conference Series: Earth and Environmental Science

Lead the way for us

Similar Papers

Application of machine learning model to predict osteoporosis based on abdominal computed tomography images of the psoas muscle: a retrospective study
Cheng-Bin Huang ... Tian-Hao Xu
BMC Geriatrics | VOL. 22
Cheng-Bin Huang, et. al.Cheng-Bin Huang ... Tian-Hao Xu
13 Oct 2022
BMC Geriatrics | VOL. 22

Ensemble machine learning-based models for estimating the transfer length of strands in PSC beams
Viet-Linh Tran ... Jin-Kook Kim
Expert Systems with Applications | VOL. 221
Viet-Linh Tran, et. al.Viet-Linh Tran ... Jin-Kook Kim
01 Mar 2023
Expert Systems with Applications | VOL. 221

Machine learning algorithm can provide assistance for the diagnosis of non-ST-segment elevation myocardial infarction.
Lian Qin ... Xiang Ma
Postgraduate medical journal | VOL. -
Lian Qin, et. al.Lian Qin ... Xiang Ma
16 Feb 2022
Postgraduate medical journal | VOL. -

Machine learning algorithm can provide assistance for the diagnosis of non-ST-segment elevation myocardial infarction
Lian Qin ... Quan Qi
Postgraduate Medical Journal | VOL. 42
Lian Qin, et. al.Lian Qin ... Quan Qi
16 Feb 2022
Postgraduate Medical Journal | VOL. 42

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

An optimal variable importance for machine learning classification models using modified simulated annealing algorithm

Abstract

Published Version

Talk to us

Similar Papers

More From: IOP Conference Series: Earth and Environmental Science