Enhancing Depression Detection: A Stacked Ensemble Model with Feature Selection and RF Feature Importance Analysis Using NHANES Data

Annapoorani Selvaraj,Lakshmi Mohandoss

doi:10.3390/app14167366

Abstract

Around the world, 5% of adults suffer from depression, which is often inadequately treated. Depression is caused by a complex relationship of cultural, psychological, and physical factors. This growing issue has become a significant public health problem globally. Medical datasets often contain redundant characteristics, missing information, and high dimensionality. By using an iterative floating elimination feature selection algorithm and considering various factors, we can reduce the feature set and achieve optimized outcomes. The research utilizes the 36-Item Short Form Survey (SF-36) from the NHANES 2015–16 dataset, which categorizes data into seven groups relevant to quality of life and depression. This dataset presents a challenge due to its imbalance, with only 8.08% of individuals diagnosed with depression. The Depression Ensemble Stacking Generalization Model (DESGM) employs stratified k-fold cross-validation and oversampling for training data. DESGM enhances the classification performance of both base learners (linear support vector machine, perceptron, artificial neural network, linear discriminant analysis, and K-nearest neighbor) and meta-learners (logistic regression). The model achieved an F1 score of 0.9904 and an accuracy of 98.17%, with no instances of depression misdiagnosed.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Enhancing Depression Detection: A Stacked Ensemble Model with Feature Selection and RF Feature Importance Analysis Using NHANES Data

Abstract

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Journal: Applied Sciences	Publication Date: Aug 21, 2024
License type: CC BY 4.0

Similar Papers

Comparison of the diagnostic efficacy of mathematical models in distinguishing ultrasound imaging of breast nodules
Lu Li ... Jie Wang
Scientific Reports | VOL. 13
Lu Li, et. al.Lu Li ... Jie Wang
25 Sep 2023
Scientific Reports | VOL. 13

A comparison of machine learning algorithms for chemical toxicity classification using a simulated multi-scale data model.
Richard Judson ... R Woodrow Setzer
BMC Bioinformatics | VOL. 9
Richard Judson, et. al.Richard Judson ... R Woodrow Setzer
19 May 2008
BMC Bioinformatics | VOL. 9

Identifying chronic disease patients using predictive algorithms in pharmacy administrative claims: an application in rheumatoid arthritis
Ervant J Maksabedian Hernandez ... Jessica Tiu
Journal of Medical Economics | VOL. 24
Ervant J Maksabedian Hernandez, et. al.Ervant J Maksabedian Hernandez ... Jessica Tiu
01 Jan 2020
Journal of Medical Economics | VOL. 24

STEM: STacked Ensemble Model design for aggregation technique in Group Recommendation System
P Arun Raj Kumar ... Nagarajan Kumar
International Journal of Business Intelligence and Data Mining | VOL. 1
P Arun Raj Kumar, et. al.P Arun Raj Kumar ... Nagarajan Kumar
01 Jan 2021
International Journal of Business Intelligence and Data Mining | VOL. 1

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Enhancing Depression Detection: A Stacked Ensemble Model with Feature Selection and RF Feature Importance Analysis Using NHANES Data

Abstract

Talk to us

Similar Papers

More From: Applied Sciences