Abstract
This paper presents a new feature selection method based on the changes in out-of-bag (OOB) Cohen kappa values of a random forest (RF) classifier, which was tested on the automatic detection of sleep apnea based on the oxygen saturation signal (SpO2). The feature selection method is based on the RF predictor importance defined as the increase in error when features are permuted. This method is improved by changing the classification error into the Cohen kappa value, by adding an extra factor to avoid correlated features and by adapting the OOB sample selection to obtain a patient independent validation. When applying the method for sleep apnea classification, an optimal feature set of 3 parameters was selected out of 286. This was half of the 6 features that were obtained in our previous study. This feature reduction resulted in an improved interpretability of our model, but also a slight decrease in performance, without affecting the clinical screening performance. Feature selection is an important issue in machine learning and especially biomedical informatics. This new feature selection method introduces interesting improvements of RF feature selection methods, which can lead to a reduced feature set and an improved classifier interpretability.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.