Objective. Sleep staging is the basis for sleep quality assessment and diagnosis of sleep-related disorders. In response to the inadequacy of traditional manual judgement of sleep stages, using machine learning techniques for automatic sleep staging has become a hot topic. To improve the performance of sleep staging, numerous studies have extracted a large number of sleep-related characteristics. However, there are redundant and irrelevant features in the high-dimensional features that reduce the classification accuracy. To address this issue, an effective hybrid feature selection method based on the entropy weight method is proposed in this paper for automatic sleep staging. Approach. Firstly, we preprocess the four modal polysomnography (PSG) signals, including electroencephalogram (EEG), electrooculogram (EOG), electrocardiogram (ECG) and electromyogram (EMG). Secondly, the time domain, frequency domain and nonlinear features are extracted from the preprocessed signals, with a total of 185 features. Then, in order to acquire characteristics of the multi-modal signals that are highly correlated with the sleep stages, the proposed hybrid feature selection method is applied to choose effective features. This method is divided into two stages. In stage I, the entropy weight method is employed to combine two filter methods to build a subset of features. This stage evaluates features based on information theory and distance metrics, which can quickly obtain a subset of features and retain the relevant features. In stage II, Sequential Forward Selection is used to evaluate the subset of features and eliminate redundant features. Further more, to achieve better performance of classification, an ensemble model based on support vector machine, K-nearest neighbor, random forest and multilayer perceptron is finally constructed for classifying sleep stages. Main results. The experiment using the Cyclic Alternating Pattern (CAP) sleep database is performed to assess the performance of the method proposed in this paper. The proposed hybrid feature selection method chooses only 30 features highly correlated to sleep stages. The accuracy, F1 score and Kappa coefficient of 6 class sleep staging reach 88.86%, 83.15% and 0.8531%, respectively. Significance. Experimental results show the effectiveness of the proposed method compared to the existing state-of-the-art studies. It greatly reduces the number of features required while achieving outstanding auto-sleep staging results.
Read full abstract