Abstract

Classifying imbalanced data is a common problem with pathological voice detection. Traditional classification algorithms usually assume that the number of samples in each category is similar, and that the cost of misclassification in training is equal. However, the cost of misclassifying pathological samples in pathological voice detection is higher than that of normal samples. Here, a hybrid sampling algorithm combined with optimal two-factor random forests is proposed for imbalanced classification of pathological voice detection. On the basis of two-factor random forests, it combines the synthetic minority oversampling technique (SMOTE) with the edited nearest neighbor (ENN) algorithm. SMOTE is used to increase the number of samples in a minority class. The oversampling rate of SMOTE is the out-of-bag misclassification rate of the two-factor random forests. ENN is then used to remove the noise in the majority class samples. Finally, the two-factor random forests classifies the resampled voice, and stops the iteration according to a classification evaluation index (such as the F1-macro). Binary classification and multi-classification between normal and pathological voices in the Massachusetts Eye and Ear Infirmary database demonstrate that the proposed algorithm effectively handles the problem of imbalanced pathological voice classification. Compared with a traditional sampling algorithm, the accuracy and recall of the proposed algorithm in multi-classification of voice disorders increased by 3.64% and 2.25%, respectively.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call