Background and objectiveWheezes in pulmonary sounds are anomalies which are often associated with obstructive type of lung diseases. The previous works on wheeze-type classification focused mainly on using fixed time-frequency/scale resolution based on Fourier and wavelet transforms. The main contribution of the proposed method, in which the time-scale resolution can be tuned according to the signal of interest, is to discriminate monophonic and polyphonic wheezes with higher accuracy than previously suggested time and time-frequency/scale based methods. MethodsAn optimal Rational Dilation Wavelet Transform (RADWT) based peak energy ratio (PER) parameter selection method is proposed to discriminate wheeze types. Previously suggested Quartile Frequency Ratios, Mean Crossing Irregularity, Multiple Signal Classification, Mel-frequency Cepstrum and Dyadic Discrete Wavelet Transform approaches are also applied and the superiority of the proposed method is demonstrated in leave-one-out (LOO) and leave-one-subject-out (LOSO) cross validation schemes with support vector machine (SVM), k nearest neighbor (k-NN) and extreme learning machine (ELM) classifiers. ResultsThe results show that the proposed RADWT based method outperforms the state-of-the-art time, frequency, time-frequency and time-scale domain approaches for all classifiers in both LOO and LOSO cross validation settings. The highest accuracy values are obtained as 86% and 82.9% in LOO and LOSO respectively when the proposed PER features are fed into SVM. ConclusionsIt is concluded that time and frequency domain characteristics of wheezes are not steady and hence, tunable time-scale representations are more successful in discriminating polyphonic and monophonic wheezes when compared with conventional fixed resolution representations.
Read full abstract