Abstract
According to the 2016 World Health Organization (WHO) survey, respiratory diseases are serious diseases that account for four of the top ten causes of death in the world, accounting for more than 8 million deaths worldwide. Currently, the diagnosis of respiratory disease is made by auscultation, but in order to make an accurate diagnosis, a number of abnormal patterns of respiratory sounds need to be memorized, and the results of the diagnosis are dependent on the proficiency of the physician. Therefore, a computer aided diagnosis (CAD) system is needed to quantitatively classify the respiratory sounds and output the results as a second opinion. In this paper, a short-time Fourier transformed spectrogram, a Constant-Q transformed logarithmic frequency spectrogram, and a continuous wavelet transformed scalogram are simultaneously input to VGG16 which is one of the network models of CNN(Convolutional Neural Network) and classified by LSTM (Long short-term memory). The proposed method is applied to 26 respiratory sounds, and the 0.90 of accuracy, sensitivity of 0.97, and specificity of 0.90 is obtained.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have