Abstract
In the last few decades, Deep Neural Networks (DNNs) has shown outstanding performance in speech recognition applications. We demonstrate that the improved accuracy obtained by Deep Convolutional Neural Network (DCNN) arose from their capacity to extract discriminative representations which are robust to various sources of variability in speech signals. By this study, we propose a new algorithm, named Scattering Transform-Deep Convolutional Neural Network CNN: ST-DCNN to identify normal and pathological voices. The effectiveness of advances in speech features have been proven to be the root for an efficient pathological voices classification. The proposed algorithm involved two stages: First, scatter wavelet features are extracted. Then, DCNN is used to classify the voices samples. We evaluated the robustness of the proposed system in silent environments. The experimental results indicates that it achieves better performance with scattering wavelet and DCNN with the clean data within 99.62 % of recognition rate.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.