Implementation of Speech Enhancement Using Bionic Wavelet Transform

R Bhagya, P R Ashwini, R Bharathi

doi:10.1007/978-981-19-9638-2_28

Abstract

Researchers in speech processing have focused on minimizing the impact of environmental noise, which degrades the performance of systems such as speaker recognition and speech recognition. The speech enhancement approach proposed here for denoising speech signals corrupted by various types of noise is based on the Bionic Wavelet Transform (BWT) and mean square error. A thresholding method is used to denoise the signal based on its spectral amplitude. The inverse bionic wavelet transform is then applied to the denoised coefficients to obtain the enhanced speech signal. Speech quality measures and speech intelligibility measures are used to assess the performance of the proposed technique, which is compared with a Continuous Wavelet Transform (CWT) approach. Mel frequency cepstral coefficient (MFCC) features are extracted from the denoised signal for speaker recognition. Machine learning classifiers, namely K-nearest neighbors (KNN), Support Vector Machine (SVM), and Convolutional Neural Network (CNN), are used to recognize the speaker. The CNN recognized six different speakers more effectively than SVM and KNN. With a database of size 2500, the CNN achieves a training accuracy of 98% and a test accuracy of 82% on the enhanced signal.
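The thresholding step and the mean square error criterion mentioned above can be illustrated with a minimal sketch. The abstract does not state the exact threshold rule, so the soft-thresholding function and the universal threshold with a median-based noise estimate below are assumptions chosen for illustration, and all function names are hypothetical. The sketch operates on a generic array of transform coefficients and does not compute the BWT itself.

```python
import numpy as np


def soft_threshold(coeffs, threshold):
    """Shrink coefficients toward zero; magnitudes below the threshold become 0.

    Assumption: soft thresholding is used here for illustration only.
    """
    coeffs = np.asarray(coeffs, dtype=float)
    return np.sign(coeffs) * np.maximum(np.abs(coeffs) - threshold, 0.0)


def universal_threshold(coeffs):
    """Universal threshold sigma * sqrt(2 ln N).

    Assumption: sigma is estimated from the median absolute coefficient
    divided by 0.6745, a common choice that is not taken from the paper.
    """
    coeffs = np.asarray(coeffs, dtype=float)
    sigma = np.median(np.abs(coeffs)) / 0.6745
    return sigma * np.sqrt(2.0 * np.log(coeffs.size))


def mean_square_error(reference, estimate):
    """Mean square error between a reference signal and an estimate."""
    reference = np.asarray(reference, dtype=float)
    estimate = np.asarray(estimate, dtype=float)
    return float(np.mean((reference - estimate) ** 2))
```

In a pipeline of the kind described, the coefficients would come from the forward transform of the noisy speech, the thresholded coefficients would be passed to the inverse transform, and the mean square error would compare the result against a clean reference when one is available.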

Full Text