Taylor-DBN: A new framework for speech recognition systems

Arul Valiyavalappil Haridas, V G Sivakumar, Basabi Chakraborty, Ramalatha Marimuthu

doi:10.1142/s021969132050071x

Abstract

Speech recognition is a rapidly growing research area because the speech signal carries both linguistic and speaker information that can be used in applications such as surveillance, authentication, and forensics. The performance of speech recognition systems degrades sharply under channel degradation, mismatched conditions, and noise. To improve recognition performance, the Taylor-Deep Belief Network (Taylor-DBN) classifier is proposed, which modifies the Gradient Descent (GD) algorithm of the existing DBN classifier using the Taylor series. First, noise in the speech signal is removed through speech signal enhancement. Features such as Holoentropy with the eXtended Linear Prediction using autocorrelation Snapshot (HXLPS), spectral kurtosis, and spectral skewness are then extracted from the enhanced signal and fed to the Taylor-DBN classifier, which identifies the speech of impaired persons. Experiments are carried out on the TensorFlow speech recognition database, a real database, and the ESC-50 dataset. On the TensorFlow speech recognition database, the Taylor-DBN achieves an accuracy, False Acceptance Rate (FAR), False Rejection Rate (FRR), and Mean Square Error (MSE) of 96.95%, 3.04%, 3.04%, and 0.045, respectively; on the real database, the corresponding values are 96.67%, 3.32%, 3.32%, and 0.0499. On the ESC-50 dataset, the accuracy, FAR, FRR, and MSE are 96.81%, 3.18%, 3.18%, and 0.047, respectively. The results indicate that the Taylor-DBN outperforms the existing conventional methods.
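The abstract reports accuracy, FAR, FRR, and MSE but does not state how they are computed. The sketch below shows one common way to obtain these four metrics for a binary accept/reject task. It assumes the standard definitions FAR = FP / (FP + TN) and FRR = FN / (FN + TP), and an MSE taken between the target labels and the classifier's output scores; the paper's exact definitions may differ. The function name `evaluate_binary` and the sample data are hypothetical, chosen only for illustration.

```python
from typing import Dict, Sequence


def evaluate_binary(y_true: Sequence[int], y_pred: Sequence[int],
                    scores: Sequence[float]) -> Dict[str, float]:
    """Accuracy, FAR, FRR and MSE for a binary accept (1) / reject (0) task.

    Assumed definitions (not confirmed by the abstract):
      FAR = FP / (FP + TN)   fraction of negatives wrongly accepted
      FRR = FN / (FN + TP)   fraction of positives wrongly rejected
      MSE = mean((label - score)^2)
    """
    n = len(y_true)
    if n == 0 or len(y_pred) != n or len(scores) != n:
        raise ValueError("inputs must be non-empty and of equal length")

    # Confusion-matrix counts.
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    accuracy = (tp + tn) / n
    # Guard against a class being absent from the evaluation set.
    far = fp / (fp + tn) if (fp + tn) else 0.0
    frr = fn / (fn + tp) if (fn + tp) else 0.0
    mse = sum((t - s) ** 2 for t, s in zip(y_true, scores)) / n

    return {"accuracy": accuracy, "far": far, "frr": frr, "mse": mse}


# Hypothetical toy data: 4 positive and 4 negative trials.
labels = [1, 1, 1, 1, 0, 0, 0, 0]
decisions = [1, 1, 1, 0, 0, 0, 0, 1]
outputs = [0.9, 0.8, 0.7, 0.4, 0.1, 0.2, 0.3, 0.6]
print(evaluate_binary(labels, decisions, outputs))
```

The function returns fractions in the range 0 to 1; multiplying accuracy, FAR, and FRR by 100 gives percentages in the form quoted in the abstract.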


Journal: International Journal of Wavelets, Multiresolution and Information Processing
Publication Date: Dec 11, 2020
Citations: 1

Similar Papers

Combined speech enhancement and auditory modelling for robust distributed speech recognition
Ronan Flynn ... Edward Jones
Speech Communication | VOL. 50
20 May 2008

Feature Level Solution to Noise Robust Speech Recognition in the context of Tonal Languages
Utpal Bhattacharjee ... Jyoti Mannala
International Journal of Engineering and Advanced Technology | VOL. 9
30 Dec 2020

Time scale modification and vocal tract length normalization for improving the performance of Tamil speech recognition system implemented using language independent segmentation algorithm
S Saraswathi ... T V Geetha
International Journal of Speech Technology | VOL. 9
01 Dec 2006

Autocorrelation-based Methods for Noise-Robust Speech Recognition
Gholamreza Farahani ... Mohammad Mehdi
01 Jun 2007
