Noise Robust Speech Recognition Using Deep Belief Networks

Mahboubeh Farahat, Ramin Halavati

doi:10.1142/s146902681650005x

Abstract

Most current speech recognition systems use Hidden Markov Models (HMMs) to deal with the temporal variability of speech and Gaussian mixture models (GMMs) to determine how well each state of each HMM fits a frame, or a short window of frames, of coefficients that represent the acoustic input. In these systems, the acoustic input is represented as a sequence of frames of Mel Frequency Cepstral Coefficients (MFCCs). However, MFCCs are not robust to noise. Consequently, when training and test conditions differ, the accuracy of speech recognition systems decreases. Moreover, using MFCCs from a larger window of frames in GMMs requires more computational power. In this paper, Deep Belief Networks (DBNs) are used to extract discriminative information from a larger window of frames. The nonlinear transformations yield high-order, low-dimensional features that are robust to variation in the input speech. Multi-speaker isolated word recognition tasks with 100 and 200 words, in clean and noisy environments, were used to test this method. The experimental results indicate that this new method of feature encoding results in much better word recognition accuracy.
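As a rough illustration of what "a larger window of frames" can mean as network input, the sketch below stacks each frame with its neighbours into one vector. It is a minimal sketch under stated assumptions, not the authors' pipeline: the function name `splice_frames`, the 13 coefficients per frame, the context of 5 frames on each side, and the edge-padding choice are all hypothetical and do not come from the abstract.

```python
import numpy as np

def splice_frames(features: np.ndarray, context: int) -> np.ndarray:
    """Stack each frame with `context` frames on either side.

    features: (num_frames, num_coeffs) array, e.g. MFCCs.
    Returns:  (num_frames, (2 * context + 1) * num_coeffs) array.
    """
    num_frames, _ = features.shape
    # Assumption: repeat the first and last frames so every frame has a full window.
    padded = np.pad(features, ((context, context), (0, 0)), mode="edge")
    # The window for frame t covers padded rows t .. t + 2 * context.
    windows = [padded[t:t + 2 * context + 1].reshape(-1) for t in range(num_frames)]
    return np.stack(windows)

# Hypothetical sizes: 100 frames of 13 coefficients, 5 frames of context per side.
mfcc = np.random.default_rng(0).normal(size=(100, 13))
spliced = splice_frames(mfcc, context=5)
print(spliced.shape)  # (100, 143)
```

The input dimension grows linearly with the window size (here from 13 to 143), which is one way to see why wide windows become costly for GMMs and why a learned low-dimensional encoding of the window is attractive.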

Journal: International Journal of Computational Intelligence and Applications
Publication Date: Mar 1, 2016
Citations: 4

Similar Papers

Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups
Geoffrey Hinton ... George Dahl
IEEE Signal Processing Magazine | VOL. 29
01 Nov 2012

Enhancing quality and accuracy of speech recognition system by using multimodal audio-visual speech signal
Eslam E El Maghraby ... Amr M Gody
01 Dec 2016

Voice and speech recognition in Tamil language
Kiran R ... Nivedha K
01 Feb 2017

Acoustic modeling using deep belief network for Bangla speech recognition
Mahtab Ahmed ... M A H Akhand
01 Dec 2015
