Speech Enhancement and Recognition Using Deep Learning Algorithms: A Review

D Hepsiba,R Vinotha,L D Vijay Anand

doi:10.1007/978-981-19-9819-5_20

Abstract

In this paper, different methods for speech recognition and speech enhancement are reviewed. Usually, speech enhancement act as a frontend system to enhance the automatic speech recognition (ASR) system performance. Signals captured by the microphones are distorted by reverberations and background noise. A degradation of the signal would make it difficult for the speech recognition system to recognize the speech. By identifying the magnitude spectrograms of the degraded speech, recurrent neural networks (RNN) and deep neural networks (DNN) are trained to perform spectral masking and also perform some algorithms such as Transformer-based neural network (TSTNN), minimum overlap-gap algorithm, residual long short-term memory neural network (ResLSTM), and deep complex convolution recurrent network (DCCRN). It is also possible to amplify and recognize speech by using certain processing technique including spectral subtraction, Wiener and Kalman filtering, MMSE estimation, phase spectrum compensation, multichannel end-to-end system (ME2E), binaural codebook-based speech enhancement, progressive learning-based adaptive noise and speech estimation (PL-ANSE) method, voice activity detection (VAD), adaptive noise reduction algorithms, and beamforming. Hence, the noise embedded in the speech needs to be eliminated for making the speech recognition system more effective in understanding the speech.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speech Enhancement and Recognition Using Deep Learning Algorithms: A Review

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Combined speech enhancement and auditory modelling for robust distributed speech recognition
Ronan Flynn ... Edward Jones
Speech Communication | VOL. 50
Ronan Flynn, et. al.Ronan Flynn ... Edward Jones
20 May 2008
Speech Communication | VOL. 50

Speech Enhancement System for Automatic Speech Recognition in Automotive Environment
Gokul G Nair ... C Santhosh Kumar
-
Gokul G Nair, et. al.Gokul G Nair ... C Santhosh Kumar
06 Jul 2021
06 Jul 2021

Ses Tanıma için Derin Öğrenme Mimarileri Üzerine Derleme
Yeşim Dokuz ... Zekeriya Tüfekci̇
European Journal of Science and Technology | VOL. -
Yeşim Dokuz, et. al.Yeşim Dokuz ... Zekeriya Tüfekci̇
30 Apr 2020
European Journal of Science and Technology | VOL. -

A Novel N-Average Wavelet Algorithm for a Voice-Based Wheel Chair
E Chandra
-
E ChandraE Chandra
04 Apr 2022
04 Apr 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speech Enhancement and Recognition Using Deep Learning Algorithms: A Review

Abstract

Talk to us

Similar Papers