A case study of speech recognition in Spanish: From conventional to deep approach

Aldonso Becerra,J Ismael De La Rosa,Efren Gonzalez

doi:10.1109/andescon.2016.7836212

Abstract

The aim of this paper is to exhibit a comparative case study of the conventional speech recognition GMM-HMM (Gaussian mixture model — hidden Markov model) architecture and the recent model based on deep neural networks. During years the GMM approach has controlled the speech recognition tasks, however it has been surpassed with the resurgence of artificial neural networks. To exemplify these acoustic modeling frameworks, a case study has been conducted by using the Kaldi toolkit, employing a personalized speaker-independent mid-vocabulary voice corpus for recognition of digit strings and personal name lists in latin spanish on a connected-words phone dialing task. The speech recognition accuracy obtained in the results shows a better word error rate by using the DNN acoustic modeling. A 20.71% relative improvement is obtained with DNN-HMM models (3.33% WER) in respect to the lowest GMM-HMM rate (4.20% WER).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A case study of speech recognition in Spanish: From conventional to deep approach

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Speech recognition in a dialog system: from conventional to deep processing
Aldonso Becerra ... J Ismael De La Rosa
Multimedia Tools and Applications | VOL. 77
Aldonso Becerra, et. al.Aldonso Becerra ... J Ismael De La Rosa
06 Sep 2017
Multimedia Tools and Applications | VOL. 77

Development of Speaker-Independent Automatic Speech Recognition System for Kannada Language
Praveen Kumar ... H S Jayanna
Indian Journal of Science and Technology | VOL. 15
Praveen Kumar, et. al.Praveen Kumar ... H S Jayanna
27 Feb 2022
Indian Journal of Science and Technology | VOL. 15

Speech recognition using deep neural networks trained with non-uniform frame-level cost functions
Aldonso Becerra ... N Iracemi Escalante
-
Aldonso Becerra, et. al.Aldonso Becerra ... N Iracemi Escalante
01 Nov 2017
01 Nov 2017

Multilingual exemplar-based acoustic model for the NIST Open KWS 2015 evaluation
Do Van Hai ... Xiong Xiao
-
Do Van Hai, et. al.Do Van Hai ... Xiong Xiao
01 Dec 2015
Multilingual exemplar-based acoustic model for the NIST Open KWS 2015 evaluation
Do Van Hai ... Xiong Xiao

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A case study of speech recognition in Spanish: From conventional to deep approach

Abstract

Talk to us

Similar Papers