Abstract

In speech recognition, the state emission probability distributions of hidden Markov models are traditionally modeled as continuous random variables using Gaussian mixtures. However, individual Gaussians are unimodal and therefore cannot accurately capture complex multimodal inter-feature dependencies; Gaussian mixtures can approximate such cases, but only in a loose and inefficient way. Graphical models provide a simple and precise mechanism for modeling the dependencies among two or more variables. This paper proposes using discrete random variables as observations, together with graphical models, to extract the internal dependency structure of the feature vectors. To keep the model tractable, the speech features are quantized to a small number of levels. These quantized speech features also increase robustness against noise uncertainty. In addition, discrete random variables make it possible to learn joint statistics of the observation densities. The paper presents a method for estimating a graphical model with a constrained number of dependencies, which constitutes a special kind of Bayesian network. Experimental results show that this modeling achieves better performance than standard baseline systems.
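As a rough illustration of the two ingredients described in the abstract, the Python sketch below quantizes continuous feature vectors to a few discrete levels and then estimates a dependency structure with a bounded number of edges, here via a Chow-Liu-style maximum spanning tree over pairwise mutual information. The tree construction, the equal-frequency binning, the level count, and all function names are illustrative assumptions standing in for the paper's actual estimator, which is not specified in the abstract.

```python
# Minimal sketch (assumed, not the paper's exact method): quantize continuous
# features to a few discrete levels, then bound the number of modeled
# dependencies with a Chow-Liu-style maximum spanning tree (one parent per
# feature dimension).
import numpy as np
from itertools import combinations

def quantize(features, n_levels=4):
    """Map each feature dimension to n_levels symbols via equal-frequency bins."""
    quantized = np.empty_like(features, dtype=np.int64)
    for d in range(features.shape[1]):
        # Interior quantile cut points for this dimension.
        edges = np.quantile(features[:, d], np.linspace(0, 1, n_levels + 1)[1:-1])
        quantized[:, d] = np.digitize(features[:, d], edges)
    return quantized

def mutual_information(x, y, n_levels):
    """Empirical mutual information between two discrete variables."""
    joint = np.zeros((n_levels, n_levels))
    for xi, yi in zip(x, y):
        joint[xi, yi] += 1
    joint /= joint.sum()
    px, py = joint.sum(axis=1), joint.sum(axis=0)
    nz = joint > 0
    return float((joint[nz] * np.log(joint[nz] / np.outer(px, py)[nz])).sum())

def chow_liu_edges(quantized, n_levels):
    """Kruskal-style maximum spanning tree over pairwise mutual information,
    keeping at most D-1 dependencies among D feature dimensions."""
    D = quantized.shape[1]
    scored = sorted(
        ((mutual_information(quantized[:, i], quantized[:, j], n_levels), i, j)
         for i, j in combinations(range(D), 2)),
        reverse=True)
    parent = list(range(D))          # union-find forest for cycle detection
    def find(a):
        while parent[a] != a:
            parent[a] = parent[parent[a]]
            a = parent[a]
        return a
    edges = []
    for mi, i, j in scored:
        ri, rj = find(i), find(j)
        if ri != rj:                 # keep the edge only if it adds no cycle
            parent[ri] = rj
            edges.append((i, j, mi))
    return edges

# Toy usage on synthetic "frames x dimensions" data with one real dependency.
rng = np.random.default_rng(0)
frames = rng.normal(size=(1000, 6))
frames[:, 1] += 0.8 * frames[:, 0]
q = quantize(frames, n_levels=4)
print(chow_liu_edges(q, n_levels=4)[:3])
```

In this sketch the strongest recovered edge should connect dimensions 0 and 1, the pair with an injected dependency; a real system would score edges on quantized acoustic features (e.g., cepstral coefficients) and use the resulting structure to define the discrete emission distributions.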
