Discriminative Models for Speech Recognition

M.J.F Gales

doi:10.1109/ita.2007.4357576

Abstract

The vast majority of automatic speech recognition systems use hidden Markov models (HMMs) as the underlying acoustic model. Initially these models were trained based on the maximum likelihood criterion. Significant performance gains have been obtained by using discriminative training criteria, such as maximum mutual information and minimum phone error. However, the underlying acoustic model is still generative, with the associated constraints on the state and transition probability distributions, and classification is based on Bayes' decision rule. Recently, there has been interest in examining discriminative, or direct, models for speech recognition. This paper briefly reviews the forms of discriminative models that have been investigated. These include maximum entropy Markov models, hidden conditional random fields and conditional augmented models. The relationships between the various models and issues with applying them to large vocabulary continuous speech recognition will be discussed.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Discriminative Models for Speech Recognition

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Temporally Varying Weight Regression: A Semi-Parametric Trajectory Model for Automatic Speech Recognition
Shilin Liu ... Khe Chai Sim
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 22
Shilin Liu, et. al.Shilin Liu ... Khe Chai Sim
01 Jan 2014
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 22

Integrate template matching and statistical modeling for continuous speech recognition
Xie Sun
-
Xie SunXie Sun
01 Jan 2010
01 Jan 2010

Tied triphone semi-Markov model for large vocabulary continuous speech recognition
Hyunsin Park ... Chang D Yoo
-
Hyunsin Park, et. al.Hyunsin Park ... Chang D Yoo
01 Jun 2014
01 Jun 2014

Comparing Fusion Models for DNN-Based Audiovisual Continuous Speech Recognition
Ahmed Hussen Abdelaziz
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 26
Ahmed Hussen AbdelazizAhmed Hussen Abdelaziz
01 Mar 2018
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 26

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Discriminative Models for Speech Recognition

Abstract

Talk to us

Similar Papers