Acoustic modeling for speech recognition based on a generalized Laplacian mixture distribution

Atsushi Nakamura

doi:10.1002/ecjb.10093

Abstract

AbstractIn acoustic modeling for speech recognition, the Gaussian distribution or the Gaussian mixture distribution is widely used. The general reason for preference of the Gaussian distribution in the parametric modeling of an unknown ensemble is the central limit theorem. The Gaussian distribution has many properties that are theoretically clear. For the particular problem, however, in which the time series of an acoustic feature is to be modeled on the basis of a limited number of training samples for speech recognition, there is no guarantee that the method based on the Gaussian distribution is always optimal. Consequently, this paper proposes an acoustic modeling approach based on the generalized Laplacian distribution, which can represent a wider range of distribution shapes, including the Laplacian and Gaussian distributions. The formulation of the generalized Laplacian distribution and the method of estimation of the distribution parameters are described. The acoustic model with the generalized Laplacian mixture output distribution is constructed by retraining of the hidden Markov model with the Gaussian mixture output distribution. It is shown by a continuous speech recognition experiment using natural uttered speech that the recognition performance is improved compared to recognition based on the Gaussian mixture distribution. © 2002 Wiley Periodicals, Inc. Electron Comm Jpn Pt 2, 85(11): 32–42, 2002; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/ecjb.10093

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Acoustic modeling for speech recognition based on a generalized Laplacian mixture distribution

Abstract

Talk to us

Similar Papers

More From: Electronics and Communications in Japan (Part II: Electronics)

Lead the way for us

Journal: Electronics and Communications in Japan (Part II: Electronics)	Publication Date: Oct 23, 2002
Citations: 9

Similar Papers

Fast and accurate recurrent neural network acoustic models for speech recognition
Haşim Sak ... Kanishka Rao
-
Haşim Sak, et. al.Haşim Sak ... Kanishka Rao
06 Sep 2015
06 Sep 2015

SAR Image Despeckling Based on a Mixture of Gaussian Distributions with Local Parameters and Multiscale Edge Detection in Lapped Transform Domain
Deepika Hazarika ... Manbendra Bhuyan
Sensing and Imaging | VOL. 17
Deepika Hazarika, et. al.Deepika Hazarika ... Manbendra Bhuyan
24 Aug 2016
Sensing and Imaging | VOL. 17

Robust i-vector based adaptation of DNN acoustic model for speech recognition
Sri Garimella ... Sree Hari Krishnan Parthasarathi
-
Sri Garimella, et. al.Sri Garimella ... Sree Hari Krishnan Parthasarathi
06 Sep 2015
06 Sep 2015

Optimizing acoustic models for commercial speech recognition using foreground scores and data weighting
D Boies ... B Strope
-
D Boies, et. al.D Boies ... B Strope
17 May 2004
17 May 2004

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Acoustic modeling for speech recognition based on a generalized Laplacian mixture distribution

Abstract

Talk to us

Similar Papers

More From: Electronics and Communications in Japan (Part II: Electronics)