GMM-based acoustic modeling for embedded speech recognition

Christophe Lévy,Jean-François Bonastre,Georges Linarès

doi:10.21437/interspeech.2006-479

Christophe Lévy, Jean-François Bonastre + Show 1 more

https://doi.org/10.21437/interspeech.2006-479

Copy DOI

Export

Save

Cite

Publication Date: Sep 17, 2006

Citations: 4

Abstract
Full-Text
Similar Papers

Abstract

Listen

Speech recognition applications are known to require a significant amount of resources (training data, memory, computing power). However, the targeted context of this work - mobile phone embedded speech recognition system - only authorizes few KB of memory, few MIPS and usually small amount of training data. In order to fit the resource constraints, an approach based on a semi-continuous HMM system using a GMM-based stateindependent acoustic modeling is proposed in this paper. A transformation is computed and applied to the global GMM in order to obtain each of the HMM state-dependent probability density functions. This strategy aims at storing only the transformation function parameters for each state and authorizes to decrease the amount of computing power needed for the likelihood computation. The proposed approach is evaluated on two tasks: a digit recognition task using the French corpus BDSON (which allows a Digit Error Rate of 2.5%) and a voice command task using French corpus VODIS (the Command Error Rate leads around 4.1%). Index Terms: embedded speech recognition, acoustic modeling.

Full Text