HMM modeling for speaker independent voice dialing in car environment

L Fissore,P Ruscitti,P Laface

doi:10.1109/icassp.1992.225925

Abstract

The authors describe the development of a speaker-independent isolated word recognizer for a voice dialing application operating in the car environment. Speaker-dependent and speaker-independent approaches are addressed and compared. Simple continuous hidden Markov models (HMMs) are used for speaker-dependent recognition; multiple codebook discrete and continuous HMMs are trained by speaker-independent reference data derived from a large database of speech collected inside several cars under a wide variety of driving conditions and by a large number of speakers from different Italian regions. By modeling separately two models (one for male and one for female speakers) for each word with 12 state continuous density whole word HMMs with eight diagonal covariance Gaussians per state, and performing a beam search Viterbi decoding a recognition rate of 99% has been obtained (65 errors out of 6423 words).< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">></ETX>

Full Text