Speaker-independent continuous speech dictation

J.L Gauvain,L.F Lamel,G Adda,M Adda-Decker

doi:10.1016/0167-6393(94)90038-8

Abstract

In this paper we report on progress made at LIMSI in speaker-independent large vocabulary speech dictation using newspaper-based speech corpora in English and French. The recognizer makes use of continuous density HMMs with Gaussian mixtures for acoustic modeling and n-gram statistics estimated on newspaper texts for language modeling. Acoustic modeling uses cepstrum-based features, context-dependent phone models (intra and interword), phone duration models, and sex-dependent models. For English the ARPA Wall Street Journal-based CSR corpus is used and for French the BREF corpus containing recordings of texts from the French newspaper Le Monde is used. Experiments were carried out with both these corpora at the phone level and at the word level with vocabularies containing up to 20,000 words. Word recognition experiments are also described for the ARPA RM task which has been widely used to evaluate and compare systems.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speaker-independent continuous speech dictation

Abstract

Talk to us

Similar Papers

More From: Speech Communication

Lead the way for us

Journal: Speech Communication	Publication Date: Oct 1, 1994
Citations: 87

Similar Papers

Speaker-independent continuous speech dictation
Jean-Luc Gauvain ... Lori F Lamel
-
Jean-Luc Gauvain, et. al.Jean-Luc Gauvain ... Lori F Lamel
22 Sep 1993
22 Sep 1993

The LIMSI continuous speech dictation system
J L Gauvain ... M Adda-Decker
-
J L Gauvain, et. al.J L Gauvain ... M Adda-Decker
01 Jan 1993
01 Jan 1993

The LIMSI continuous speech dictation system: evaluation on the ARPA Wall Street Journal task
J.L Gauvain ... G Adda
-
J.L Gauvain, et. al.J.L Gauvain ... G Adda
19 Apr 1994
19 Apr 1994

Phone duration modeling: overview of techniques and performance optimization via feature selection in the context of emotional speech
Alexandros Lazaridis ... Nikos Fakotakis
International Journal of Speech Technology | VOL. 13
Alexandros Lazaridis, et. al.Alexandros Lazaridis ... Nikos Fakotakis
30 Jul 2010
International Journal of Speech Technology | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speaker-independent continuous speech dictation

Abstract

Talk to us

Similar Papers

More From: Speech Communication