Gender-Dependent Acoustic Models Fusion Developed for Automatic Subtitling of Parliament Meetings Broadcasted by the Czech TV

Jan Vaněk, Josef V Psutka

doi:10.1007/978-3-642-15760-8_55

Abstract

Gender-dependent (male/female) acoustic models are more acoustically homogeneous and therefore give better recognition performance than a single gender-independent (GI) model. This paper deals with the problem of how to use these gender-based acoustic models in a real-time LVCSR (Large Vocabulary Continuous Speech Recognition) system that the Czech TV has used for more than a year for automatic subtitling of Parliament meetings broadcast on the channel ČT24. Frequent changes of speakers and the direct connection of the LVCSR system to the TV audio stream require the models to be switched or fused automatically and as quickly as possible. The paper presents various techniques that use the output probabilities to quickly select the better model or a combination of the models. The best proposed method achieved over 11% relative WER (word error rate) reduction in comparison with the GI model.

Keywords (machine-generated, not supplied by the authors): Fusion Method, Total Probability, Acoustic Model, Output Probability, Word Error Rate
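The abstract does not spell out the individual switching and fusion techniques, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that each gender-dependent model yields a per-frame log-likelihood, that scores are accumulated over a short sliding window, and that the model with the higher windowed score is selected (or that the two models are weighted by a normalized score). The class name `GenderModelSelector`, the function names, the window length, and all numbers are hypothetical and chosen for illustration only.

```python
import math
from collections import deque


class GenderModelSelector:
    """Hypothetical selector: picks the model with the higher windowed log-likelihood."""

    def __init__(self, window=50):
        # Window length (in frames) is an assumed value, not taken from the paper.
        self.male = deque(maxlen=window)
        self.female = deque(maxlen=window)

    def update(self, ll_male, ll_female):
        """Add one frame's log-likelihoods and return the currently preferred model."""
        self.male.append(ll_male)
        self.female.append(ll_female)
        return "male" if sum(self.male) >= sum(self.female) else "female"

    def fusion_weights(self):
        """Return (w_male, w_female): windowed scores normalized to sum to 1."""
        s_m, s_f = sum(self.male), sum(self.female)
        top = max(s_m, s_f)  # subtract the maximum for numerical stability
        e_m, e_f = math.exp(s_m - top), math.exp(s_f - top)
        return e_m / (e_m + e_f), e_f / (e_m + e_f)


def relative_wer_reduction(wer_baseline, wer_new):
    """Relative WER reduction of a new system with respect to a baseline."""
    return (wer_baseline - wer_new) / wer_baseline


selector = GenderModelSelector(window=3)
first = selector.update(-1.0, -2.0)   # male model scores higher on this frame
second = selector.update(-5.0, -1.0)  # windowed sums are now -6.0 vs -3.0
weights = selector.fusion_weights()

# Illustrative numbers only: a drop from 20.0% to 17.8% WER is an 11% relative reduction.
example_reduction = relative_wer_reduction(20.0, 17.8)
```

A short window reacts quickly to a speaker change but makes the decision noisier; a long window is more stable but slower to switch. The "over 11% relative WER reduction" reported in the abstract is a ratio of this kind, measured against the GI model.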

Publication Date: Jan 1, 2010
Citations: 21
License type: other-oa

Similar Papers

Exemplar-Based Sparse Representation Features: From TIMIT to LVCSR
Tara N Sainath ... Michael Picheny
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 19
01 Nov 2011

Japanese large-vocabulary continuous-speech recognition using a newspaper corpus and broadcast news
Katsutoshi Ohtsuki ... Katsuhiko Shirai
Speech Communication | VOL. 28
01 Jun 1999

Acoustic models of the elderly for large-vocabulary continuous speech recognition
Akira Baba ... Kiyohiro Shikano
Electronics and Communications in Japan (Part II: Electronics) | VOL. 87
09 Jun 2004

Recent improvements of the SpeeD Romanian LVCSR system
Horia Cucu ... Corneliu Burileanu
01 May 2014
