FRAME-SYNCHRONOUS AND LOCAL CONFIDENCE MEASURES FOR AUTOMATIC SPEECH RECOGNITION

Joseph Razik,Dominique Fohr,Odile Mella,Jean-Paul Haton

doi:10.1142/s0218001411008543

Abstract

In this paper, we introduce two new confidence measures for large vocabulary speech recognition systems. The major feature of these measures is that they can be computed without waiting for the end of the audio stream. We proposed two kinds of confidence measures: frame-synchronous and local. The frame-synchronous ones can be computed as soon as a frame is processed by the recognition engine and are based on a likelihood ratio. The local measures estimate a local posterior probability in the vicinity of the word to analyze. We evaluated our confidence measures within the framework of the automatic transcription of French broadcast news with the EER criterion. Our local measures achieved results very close to the best state-of-the-art measure (EER of 23% compared to 22.0%). We then conducted a preliminary experiment to assess the contribution of our confidence measure in improving the comprehension of an automatic transcription for the hearing impaired. We introduced several modalities to highlight words of low confidence in this transcription. We showed that these modalities used with our local confidence measure improved the comprehension of automatic transcription.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

FRAME-SYNCHRONOUS AND LOCAL CONFIDENCE MEASURES FOR AUTOMATIC SPEECH RECOGNITION

Abstract

Talk to us

Similar Papers

More From: International Journal of Pattern Recognition and Artificial Intelligence

Lead the way for us

Journal: International Journal of Pattern Recognition and Artificial Intelligence	Publication Date: Mar 1, 2011
Citations: 5

Similar Papers

An experimental study on structural-MAP approaches to implementing very large vocabulary speech recognition systems for real-world tasks
I-Fan Chen ... Seokyong Moon
-
I-Fan Chen, et. al.I-Fan Chen ... Seokyong Moon
01 Oct 2013
01 Oct 2013

Speech/music segmentation using entropy and dynamism features in a HMM classification framework
Jitendra Ajmera ... Hervé Bourlard
Speech Communication | VOL. 40
Jitendra Ajmera, et. al.Jitendra Ajmera ... Hervé Bourlard
13 Sep 2002
Speech Communication | VOL. 40

Acoustic model topology optimization for large vocabulary speech recognition
Xirimo Bao ... J Joo
MATEC Web of Conferences | VOL. 309
Xirimo Bao, et. al.Xirimo Bao ... J Joo
01 Jan 2020
MATEC Web of Conferences | VOL. 309

Japanese broadcast news transcription demonstration
Long Nguyen ... Xuefeng Guo
-
Long Nguyen, et. al.Long Nguyen ... Xuefeng Guo
01 Jan 2002
01 Jan 2002

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

FRAME-SYNCHRONOUS AND LOCAL CONFIDENCE MEASURES FOR AUTOMATIC SPEECH RECOGNITION

Abstract

Talk to us

Similar Papers

More From: International Journal of Pattern Recognition and Artificial Intelligence