Active learning for automatic speech recognition

Dilek Hakkani-Tur,Allen Gorin,Giuseppe Riccardi

doi:10.1109/icassp.2002.5745510

Abstract

State-of-the-art speech recognition systems are trained using transcribed utterances, preparation of which is labor intensive and time-consuming. In this paper, we describe a new method for reducing the transcription effort for training in automatic speech recognition (ASR). Active learning aims at reducing the number of training examples to be labeled by automatically processing the unlabeled examples, and then selecting the most informative ones with respect to a given cost function for a human to label. We automatically estimate a confidence score for each word of the utterance, exploiting the lattice output of a speech recognizer, which was trained on a small set of transcribed data. We compute utterance confidence scores based on these word confidence scores, then selectively sample the utterances to be transcribed using the utterance confidence scores. In our experiments, we show that we reduce the amount of labeled data needed for a given word accuracy by 27%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Active learning for automatic speech recognition

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Active learning: theory and applications to automatic speech recognition
G Riccardi ... D Hakkani-Tur
IEEE Transactions on Speech and Audio Processing | VOL. 13
G Riccardi, et. al.G Riccardi ... D Hakkani-Tur
01 Jul 2005
IEEE Transactions on Speech and Audio Processing | VOL. 13

A Dropout-Based Single Model Committee Approach for Active Learning in ASR
Jiayi Fu ... Kuang Ru
-
Jiayi Fu, et. al.Jiayi Fu ... Kuang Ru
01 Dec 2019
01 Dec 2019

An active MBBNTree classifier learning from unlabeled samples
Yong C Cao ... Yue Zhao
-
Yong C Cao, et. al.Yong C Cao ... Yue Zhao
10 Oct 2008
10 Oct 2008

Supervised and unsupervised active learning for automatic speech recognition of low-resource languages
Ali Raza Syed ... Ellen Kislal
-
Ali Raza Syed, et. al.Ali Raza Syed ... Ellen Kislal
01 Mar 2016
01 Mar 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Active learning for automatic speech recognition

Abstract

Talk to us

Similar Papers