Active learning and semi-supervised learning for speech recognition: A unified framework using the global entropy reduction maximization criterion

Dong Yu,Balakrishnan Varadarajan,Li Deng,Alex Acero

doi:10.1016/j.csl.2009.03.004

Dong Yu, Balakrishnan Varadarajan + Show 2 more

https://doi.org/10.1016/j.csl.2009.03.004

Copy DOI

Abstract

We propose a unified global entropy reduction maximization (GERM) framework for active learning and semi-supervised learning for speech recognition. Active learning aims to select a limited subset of utterances for transcribing from a large amount of un-transcribed utterances, while semi-supervised learning addresses the problem of selecting right transcriptions for un-transcribed utterances, so that the accuracy of the automatic speech recognition system can be maximized. We show that both the traditional confidence-based active learning and semi-supervised learning approaches can be improved by maximizing the lattice entropy reduction over the whole dataset. We introduce our criterion and framework, show how the criterion can be simplified and approximated, and describe how these approaches can be combined. We demonstrate the effectiveness of our new framework and algorithm with directory assistance data collected under the real usage scenarios and show that our GERM based active learning and semi-supervised learning algorithms consistently outperform the confidence-based counterparts by a significant margin. Using our new active learning algorithm cuts the number of utterances needed for transcribing by 50% to achieve the same recognition accuracy obtained using the confidence-based active learning approach, and by 60% compared to the random sampling approach. Using our new semi-supervised algorithm we can determine the cutoff point in determining which utterance-transcription pair to use in a principled way by demonstrating that the point it finds is very close to the achievable peak point.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Active learning and semi-supervised learning for speech recognition: A unified framework using the global entropy reduction maximization criterion

Abstract

Talk to us

Similar Papers

More From: Computer Speech & Language

Lead the way for us

Journal: Computer Speech & Language	Publication Date: Mar 25, 2009
Citations: 141

Similar Papers

A two-phase hybrid of semi-supervised and active learning approach for sequence labeling
Hamed Hassanzadeh ... Mohammadreza Keyvanpour
Intelligent Data Analysis | VOL. 17
Hamed Hassanzadeh, et. al.Hamed Hassanzadeh ... Mohammadreza Keyvanpour
17 Apr 2013
Intelligent Data Analysis | VOL. 17

Consistency-Based Semi-supervised Evidential Active Learning for Diagnostic Radiograph Classification
Shafa Balaram ... Cuong M Nguyen
-
Shafa Balaram, et. al.Shafa Balaram ... Cuong M Nguyen
01 Jan 2021
01 Jan 2021

A unified active and semi-supervised learning framework for image compression
Xiaofei He ... Hujun Bao
-
Xiaofei He, et. al. Xiaofei He ... Hujun Bao
01 Jun 2009
01 Jun 2009

A unified active and semi-supervised learning framework for image compression
Xiaofei He ... Ming Ji
-
Xiaofei He, et. al. Xiaofei He ... Ming Ji
01 Jun 2009
01 Jun 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Active learning and semi-supervised learning for speech recognition: A unified framework using the global entropy reduction maximization criterion

Abstract

Talk to us

Similar Papers

More From: Computer Speech &amp; Language

More From: Computer Speech & Language