Lightly supervised and unsupervised acoustic model training

Lori Lamel, Jean-Luc Gauvain, Gilles Adda

doi:10.1006/csla.2001.0186

Abstract

The last decade has witnessed substantial progress in speech recognition technology, with today’s state-of-the-art systems being able to transcribe unrestricted broadcast news audio data with a word error rate of about 20%. However, acoustic model development for these recognizers relies on the availability of large amounts of manually transcribed training data. Obtaining such data is both time-consuming and expensive, requiring trained human annotators and substantial amounts of supervision. This paper describes some recent experiments using lightly supervised and unsupervised techniques for acoustic model training in order to reduce the system development cost. The approach uses a speech recognizer to transcribe unannotated broadcast news data from the DARPA TDT-2 corpus. The hypothesized transcription is optionally aligned with closed captions or transcripts to create labels for the training data. Experiments providing supervision only via the language model training materials show that including texts which are contemporaneous with the audio data is not crucial for the success of the approach, and that the acoustic models can be initialized with as little as 10 min of manually annotated data. These experiments demonstrate that light or no supervision can dramatically reduce the cost of building acoustic models.
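
As a rough illustration of the alignment step mentioned in the abstract, the sketch below compares a recognizer hypothesis with a closed-caption text and keeps only the words on which the two agree. This is not the authors' implementation: the function name `agreeing_words`, the whitespace tokenization, and the use of Python's `difflib` for word-level alignment are all assumptions made for the example.

```python
from difflib import SequenceMatcher


def agreeing_words(hypothesis, captions):
    """Return (hypothesis_index, word) pairs that also appear, in order, in the captions.

    Hypothetical helper: a real system would align time-stamped recognizer
    output and would likely use a dedicated alignment procedure.
    """
    hyp = hypothesis.lower().split()
    cap = captions.lower().split()
    # autojunk=False disables difflib's heuristic that ignores very frequent tokens.
    matcher = SequenceMatcher(a=hyp, b=cap, autojunk=False)
    kept = []
    for block in matcher.get_matching_blocks():
        # Each block is a run of identical words: hyp[block.a:block.a + block.size].
        for offset in range(block.size):
            index = block.a + offset
            kept.append((index, hyp[index]))
    return kept


if __name__ == "__main__":
    hypothesis = "the president said the economy is growing fast"
    captions = "president says the economy is growing"
    for index, word in agreeing_words(hypothesis, captions):
        print(index, word)
```

Words that survive this filter could then serve as approximate training labels, while regions where the hypothesis and the captions disagree would be left out.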

Journal: Computer Speech & Language
Publication Date: Jan 1, 2002
Citations: 281

Similar Papers

Unsupervised acoustic model training
Lori Lamel ... Gilles Adda
01 May 2002

Multi-lingual unsupervised acoustic modeling using multi-task deep neural network under mismatch conditions
Yao Haitao ... Liu Jian
01 Jun 2016

Unsupervised acoustic and language model training with small amounts of labelled data
Scott Novotney ... Richard Schwartz
01 Apr 2009

Cross-language bootstrapping for unsupervised acoustic model training: rapid development of a Polish speech recognition system
Jonas Lööf ... Christian Gollan
06 Sep 2009
