Multi-View and Multi-Objective Semi-Supervised Learning for HMM-Based Automatic Speech Recognition

Xiaodong Cui,Jing Huang,Jen-Tzung Chien

doi:10.1109/tasl.2012.2191955

Abstract

Current hidden Markov acoustic modeling for large-vocabulary continuous speech recognition (LVCSR) heavily relies on the availability of abundant labeled transcriptions. Given that speech labeling is both expensive and time-consuming while there is a huge amount of unlabeled data easily available nowadays, the semi-supervised learning (SSL) from both labeled and unlabeled data aiming to reduce the development cost for LVCSR becomes more important than ever. In this paper, a new SSL approach is proposed which exploits the cross-view transfer learning for LVCSR through a committee machine consisting of multiple views learned from different acoustic features and randomized decision trees. In addition, a multi-objective learning scheme is developed in each view by maximizing a hybrid information-theoretic criterion which is established by the relative entropy between labeled data and their labels and the entropy of unlabeled data. The multi-objective scheme is then generalized to a unified SSL framework which can be interpreted into a variety of learning strategies under different weighting schemes. Experiments conducted on English Broadcast News using 50 hours of transcribed speech with 50 hours and 150 hours of untranscribed speech show the benefits of proposed approaches.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multi-View and Multi-Objective Semi-Supervised Learning for HMM-Based Automatic Speech Recognition

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech, and Language Processing

Lead the way for us

Journal: IEEE Transactions on Audio, Speech, and Language Processing	Publication Date: Sep 1, 2012
Citations: 30

Similar Papers

Multi-view and multi-objective semi-supervised learning for large vocabulary continuous speech recognition
Xiaodong Cui ... Jing Huang
-
Xiaodong Cui, et. al.Xiaodong Cui ... Jing Huang
01 May 2011
01 May 2011

Multi-View Semi-Supervised Learning for Dialog Act Segmentation of Speech
U Guz ... D Hakkani-Tur
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 18
U Guz, et. al.U Guz ... D Hakkani-Tur
01 Feb 2010
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 18

Graph-Based Semisupervised Learning for Acoustic Modeling in Automatic Speech Recognition
Yuzong Liu ... Katrin Kirchhoff
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 24
Yuzong Liu, et. al.Yuzong Liu ... Katrin Kirchhoff
01 Nov 2016
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 24

Effective semi-supervised learning strategies for automatic sentence segmentation
Dogan Dalva ... Hakan Gurkan
Pattern Recognition Letters | VOL. 105
Dogan Dalva, et. al.Dogan Dalva ... Hakan Gurkan
10 Oct 2017
Pattern Recognition Letters | VOL. 105

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-View and Multi-Objective Semi-Supervised Learning for HMM-Based Automatic Speech Recognition

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech, and Language Processing