Transcribing broadcast data using MLP features

Petr Fousek, Lori Lamel, Jean-Luc Gauvain

doi:10.21437/interspeech.2008-414

Abstract

This paper describes incorporating discriminative features from a multi-layer perceptron (MLP) into a state-of-the-art Arabic broadcast data transcription system based on cepstral features. The MLP features are based on a recently proposed Bottle-Neck architecture with long-term warped LPTRAP speech representation at the input. It is shown that the previously reported improvements on a development Arabic transcription system carry through to a full system at a state-of-the-art level. SAT, CMLLR and MLLR adaptation techniques are shown to be useful for both MLP and combined features, though to a lesser degree than for PLPs. Without adaptation, MLP features obtain superior performance to cepstral features in all test conditions, and with adaptation both feature sets give comparable results. Combining the features, either through feature concatenation or through system hypothesis combination, gives significant gains. Gains from MMI model training seem to be additive to the gain coming from discriminative MLP features.
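
The feature concatenation mentioned in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the layer sizes, the random weights, the sigmoid hidden layer, and the choice to read linear activations from the bottleneck layer are all assumptions made for illustration, and the helper names (`make_layer`, `bottleneck_features`, `concatenate`) are hypothetical. It only shows the general idea of passing a long-term input vector through an MLP with a narrow layer and appending that layer's outputs to a cepstral frame.

```python
import math
import random

# Toy dimensions, chosen only for illustration (not taken from the paper).
LONG_TERM_DIM = 20   # size of the long-term input vector fed to the MLP
HIDDEN_DIM = 16      # size of the hidden layer before the bottleneck
BOTTLENECK_DIM = 4   # size of the narrow bottleneck layer
CEPSTRAL_DIM = 6     # size of the cepstral (e.g. PLP) frame


def make_layer(n_in, n_out, rng):
    """Return (weights, biases) for one fully connected layer.

    Weights are random here; a real system would use trained values.
    """
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def affine(layer, x):
    """Compute W x + b for a single input vector x."""
    weights, biases = layer
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]


def sigmoid(values):
    """Element-wise logistic function."""
    return [1.0 / (1.0 + math.exp(-a)) for a in values]


def bottleneck_features(layers, x):
    """Forward x up to the bottleneck layer and return its linear activations.

    Assumption: features are read before any non-linearity at the bottleneck.
    """
    hidden, bottleneck = layers
    return affine(bottleneck, sigmoid(affine(hidden, x)))


def concatenate(cepstral_frame, mlp_frame):
    """Append the MLP features to the cepstral features of the same frame."""
    return list(cepstral_frame) + list(mlp_frame)


rng = random.Random(0)
layers = (
    make_layer(LONG_TERM_DIM, HIDDEN_DIM, rng),
    make_layer(HIDDEN_DIM, BOTTLENECK_DIM, rng),
)

# Stand-ins for one frame of real acoustic features.
long_term_input = [rng.gauss(0.0, 1.0) for _ in range(LONG_TERM_DIM)]
cepstral_frame = [rng.gauss(0.0, 1.0) for _ in range(CEPSTRAL_DIM)]

combined = concatenate(cepstral_frame, bottleneck_features(layers, long_term_input))
print(len(combined))
```

The combined vector keeps the cepstral values first and the bottleneck outputs last, so an acoustic model trained on it sees both feature streams for each frame. Combining system hypotheses, the other option named in the abstract, operates on recognizer outputs rather than on features and is not covered by this sketch.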

