System-independent ASR error detection and classification using Recurrent Neural Network

Rahhal Errattahi,Asmaa El Hannani,Thomas Hain,Hassan Ouahmane

doi:10.1016/j.csl.2018.12.007

Abstract

This paper addresses errors in continuous Automatic Speech Recognition (ASR) in two stages: error detection and error type classification. Unlike the majority of research in this field, we propose to handle the recognition errors independently from the ASR decoder. We first establish an effective set of generic features derived exclusively from the recognizer output to compensate for the absence of ASR decoder information. Then, we apply a variant Recurrent Neural Network (V-RNN) based models for error detection and error type classification. Such model learn additional information to the recognized word classification using label dependency. As a result, experiments on Multi-Genre Broadcast Media corpus have shown that the proposed generic features setup leads to achieve competitive performances, compared to state of the art systems in both tasks. Furthermore, we have shown that V-RNN trained on the proposed feature set appear to be an effective classifier for the ASR error detection with an Accuracy of 85.43%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computer Speech & Language	Publication Date: Dec 14, 2018
Citations: 7	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

System-independent ASR error detection and classification using Recurrent Neural Network

Abstract

Talk to us

Similar Papers

More From: Computer Speech & Language

Lead the way for us

Similar Papers

Towards a generic approach for automatic speech recognition error detection and classification
Rahhal Errattahi ... Hassan Ouahmane
-
Rahhal Errattahi, et. al.Rahhal Errattahi ... Hassan Ouahmane
01 Mar 2018
01 Mar 2018

Evaluation of the effectiveness and efficiency of state-of-the-art features and models for automatic speech recognition error detection
Asmaa El Hannani ... Fatima Zahra Salmam
Journal of Big Data | VOL. 8
Asmaa El Hannani, et. al.Asmaa El Hannani ... Fatima Zahra Salmam
06 Jan 2021
Journal of Big Data | VOL. 8

Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden Intermediates
Hirofumi Inaguma ... Shinji Watanabe
-
Hirofumi Inaguma, et. al.Hirofumi Inaguma ... Shinji Watanabe
13 Dec 2021
13 Dec 2021

Automatic Speech Recognition and Pronunciation Error Detection of Dutch Non-native Speech: cumulating speech resources in a pluricentric language
X Wei ... H Strik
Speech Communication | VOL. 144
X Wei, et. al.X Wei ... H Strik
01 Oct 2022
Speech Communication | VOL. 144

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

System-independent ASR error detection and classification using Recurrent Neural Network

Abstract

Talk to us

Similar Papers

More From: Computer Speech &amp; Language

More From: Computer Speech & Language