ASR error detection and recognition rate estimation using deep bidirectional recurrent neural networks

Atsunori Ogawa, Takaaki Hori

doi:10.1109/icassp.2015.7178796

Abstract

Recurrent neural networks (RNNs) have recently been applied as classifiers for sequential labeling problems. In this paper, deep bidirectional RNNs (DBRNNs) are applied for the first time to error detection in automatic speech recognition (ASR), which is a sequential labeling problem. We investigate three types of ASR error detection tasks, i.e., confidence estimation, out-of-vocabulary word detection, and error type classification. We also estimate recognition rates from the error type classification results. Experimental results show that the DBRNNs greatly outperform conditional random fields (CRFs), especially for the detection of infrequent error labels. The DBRNNs also slightly outperform the CRFs in recognition rate estimation. In addition, experiments using reduced amounts of training data suggest that the DBRNNs generalize better than the CRFs owing to their word vector representation in a low-dimensional continuous space. As a result, the DBRNNs trained using only 20% of the training data show higher error detection performance than the CRFs trained using the full training data.
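
The step of estimating recognition rates from error type classification results can be illustrated with a minimal sketch. The abstract does not say how the labels are encoded or how deletions are represented, so everything here is an assumption: the label set (C = correct, S = substitution, I = insertion, D = deletion), the hypothetical function name `estimate_recognition_rates`, and the use of the standard percent-correct and word-accuracy definitions. It is not the authors' implementation.

```python
from collections import Counter


def estimate_recognition_rates(labels):
    """Estimate recognition rates from predicted error-type labels.

    Assumed label set (not taken from the paper):
    'C' correct, 'S' substitution, 'I' insertion, 'D' deletion.
    Returns (percent_correct, word_accuracy), both in percent.
    """
    counts = Counter(labels)
    c, s, i, d = (counts[k] for k in "CSID")

    # Every reference word is either correct, substituted, or deleted.
    n_ref = c + s + d
    if n_ref == 0:
        raise ValueError("no reference words implied by the label sequence")

    # Standard definitions: %Correct = C / N, Accuracy = (C - I) / N.
    percent_correct = 100.0 * c / n_ref
    word_accuracy = 100.0 * (c - i) / n_ref
    return percent_correct, word_accuracy


# Hypothetical label sequence, e.g. as predicted by an error type classifier.
predicted = ["C", "C", "S", "I", "C", "D"]
pc, acc = estimate_recognition_rates(predicted)
print(f"percent correct: {pc:.1f}, word accuracy: {acc:.1f}")
```

Because the rates are computed from predicted labels alone, no reference transcript is needed at estimation time; the quality of the estimate depends entirely on how well the classifier predicts the labels.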

Similar Papers

Error detection and accuracy estimation in automatic speech recognition using deep bidirectional recurrent neural networks
Atsunori Ogawa ... Takaaki Hori
Speech Communication | VOL. 89
11 Mar 2017

Improving protein disorder prediction by deep bidirectional long short-term memory recurrent neural networks.
Jack Hanson ... Kuldip Paliwal
Bioinformatics | VOL. 33
05 Dec 2016

Speaker-Adapted Confidence Measures for ASR Using Deep Bidirectional Recurrent Neural Networks
Miguel Angel Del-Agua ... Alfons Juan
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 26
01 Jul 2018

Incorporating label dependency for ASR error detection via RNN
Rahhal Errattahi ... Hassan Ouahmane
Procedia Computer Science | VOL. 148
01 Jan 2019
