Abstract

Automatic Speech Recognition (ASR) errors are essentially unavoidable. This premise motivates the attempts to develop post hoc tools that tackle the ASR errors. This paper addresses the problem of errors in continuous speech recognition outputs to improve the exploitation of ASR transcriptions. We propose a generic classifier-based approach for both error detection and error type classification. Unlike the majority of research in this field, we handle the recognition errors independently from the ASR decoder using a set of features derived exclusively from the recognizer output and hence should be usable with any ASR system. As a result, experiments on TV program transcription data have shown that the proposed non-decoder features setup leads to achieve competitive performances, compared to state of the art systems, in ASR error detection and classification. Furthermore, we have shown that Support Vector Machines trained on the proposed features set appear to be an effective classifier for the ASR error type classification with an Accuracy of 82.41%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.