Automatic Annotation of Speech Corpora using Approximate Transcripts

Cristian Manolache,Alexandru Caranica,Alexandru-Lucian Georgescu,Horia Cucu

doi:10.1109/tsp49548.2020.9163405

Abstract

High-performance automatic speech recognition (ASR) systems are regularly trained on tens of thousands of hours annotated speech (i.e. speech paired with correct transcripts). Collecting such amount of data is prohibitively costly if done manually (i.e. humans listening and transcribing audio clips). However, raw speech data (without transcripts) is widely available and easily collectable. This paper proposes an automatic method that uses approximate transcripts of raw speech and an already existing ASR system to generate annotations. The method is evaluated in terms of annotation efficiency (i.e. the percentage of the initial raw speech corpus for which it provides annotations) and in terms of data usefulness for further training ASR systems. We show that, although the method is able to produce less data than other methods, the ASR system retrained using the newly created dataset performs significantly better than the baseline. Furthermore, we report ASR results that are better by 17% to 25% than what was reported up to now on Romanian read and spontaneous speech.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Automatic Annotation of Speech Corpora using Approximate Transcripts

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Theoretical Analysis of Diversity in an Ensemble of Automatic Speech Recognition Systems
Kartik Audhkhasi ... Shrikanth S Narayanan
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 22
Kartik Audhkhasi, et. al.Kartik Audhkhasi ... Shrikanth S Narayanan
01 Mar 2014
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 22

Interaction between people with dysarthria and speech recognition systems: A review
Aisha Jaddoh ... Omer Rana
Assistive Technology | VOL. 35
Aisha Jaddoh, et. al.Aisha Jaddoh ... Omer Rana
16 Apr 2022
Assistive Technology | VOL. 35

Enhancements in automatic Kannada speech recognition system by background noise elimination and alternate acoustic modelling
G Thimmaraja Yadava ... H S Jayanna
International Journal of Speech Technology | VOL. 23
G Thimmaraja Yadava, et. al.G Thimmaraja Yadava ... H S Jayanna
22 Jan 2020
International Journal of Speech Technology | VOL. 23

Combined speech enhancement and auditory modelling for robust distributed speech recognition
Ronan Flynn ... Edward Jones
Speech Communication | VOL. 50
Ronan Flynn, et. al.Ronan Flynn ... Edward Jones
20 May 2008
Speech Communication | VOL. 50

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Automatic Annotation of Speech Corpora using Approximate Transcripts

Abstract

Talk to us

Similar Papers