Automatic phonetic transcription of large speech corpora

Christophe Van Bael,Lou Boves,Henk Van Den Heuvel,Helmer Strik

doi:10.1016/j.csl.2007.03.003

Christophe Van Bael, Lou Boves + Show 2 more

Open Access

https://doi.org/10.1016/j.csl.2007.03.003

Copy DOI

Journal: Computer Speech & Language	Publication Date: Mar 23, 2007
Citations: 45	License type: mit

Affiliation: Radboud University Nijmegen

Abstract

Most large speech corpora are delivered with a lexicon that contains a canonical transcription of every word in the orthographic transcription. Such a lexicon can be used for generating a hypothetical ‘canonical’ phonetic transcription from the orthography. In addition, time and money permitting, some speech corpora are provided with a manually verified broad phonetic transcription of at least part of the material. Since the manual verification of phonetic transcriptions is time-consuming and expensive, we investigated whether existing automatic transcription procedures and combinations of such procedures can offer a quick and cheap alternative for the generation of phonetic transcriptions like the manually verified transcriptions delivered with large speech corpora. In our study, we used 10 automatic transcription procedures to generate a broad phonetic transcription of well-prepared speech (read-aloud texts) and spontaneous speech (telephone dialogues) from the Spoken Dutch Corpus. The performance was assessed in terms of the number and the nature of the discrepancies between the emerging phonetic transcriptions and the corresponding manually verified phonetic transcriptions delivered with the Spoken Dutch Corpus. Some of the resulting automatic transcriptions appeared to be comparable to the manually verified transcriptions.

Full Text