Annotation errors detection in TTS corpora

Jindřich Matoušek,Daniel Tihelka

doi:10.21437/interspeech.2013-305

Annotation errors detection in TTS corpora

Jindřich Matoušek, Daniel Tihelka

Open Access

https://doi.org/10.21437/interspeech.2013-305

Copy DOI

Publication Date: Aug 25, 2013
Citations: 16	License type: other-oa

Affiliation: University of West Bohemia

#Novelty Detection #Annotation Error + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

We investigate the problem of automatic detection of annotation errors in single-speaker read-speech corpora used for textto-speech (TTS) synthesis. Various word-level feature sets were used, and the performance of several detection methods based on support vector machines, extremely randomized trees, knearest neighbors, and the performance of novelty and outlier detection are evaluated. We show that both word- and utterancelevel annotation error detections perform very well with both high precision and recall scores and with F1 measure being almost 90%, or 97%, respectively. Index Terms: annotation error detection, classification, novelty detection, read speech corpora, speech synthesis

Full Text