Trigram duration modeling in speech recognition

Yun Tang Yun Tang,Wenju Liu Wenju Liu,Bo Xu Bo Xu

doi:10.1109/chinsl.2004.1409627

Trigram duration modeling in speech recognition

Yun Tang Yun Tang, Wenju Liu Wenju Liu + Show 1 more

Open Access

https://doi.org/10.1109/chinsl.2004.1409627

Copy DOI

Publication Date: Dec 15, 2004

Citations: 10

#Speech Rate Variation #Rate Of Speech + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Rate of speech (ROS) is a very important factor in speech recognition. We present a new speech rate measurement method which first normalizes the duration of different acoustic units to a standard duration and then builds a trigram duration model to measure the speech rate of a sentence. We propose two methods based on the standard duration to compensate the influence introduced by speech rate variation in a data corpus and get 11% error rate reduction in Mandarin digit string recognition.

Full Text