Speech resources for a Serbian LVCSR system

Stevan Ostrogonac, Edvin Pakoci, Sinisa Suzic, Milana Bojanic

doi:10.1109/telfor.2013.6716271

Publication Date: Nov 1, 2013

Affiliation: University of Novi Sad

#Large Vocabulary Speech Recognition System #Read Utterances

Abstract

This paper describes the full procedure of speech database collection and processing required to build a good large vocabulary speech recognition system for the Serbian language. The speech database consists of recordings from audio books, radio programs and talk shows, as well as read utterances from a range of male and female speakers. To date, around 200 hours of read speech and about 10 hours of radio recordings have been collected.
