Improvement of Lecture Speech Recognition by Using Unsupervised Adaptation

Tetsuo Kosaka,Masaharu Kato,Takashi Kusama,Masaki Kohda

doi:10.4018/978-1-61520-871-5.ch016

Tetsuo Kosaka, Masaharu Kato + Show 2 more

https://doi.org/10.4018/978-1-61520-871-5.ch016

Copy DOI

Export

Save

Cite

Publication Date: Jan 1, 2011

Affiliation: Yamagata University Hospital

Abstract
Full-Text
Similar Papers

Abstract

Listen

The aim of this work is to improve the recognition performance of spontaneous speech. In order to achieve the purpose, the authors of this chapter propose new approaches of unsupervised adaptation for spontaneous speech and evaluate the methods by using diagonal-covariance and full-covariance hidden Markov models. In the adaptation procedure, both methods of language model (LM) adaptation and acoustic model (AM) adaptation are used iteratively. Several combination methods are tested to find the optimal approach. In the LM adaptation, a word trigram model and a part-of-speech (POS) trigram model are combined to build a more task-specific LM. In addition, the authors propose an unsupervised speaker adaptation technique based on adaptation data weighting. The weighting is performed depending on POS class. In Japan, a large-scale spontaneous speech database “Corpus of Spontaneous Japanese (CSJ)” has been used as the common evaluation database for spontaneous speech and the authors used it for their recognition experiments. From the results, the proposed methods demonstrated a significant advantage in that task.

Full Text