Abstract

In this paper, the problem of fast model adaptation and complexity selection for nonnative speaker is investigated. The key challenge lies in reliable complexity selection when only a small amount of adaptation data is available. A novel technique of combining a maximum likelihood (ML) based state-tying with a pseudo likelihood (PL) based state-tying is proposed to enable model complexity selection from using as little as three adaptation speech sentences. In MUPL, ML model complexity selection is performed on nodes with sufficient adaptation data, and PL based state tying is performed on nodes with insufficient adaptation data. Experiments were performed on WSJ data of six nonnative speakers. The combined model adaptation and complexity selection method led to consistent and significant improvement on recognition accuracy over MLLR, with an average error reduction of 13% when a varying number of adaptation speech sentences were taken from each speaker.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call