Abstract

This paper describes a novel speaker adaptation method that combines the maximum a posteriori (MAP) estimation and the weighted neighbor regression (WNR). The primary disadvantage of MAP adaptation is that only the parameters of those models with adaptation data are updated, thus great deals of adaptation data are required. In this paper, a technique called WNR is presented, in which the information of model neighbors is used to overcome that problem. The parameter relationships between the speaker independent models and the speaker adaptation models are trained by applying the distance weighted regressions to a set of neighbor model parameters with and without MAP adaptation. It gives nearly 15 percent error rate reduction with 10 adaptation utterances and more than 51 percent with 250 utterances in Chinese syllable recognition. In addition, the vector field smoothing (VFS) can be proved to be a degenerate case of WNR.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.