Abstract

Speaker adaptation has long been applied to improve speech recognition performance of hidden Markov model (HMM)-based systems. Recently, the polynomial segment model (PSM) has been shown as a viable alternative that can significantly improve the performance of large vocabulary continuous speech recognition (LVCSR). In this letter, we extend the widely used HMM-based maximum likelihood linear regression (MLLR) speaker adaptation technique to PSMs. PSM properties, such as using segment as a modeling unit, and a polynomial curve as model mean, are taken into account in deriving the PSM-based MLLR. Experiments show that PSM-based MLLR adaptation performs equally well as the HMM-based MLLR adaptation with about 19% relative improvement from the SI model. In addition, another 5% relative improvement can be obtained by combining the adapted PSMs and HMMs

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.