Abstract

The increasing demand for the performance improvement and robustness of automatic transcription of spontaneous speech in Slovak forces us to look for the advanced methods of adaptation of acoustic and language models to the user-specific voice characteristics and the topic of their speech. One of the ways how to increase the domain robustness of language models is to improve the process of retrieving text documents relevant to the current topic of the speech and use them to adapt the existing background language model. This paper focuses on the analysis, design and implementation of a new dual-space re-ranking model for document retrieval, adaptation of language models to the current topic of speech and personalization of speech recognition system. The experimental results of the proposed dual-space reranking model based on the averaging coefficients produced by latent semantic indexing and paragraph vectors ranking models show an additional 1% relative improvement in word error rate against the efficiency of single-space model ranking.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.