Abstract

This paper constructs a reading miscue detection system based on conventional large vocabulary continuous speech recognition (LVCSR) framework. In order to incorporate the knowledge of reference (what the reader ought to read) and some error patterns into the decoding process, two methods are proposed: dynamic interpolation of language model (DILM) and dynamic multiple pronunciation incorporation (DMPI). DILM dynamically interpolates the general language model based on analysis of reference and so restricts the active paths of decoding not too far away from the reference. It makes the recognition more accurate, which further improve the detection performance. DMPI dynamically adds some pronunciation variations into the search space to predict reading substitutions. It solves the confliction between the coverage of error predictions and the perplexity the search space. The experimental results show that the proposed two methods can totally decrease EER by 14% relatively totally, from 46.4% to 39.8%.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.