Abstract

This paper presents our confidence measure system for speech recognition to integrate with e-Service to make Human-Computer Interaction more convenient. In order to make the system more robust for practical usage, the confidence measure is optimized to improve its performance as well as speed, compared with traditional state based confidence measure. First, the decoding likelihood of the best path is normalized with all the survival paths to form the onepass-based posteriori. After decoding, when recognition result is available as well as the phoneme level division point, the phoneme loop posteriori based confidence is calculated. Different models are compared for speed and performance. Then they are combined to form the final confidence for the judgement. Experiments are designed, and the proposed confidence measure get a relative improvement of 20%, 19.33% for equal error rate and 37.19%, 35.17% of false acceptance rate for out-of-vocabulary set on the development sets and the test sets, with no loss of false rejection rate for in-vocabulary set.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.