Abstract

This paper presents a high-performance, real-time, speaker-independent continuous speech recognition system for Mandarin and the Sichuan dialect, applied to a postal parcel checking task. The system's vocabulary consists of 4500 Chinese place names and 1021 number strings. For Mandarin speech, recognition accuracy reaches 98.9% for top-1 and 99.7% for top-3. For Sichuan dialect speech, recognition accuracy reaches 98.6% for top-1 and 99.9% for top-3. In addition, a rejection method based on an online garbage model and a speaker adaptation method are integrated into the system to improve its robustness, and a mixed start-end point detection algorithm is used. The system operates stably and maintains high performance in high-noise environments.
