An Innovative Prosody Modeling Method for Chinese Speech Recognition

Gang Peng,William S.-Y Wang

doi:10.1023/b:ijst.0000017013.70486.51

Abstract

This paper presents an innovative method for prosody modeling in Chinese speech recognition. Our method first evaluated the reliability of the prosodic information by which the recognition system dynamically tunes the balance between the spectral scores and prosodic scores. The basic idea of this method is to use prosodic knowledge based on its reliability. The higher the reliability, the more the prosodic information contributes to recognition. Thus, this method will not introduce extra errors but will incorporate more knowledge into the recognition system. Experimental results showed that this method reduced the relative word error rate by as much as 52.9% and 46.0% for Mandarin and Cantonese digit string recognition tasks, respectively. When incorporating tone information into Cantonese Large Vocabulary Continuous Speech Recognition (LVCSR) via the proposed method, a 20.16% relative character error rate reduction was obtained.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An Innovative Prosody Modeling Method for Chinese Speech Recognition

Abstract

Talk to us

Similar Papers

More From: International Journal of Speech Technology

Lead the way for us

Journal: International Journal of Speech Technology	Publication Date: Apr 1, 2004
Citations: 21

Similar Papers

Improving Speech Recognition Accuracy of Local POI Using Geographical Models
Songjun Cao ... Long Ma
-
Songjun Cao, et. al.Songjun Cao ... Long Ma
19 Jan 2021
19 Jan 2021

Cross-Lingual Language Modeling for Low-Resource Speech Recognition
Ping Xu ... P Fung
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 21
Ping Xu, et. al. Ping Xu ... P Fung
01 Jun 2013
IEEE Transactions on Audio, Speech, and Language Processing | VOL. 21

Geo-location dependent deep neural network acoustic model for speech recognition
Guoli Ye ... Yifan Gong
-
Guoli Ye, et. al.Guoli Ye ... Yifan Gong
01 Mar 2016
01 Mar 2016

Optimizing Data Usage for Low-Resource Speech Recognition
Yanmin Qian ... Zhikai Zhou
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 30
Yanmin Qian, et. al.Yanmin Qian ... Zhikai Zhou
01 Jan 2021
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 30

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Innovative Prosody Modeling Method for Chinese Speech Recognition

Abstract

Talk to us

Similar Papers

More From: International Journal of Speech Technology