Abstract

After inspecting the pitch contours of tone 1 of Mandarin speech, we found that the pitch contour of tone 1 consists of upward and downward line segments, while it is supposed that the contour of tone 1 is flat. Our study also found that tone 1 tends to be recognized as other three tones if the recognition algorithm used is based on the tone contour slope or shape. According to our experiments, we conclude that the recognition rate of the tones would be improved if two stage tone recognition scheme is conducted. At the first stage, tone one is recognized out and then the other three tones are identified at the second stage. The fundamental frequencies of input Mandarin speech of tone 1 are first retrieved from the training data and then a threshold value relating to standard deviation of fundamental frequencies is determined. In the first recognition stage, if the statistic standard deviation of fundamental frequencies is less than the determined threshold, the Mandarin speech is recognized as tone one. The input Mandarin speech which is not classified as tone 1 are the recognition targets of the second recognition stage. In the second stage, a so-called linear gradient analysis is conducted, and the tones are identified according to the derived positive or negative linear gradients. Our proposed recognition method is superior to traditional methods of Mandarin tone recognition in terms of effectiveness and recognition rate. Some experiments to prove the necessity of conducting two recognition stages will be described in detail.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.