Abstract

The hidden Markov model is supposed as the most common and effective method used in speech recognition for all languages including Vietnamese. However, this method is quite cumbersome and difficult to implement in many embedded systems that have limited resources. Dynamic Time Warping (DTW) method, whereas, has been in much study by many scientists and is proved to be simple and efficient for a relatively small set of words (about 100 words). Though, this method has not been investigated for Vietnamese. This paper will present the investigation result of the combination between Dynamic Time Warping and Correlative Coefficient in Vietnamese speech recognition dependent with the speaker. The vocabulary to be recognized and trained are 124 words. The training data are recorded from 7 people (4 men and 3 women), with four recording time in noise free environment. The recognition outcome achieves the accuracy above 90% on average. In some control areas, such as computer and television control, this accuracy is very promising. DTW, therefore, is proposed as a simple and efficient for Vietnamese speech recognition in many simple control systems.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.