Abstract

Problem statement: In Thai, tone is an essential feature of a prosodic syllable to identify the meanings of that syllable or that part of word. To generate the tonal speech with natural prosody, it is needed to manage the fundamental frequency (F0) of the speech appropriately. A successful approach of structural modeling from Mandarin Chinese has been adapted to model Thai tone. Approach: The structural modeling of voice F0 contours for Thai tones has been studied. Both male and female speech are concerned. The speech material covers 15 syllables with 5 tones. We use 30 samples for each syllable. The structural modeling parameters for all tones are extracted. Thereafter, the Root Mean Square (RMS) error between the re-synthesized F0 contour and the natural F0 contour is calculated. Results: The experimental analysis shows that RMS errors of all tones are mutually different. It has been noticed that the tone 1 or low tone has the smallest error among all tones in average. Conclusion: The structural model is effectively applied to model Thai tones. The structural modeling can distinguish each tone empirically.

Highlights

  • In human speech production, the vocal chords vibrate at a temporal frequency to produce a semiperiodic air flow through the vocal tract

  • The vocal chords vibrate at a temporal frequency to produce a semiperiodic air flow through the vocal tract. This frequency is known as the fundamental frequency of the output speech signal. It is an essential feature among other speech features which carry prosodic information of the natural speech

  • In the modern speech technology, e.g., speech recognition, speech analysis and synthesis, it is. This research presented another approach of fundamental frequency contour modeling for Thai tones

Read more

Summary

INTRODUCTION

The vocal chords vibrate at a temporal frequency to produce a semiperiodic air flow through the vocal tract. This frequency is known as the fundamental frequency of the output speech signal. In the modern speech technology, e.g., speech recognition, speech analysis and synthesis, it is. This research presented another approach of fundamental frequency contour modeling for Thai tones. Necessary to model the F0 with high accuracy. In the former studies, several modeling techniques have been

MATERIALS AND METHODS
DISCUSSION
CONCLUSION
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call