MODELING OF FUNDAMENTAL FREQUENCY CONTOURS FOR THAI DIALECTS WITH LARGE SPEECH DATABASE

M M

doi:10.3844/ajassp.2012.1990.2003

Abstract

In four core regions of Thailand, there are four main dialects including central, north, northeast and south dialects. The prosody is significantly unique for each dialect. One important factor determining the prosody is the fundamental frequency. As a result, modeling of Fundamental frequency (F0) contour is very important for the natural speech processing. Even though there are many modeling techniques for modeling the F0 contour. In this study, the Fujisaki’s model has been selected because of its achievement in modeling of various Thai speech units. This study proposes an analysis of model parameters of Thai speech prosody for four regional dialects and two genders. Seven derived parameters from the Fujisaki’s model are as follows. The first parameter is baseline frequency which is the lowest level of F0 contour. The second and third parameters are the numbers of phrase commands and tone commands which reflect the frequencies of surges of the utterance in global and local levels, respectively. The fourth and fifth parameters are phrase command and tone command durations which reflect the speed of speaking and the length of a syllable, respectively. The sixth and seventh parameters are amplitudes of phrase command and tone command which reflect the energy of the global speech and the energy of local syllable. In the experimental results, the large speech material of each regional dialect includes 50 samples of 50 sentences with male and female speech. It can be obviously seen that most of the proposed parameters can distinguish four kinds of regional dialects explicitly. The results reveal that the proposed parameters of Fujisaki’s model can distinguish the regional dialects explicitly.

Highlights

The former study on F0 modeling has been considerably conducted in various speech units and several techniques such as utterance level (Fujisaki and Ohno, 1998; Fujisaki et al, 1990; Tao et al, 2006; Saito and Sakamoto, 2002; Ni and Hirose, 2006; Li et al, 2004), word and syllable levels (Fujisaki et al, 1990; Hiroya and Hiroshi, 1971; Dat et al, 2006)
An analysis of model parameters of Thai speech prosody for four regional dialects and two genders will be performed in the same way as modeling of fundamental frequency for Thai expressive speech conducted in 2010 which is proved to be effective for a limited-domain speech corpus (Chomphan, 2010a)
We analyzed the frequency distribution over its range and the distributions of four Thai dialects including Center dialect, North dialect, Northeast dialect and South dialect are plot in a graph to show the differences and similarities among those dialects

Summary

Introduction

The former study on F0 modeling has been considerably conducted in various speech units and several techniques such as utterance level (Fujisaki and Ohno, 1998; Fujisaki et al, 1990; Tao et al, 2006; Saito and Sakamoto, 2002; Ni and Hirose, 2006; Li et al, 2004), word and syllable levels (Fujisaki et al, 1990; Hiroya and Hiroshi, 1971; Dat et al, 2006). In Thai speech, Fujisaki’s model has been successfully applied for modeling of utterances, tones and words (Hiroya and Sumio, 2002; Seresangtakul and Takara, 2002; 2003). The previous study shows that the derived parameters can distinguish one style of speech from each other.

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

MODELING OF FUNDAMENTAL FREQUENCY CONTOURS FOR THAI DIALECTS WITH LARGE SPEECH DATABASE

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: American Journal of Applied Sciences

Lead the way for us

Journal: American Journal of Applied Sciences	Publication Date: Dec 1, 2012
License type: cc-by

Similar Papers

Effects of Noises on Fujisakiâs Model of Fundamental Frequency Contours for Thai Dialects
...
American Journal of Applied Sciences | VOL. 9
, et. al. ...
01 Oct 2012
American Journal of Applied Sciences | VOL. 9

Fujisaki's Model of Fundamental Frequency Contours for Thai Dialects
Chomphan
Journal of Computer Science | VOL. 6
Chomphan Chomphan
01 Nov 2010
Journal of Computer Science | VOL. 6

Analytical Study on Fundamental Frequency Contours of Thai Expressive Speech Using Fujisaki's Model
Chomphan
Journal of Computer Science | VOL. 6
Chomphan Chomphan
01 Jan 2009
Journal of Computer Science | VOL. 6

Fujisakiâs Model of Thaiâs Fundamental Frequency Contours with Environmental Noises
Edgar
American Journal of Applied Sciences | VOL. 9
Edgar Edgar
01 Aug 2012
American Journal of Applied Sciences | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

MODELING OF FUNDAMENTAL FREQUENCY CONTOURS FOR THAI DIALECTS WITH LARGE SPEECH DATABASE

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: American Journal of Applied Sciences