Perceptual Influence of Non-Spectral Cues on Spectral Properties of Korean Vowel Signals in Seoul Corpus: A Neural Network Modeling Study

Soonhyun Hong

doi:10.14342/smog.2019.102.135

Abstract

Though the importance of spectral characteristics of Korean vowels in the hVd syllable has been recognized in the literature, it has never been studied whether static spectral measurements at a (steady-state) central section are enough to characterize Korean vowels in spontaneous speech, or dynamic spectral measurements across the temporal dimension can better characterize vowels. Despite ample reported evidence of the perceptual influence of non-spectral cues on spectral properties of vowels in the literature, no reports have yet been released on the difference in the degree of the perceptual influence of non-spectral cues (e.g., place and manner of the preceding or following phones, F0, speaking rate, prosody, and gender) on spectral properties of vowels. Through Neural Network pattern recognition modeling in a supervised mode, it was found that dynamic spectral models with non-spectral cues better explain vowel perception than static spectral models and furthermore, flanking phone identities, and manner and place of flanking phones are perceptually the most influential while duration, F0 and speaking rate are perceptually far less contributive than argued in the literature.

Full Text