Abstract

Though the importance of spectral characteristics of Korean vowels in the hVd syllable has been recognized in the literature, it has never been studied whether static spectral measurements at a (steady-state) central section are enough to characterize Korean vowels in spontaneous speech, or dynamic spectral measurements across the temporal dimension can better characterize vowels. Despite ample reported evidence of the perceptual influence of non-spectral cues on spectral properties of vowels in the literature, no reports have yet been released on the difference in the degree of the perceptual influence of non-spectral cues (e.g., place and manner of the preceding or following phones, F0, speaking rate, prosody, and gender) on spectral properties of vowels. Through Neural Network pattern recognition modeling in a supervised mode, it was found that dynamic spectral models with non-spectral cues better explain vowel perception than static spectral models and furthermore, flanking phone identities, and manner and place of flanking phones are perceptually the most influential while duration, F0 and speaking rate are perceptually far less contributive than argued in the literature.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.