Abstract

Whispery voice is a type of voice quality in which the vocal folds do not vibrate and the airflow through the resonating cavity is modulated. Previous studies have shown that there is no fundamental frequency but has formant envelope in whispery voice. This is why the listener can recognize the voice quality. In this study, acoustic and spectral measures are extracted to investigate the formant pattern and voice quality of whispery voice. The results show that the differences in harmonic amplitudes (H4-H2K, H2K-H5K) are more effective in distinguishing whispered speech from modal voice than the differences in harmonic amplitudes at low frequencies (H1-H2, H2-H4). In addition, the values of Harmonic-to-Noise Ratio (HNR) and Cepstral Peak Prominence (CPP) of whispered voice were significantly lower than that of modal phonation, and the acoustic energy of whispery voice was also significantly reduced. As to formant frequency, the lower formant frequency of whispered vowels became higher compared to those of modal phonation. These findings can not only reveal the acoustic characteristics of whispery voice, but also provide theoretical foundation for whisper automatic recognition.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.