Abstract

Objective indices for predicting speech intelligibility offer a quick and convenient alternative to behavioral measures of speech intelligibility. However, most such indices are designed for a specific language, such as English, and they do not take adequate account of tonal information in speech when applied to languages like Mandarin Chinese (hereafter called Mandarin) for which the patterns of fundamental frequency (F0) variation play an important role in distinguishing speech sounds with similar phonetic content. To address this, two experiments with normal-hearing listeners were conducted examining: (1) The impact of manipulations of tonal information on the intelligibility of Mandarin sentences presented in speech-shaped noise (SSN) at several signal-to-noise ratios (SNRs); (2) The intelligibility of Mandarin sentences with intact tonal information presented in SSN, pink noise, and babble at several SNRs. The outcomes were not correctly predicted by the Hearing Aid Speech Perception Index (HASPI-V1). A new intelligibility metric was developed that used one acoustic feature from HASPI-V1 plus Hilbert time envelope and temporal fine structure information from multiple frequency bands. For the new metric, the Pearson correlation between obtained and predicted intelligibility was 0.923 and the root mean square error was 0.119. The new metric provides a potential tool for evaluating Mandarin intelligibility.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.