On measuring the intelligibility of synthetic speech in noise - Do we need a realistic noise environment?

Tuomo Raitio, Paavo Alku, Olli Santala, Marko Takanen, Antti Suni, Martti Vainio

doi:10.1109/icassp.2012.6288801

Open Access

https://doi.org/10.1109/icassp.2012.6288801

Publication Date: Mar 1, 2012
Citations: 9
License type: other-oa

Affiliation: Aalto University, University of Helsinki

Abstract

Assessing the intelligibility of synthetic speech is important when creating synthetic voices for real-life applications, especially those involving interfering noise. This raises the question of how to measure the intelligibility of synthetic speech so that such conditions are simulated correctly. Conventionally, this has been done with a simple listening test setup in which diotic speech and noise are played to both ears over headphones. This differs considerably from a real noise environment, where speech and noise are spatially distributed. This paper addresses the question of whether a realistic noise environment should be used to test the intelligibility of synthetic speech. Three test conditions are evaluated: one with multichannel reproduction of noise and speech, and two headphone setups. Tests are performed with natural and synthetic speech, including speech specifically intended for noisy conditions. The results indicate a general trend across all setups, but also some interesting differences.

Full Text