Abstract
In this paper, the results of automated subjective assessment of Ukrainian speech intelligibility are presented. Speech monosyllables of the consonant-vowel-consonant (CVC) type were listened in two modes: through headphones and through acoustic monitors. The assessment was carried out with the help of specially developed software that allowed automating of articulation tests. Speech listening was done for four situations: pure language; speech distorted by noise; speech distorted by reverberation; speech distorted by the combined effect of noise and reverberation. In the first case, speech monosyllables of 3 articulation tables were listened, each of which contained 50 monosyllables. In the second case, speech distorted by the additive noise with the signal-to-noise ratios (SNR) varied in the range ‑15…+10 dB was listened. In this case, models of white, pink and brown noises were used, the masking properties of which are rather well-studied. In the third case, the reverberant speech for reverberation times in the range 0.3…2.7 s was modeled by convolution of pure speech signals with room impulse responces (RIRs) of various rooms, and in the fourth case the joint action of pink noise and reverberation was considered. It turned out that the masking ability of white noise exceeds one for brown noise for SNR less than minus 5 dB, which is not entirely consistent with preliminary predictive estimates. In addition, it turned out that listening to speech distorted by noise through acoustic monitors could lead to a significant increase in the speech intelligibility, compared to the case of listening through headphones. The analysis of possible causes of abnormal increase in speech intelligibility has been carried out. Early reflections, presence of two loudspeakers, binaural listening, psychophysical features of listeners, as well as peculiarities of software and articulatory testing organization were considered as possible reasons of the phenomenon. After correction of the software and some features of articulation tests it turned out that the results of the speech intelligibility estimation almost coincide when listening to the signals through the headphones and through acoustic monitors, if the distance between the listener and acoustic monitors does not exceed 0.6-0.8 meters. At the same time, these corrections did not differ in the behavior of the dependencies of speech intelligibility on the SNR for small (less minus 5 dB) SNR values The general conclusion may be that listening to speech signals distorted by noise and reverberation interferences, performed with the application of the proposed automated system of articulation tests, indicates the performance and high quality of the developed system.Ref. 13, fig. 7.
Highlights
Speech intelligibility assessment is an important issue because high level of speech intelligibility need be ensured in communication channels, auditoriums and concert halls
In the case of noise interference, when changing the listening mode from listening through the headphones to listening through the acoustic monitors, the intelligibility increased from 10%-30% to 85%-93% in the case of signal-to-noise ratios (SNR) minus 10 dB
This means that the SNR has increased by 17-20 dB when listening to signals through the acoustic systems, compared with listening through the headphones
Summary
Speech intelligibility assessment is an important issue because high level of speech intelligibility need be ensured in communication channels, auditoriums and concert halls. The results obtained in this way are not final, since the quality and intelligibility of speech signals are closely related but not identical concepts To eliminate these shortcomings, monosyllable articulation tables of the CVC type of the Ukrainian speech were proposed and a set of computer programs for the automation of articulation tests has been created [8]. Monosyllable articulation tables of the CVC type of the Ukrainian speech were proposed and a set of computer programs for the automation of articulation tests has been created [8] It has been shown in [8] that listening to signals through acoustic monitors (computer speakers) can lead to much more high speech intelligibility than when listening through headphones. The object of this paper is, firstly, to highlight the peculiarities of the organization of experimental studies, which were not sufficiently reflected in [8], and secondly, to make in-depth analysis of possible causes of abovementioned phenomenon of abnormally high speech intelligibility growth
Published Version (
Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have