Abstract

This study introduces a Quality of Experience (QoE) model of loudspeaker-based spatial speech reproduction, which specifies quality elements and quality features relevant to overall listening experience and Quality of Service (QoS), respectively. Assumptions about the relations between selected quality elements and quality features were validated in a listening-only test. Participants had the task to behaviorally identify the voices of two talkers and provided subjective judgments on category rating scales. Sentences uttered by the two talkers were alternately presented through only a central loudspeaker (non-spatial mode) versus through either the central or one talker-specific lateral loudspeaker (spatial mode). The transmitted speech signals were clean, superimposed with background noise or bandpass-filtered. Results showed that transmission quality, but not reproduction mode significantly affected evaluative (speech quality, speech intelligibility), immersion (voice naturalness, spatial presence, social presence) and task-related (voice similarity, talker identification effort) aspects of overall listening experience. Contrary to previous findings, the spatial mode did not reduce the effort of talker identification. This suggested that noticeable advantages from spatial reproduction may not be evident in relatively simple listening situations, but could manifest if complexity is increased. Besides their task-related influence, noise and spectral distortion should be mitigated when targeting a higher level of immersion.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call