Effects of Spatial Speech Presentation on Listener Response Strategy for Talker-Identification.

Stefan Uhrig, Sebastian Möller, Dawn M. Behne, U. Peter Svensson, Andrew Perkis

doi:10.3389/fnins.2021.730744

Abstract

This study investigates effects of spatial auditory cues on human listeners' response strategy for identifying two alternately active talkers (“turn-taking” listening scenario). Previous research has demonstrated subjective benefits of audio spatialization with regard to speech intelligibility and talker-identification effort. So far, the deliberate activation of specific perceptual and cognitive processes by listeners to optimize their task performance remained largely unexamined. Spoken sentences selected as stimuli were either clean or degraded due to background noise or bandpass filtering. Stimuli were presented via three horizontally positioned loudspeakers: In a non-spatial mode, both talkers were presented through a central loudspeaker; in a spatial mode, each talker was presented through the central or a talker-specific lateral loudspeaker. Participants identified talkers via speeded keypresses and afterwards provided subjective ratings (speech quality, speech intelligibility, voice similarity, talker-identification effort). In the spatial mode, presentations at lateral loudspeaker locations entailed quicker behavioral responses, which were significantly slower in comparison to a talker-localization task. Under clean speech, response times globally increased in the spatial vs. non-spatial mode (across all locations); these “response time switch costs,” presumably being caused by repeated switching of spatial auditory attention between different locations, diminished under degraded speech. No significant effects of spatialization on subjective ratings were found. The results suggested that when listeners could utilize task-relevant auditory cues about talker location, they continued to rely on voice recognition instead of localization of talker sound sources as primary response strategy. Besides, the presence of speech degradations may have led to increased cognitive control, which in turn compensated for incurring response time switch costs.

Highlights

The ability of the human auditory system for rapid extraction of spatial auditory cues is thought to facilitate perceptual and cognitive speech processing, especially under adverse and dynamic listening conditions (Zekveld et al., 2014; Koelewijn et al., 2015).
Significant main effects of speech degradation resulted for all four subjective constructs: Speech quality [F(2, 62) = …]
Task difficulty most likely depends on the amount of information processing resources [i.e., the perceptual-cognitive load (Wickens, 2008); measurable, e.g., by pupillometry or EEG] allocated to discriminating between the two talkers’ voices, which was higher for degraded vs. clean speech; better talker voice discriminability would in turn ease talker-identification (TI) based on voice recognition.

Summary

Introduction

The ability of the human auditory system for rapid extraction of spatial auditory cues is thought to facilitate perceptual and cognitive speech processing, especially under adverse and dynamic listening conditions (Zekveld et al., 2014; Koelewijn et al., 2015). Past research usually centered around listening situations involving multiple, simultaneously active talkers. The present study addresses another kind of listening situation in dyadic human–human conversation, namely when two talkers take turns in active speaking time (with silence gaps in between). Does auditory information cuing talker location affect behavioral talker-identification (TI) performance in this “turn-taking” listening scenario (Lin and Carlile, 2015, 2019)? If significant effects exist, what are their underlying perceptual and cognitive processes? To what extent do such effects depend on speech degradations, and to what extent do they impact attributes of subjective listening experience such as perceived speech quality, speech intelligibility, or talker-identification effort?


Journal: Frontiers in Neuroscience
Publication Date: Jan 28, 2022
Citations: 2
License type: CC BY 4.0
