Speech intelligibility and talker gender classification with noise-vocoded and tone-vocoded speech.

Sarah Villard,Gerald Kidd

doi:10.1121/10.0006285

Abstract

Vocoded speech provides less spectral information than natural, unprocessed speech, negatively affecting listener performance on speech intelligibility and talker gender classification tasks. In this study, young normal-hearing participants listened to noise-vocoded and tone-vocoded (i.e., sinewave-vocoded) sentences containing 1, 2, 4, 8, 16, or 32 channels, as well as non-vocoded sentences, and reported the words heard as well as the gender of the talker. Overall, performance was significantly better with tone-vocoded than noise-vocoded speech for both tasks. Within the talker gender classification task, biases in performance were observed for lower numbers of channels, especially when using the noise carrier.

Highlights

Since the development of the vocoder in the first half of the 20th century (Dudley, 1939), many scientific pursuits and clinical applications have arisen that were based on the underlying technique; that is, imposing the amplitude envelopes derived on a band-by-band basis from a broadband signal on a specified carrier
There is a wide range of applications of vocoding in modern communications and music synthesis technology, the most common topic areas in the speech perception and psychological acoustics fields involve using vocoding to control various aspects of a signal in the study of normal auditory processes and as a useful analog to the information conveyed by cochlear implants (CIs), especially with regard to speech recognition (Shannon et al, 1995)
Recent work has established that differences in talker gender can be a powerful cue to overcoming masking in multiple-talker listening situations (e.g., Kidd et al, 2016) and that the control of the stimulus parameters that is possible via vocoding can be useful in determining the different factors that contribute to speech-on-speech masking (e.g., Arbogast et al, 2002; Swaminathan et al, 2016)

Summary

Introduction

Since the development of the vocoder in the first half of the 20th century (Dudley, 1939), many scientific pursuits and clinical applications have arisen that were based on the underlying technique; that is, imposing the amplitude envelopes derived on a band-by-band basis from a broadband signal (e.g., speech or music) on a specified carrier (e.g., continuous noise, tones, or pulse trains). There is a wide range of applications of vocoding in modern communications and music synthesis technology, the most common topic areas in the speech perception and psychological acoustics fields involve using vocoding to control various aspects of a signal in the study of normal auditory processes (e.g., pitch, binaural hearing, masking) and as a useful ( limited) analog to the information conveyed by cochlear implants (CIs), especially with regard to speech recognition (Shannon et al, 1995). Recent work has established that differences in talker gender can be a powerful cue to overcoming masking in multiple-talker listening situations (e.g., Kidd et al, 2016) and that the control of the stimulus parameters that is possible via vocoding can be useful in determining the different factors that contribute to speech-on-speech masking (e.g., Arbogast et al, 2002; Swaminathan et al, 2016). It is not our intent to draw conclusions about the perceptual abilities of CI listeners by using vocoder simulations in normal-hearing (NH) listeners, we do hope to establish a baseline against which future studies with CI listeners may be compared, in keeping with many past studies that have contributed to this body of knowledge (e.g., Fu et al, 2004; Whitmal et al, 2007; Yun et al, 2021)

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: JASA express letters	Publication Date: Sep 1, 2021
Citations: 4	License type: cc-by

R Discovery Prime

R Discovery Prime

Speech intelligibility and talker gender classification with noise-vocoded and tone-vocoded speech.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: JASA express letters

Lead the way for us

Similar Papers

The interaction between speech intelligibility task and non-auditory tasks
Jędrzej Kociński
Speech Communication | VOL. 66
Jędrzej KocińskiJędrzej Kociński
12 Oct 2014
Speech Communication | VOL. 66

Speech Intelligibility Improvement in Noisy Environments for Near-End Listening Enhancement
...
-
, et. al. ...
24 Mar 2016
24 Mar 2016

Comparing the effects of scattered and specular sound reflections on speech intelligibility in rooms
Nicola Prodi ... Chiara Visentin
Building and Environment | VOL. 228
Nicola Prodi, et. al.Nicola Prodi ... Chiara Visentin
30 Nov 2022
Building and Environment | VOL. 228

Speech intelligibility improvement in noisy environments based on energy correlation in frequency bands
Peyman Goli ... Mohammad Reza Karami-Mollaei
Digital Signal Processing | VOL. 62
Peyman Goli, et. al.Peyman Goli ... Mohammad Reza Karami-Mollaei
15 Dec 2016
Digital Signal Processing | VOL. 62

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speech intelligibility and talker gender classification with noise-vocoded and tone-vocoded speech.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: JASA express letters