Using auditory classification images for the identification of fine acoustic cues used in speech perception.

Léo Varnet, Kenneth Knoblauch, Fanny Meunier, Michel Hoen

doi:10.3389/fnhum.2013.00865

Abstract

An essential step in understanding the processes underlying the general mechanism of perceptual categorization is to identify which portions of a physical stimulation modulate the behavior of our perceptual system. More specifically, in the context of speech comprehension, it is still a major open challenge to understand which information is used to categorize a speech stimulus as one phoneme or another, since the auditory primitives relevant for the categorical perception of speech are still unknown. Here we propose to adapt to auditory experiments a method relying on a Generalized Linear Model (GLM) with smoothness priors, already used in the visual domain for the estimation of so-called classification images. This statistical model offers a rigorous framework for dealing with non-Gaussian noise, as is often the case in the auditory modality, and limits the amount of noise in the estimated template by enforcing smoother solutions. By applying this technique to a specific two-alternative forced-choice experiment between the stimuli “aba” and “ada” in noise with an adaptive SNR, we confirm that the second formantic transition is key for classifying phonemes into /b/ or /d/ in noise, and that its estimation by the auditory system is a relative measurement across spectral bands and in relation to the perceived height of the second formant in the preceding syllable. Through this example, we show how the GLM with smoothness priors approach can be applied to the identification of fine functional acoustic cues in speech perception. Finally, we discuss some assumptions of the model in the specific case of speech perception.
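To make the idea of a GLM with smoothness priors concrete, here is a minimal sketch, not the authors' implementation. It assumes a logistic link, a quadratic first-difference penalty on the time-frequency template, and Newton iterations for the fit. The function names, the array layout `(n_trials, n_freq, n_time)`, and the two penalty weights `lam_freq` and `lam_time` are hypothetical choices made for illustration.

```python
import numpy as np

def first_difference(n):
    """(n-1) x n matrix D with (D @ v)[i] = v[i+1] - v[i]."""
    return np.diff(np.eye(n), axis=0)

def smoothness_penalty(n_freq, n_time, lam_freq, lam_time):
    """Quadratic penalty matrix for a template flattened in row-major order.

    Assumption of this sketch: smoothness is measured by squared first
    differences, weighted separately along frequency and along time.
    """
    d_f = first_difference(n_freq)
    d_t = first_difference(n_time)
    p_freq = np.kron(d_f.T @ d_f, np.eye(n_time))  # differences across frequency
    p_time = np.kron(np.eye(n_freq), d_t.T @ d_t)  # differences across time
    return lam_freq * p_freq + lam_time * p_time

def fit_smooth_glm(noise, responses, lam_freq, lam_time, n_iter=100, tol=1e-8):
    """Penalized logistic regression of binary responses on the noise fields.

    noise:     array (n_trials, n_freq, n_time), noise added on each trial
    responses: array (n_trials,) of 0/1 answers
    Returns (intercept, template), template of shape (n_freq, n_time).
    """
    n_trials, n_freq, n_time = noise.shape
    X = np.hstack([np.ones((n_trials, 1)), noise.reshape(n_trials, -1)])
    P = np.zeros((X.shape[1], X.shape[1]))
    P[1:, 1:] = smoothness_penalty(n_freq, n_time, lam_freq, lam_time)
    P += 1e-6 * np.eye(X.shape[1])  # tiny ridge, only for numerical stability
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        eta = np.clip(X @ beta, -30.0, 30.0)
        mu = 1.0 / (1.0 + np.exp(-eta))             # predicted P(response = 1)
        grad = X.T @ (responses - mu) - P @ beta    # gradient of penalized log-likelihood
        hess = (X * (mu * (1.0 - mu))[:, None]).T @ X + P
        step = np.linalg.solve(hess, grad)          # Newton step
        beta += step
        if np.max(np.abs(step)) < tol:
            break
    return beta[0], beta[1:].reshape(n_freq, n_time)

def roughness(image):
    """Sum of squared first differences along both axes."""
    return float(np.sum(np.diff(image, axis=0) ** 2)
                 + np.sum(np.diff(image, axis=1) ** 2))
```

Larger penalty weights trade fidelity to the trial-by-trial data for a smoother template; how the weights should be chosen is not specified here and would normally be decided by a criterion such as cross-validation.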

Highlights

A major challenge in psychophysics is to establish exactly which parts of a complex physical stimulation modulate its percept by an observer and constrain his/her behavior toward that stimulus.
As we do not know the appropriate importance of smoothness along the time and frequency axes, we introduce two. This Generalized Linear Model (GLM) relates strongly to that derived from Equation (4), except that it does not take into account information about the target signal that was presented at each trial.
The signal-to-noise ratio (SNR) was manipulated across trials via an adaptive procedure, in order to maintain the percentage of correct answers roughly equal to 75% during the course of the entire experiment.
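The text does not say which adaptive procedure was used. As one possible illustration, the sketch below implements a weighted up-down rule, in which the SNR increase after an error is three times the decrease after a correct answer, so that the track settles where correct answers occur about 75% of the time. The function name, step sizes, starting SNR, and the simulated listener are all hypothetical.

```python
import numpy as np

def weighted_up_down(respond, n_trials=2000, start_snr_db=0.0,
                     step_down_db=0.5, target=0.75):
    """Run a weighted up-down track on the SNR (in dB).

    respond: callable taking the current SNR and returning True if the
             (real or simulated) listener answered correctly.
    The equilibrium satisfies p * step_down = (1 - p) * step_up,
    so step_up = step_down * target / (1 - target) gives p = target.
    """
    step_up_db = step_down_db * target / (1.0 - target)
    snr = start_snr_db
    track, correct = [], []
    for _ in range(n_trials):
        track.append(snr)
        ok = bool(respond(snr))
        correct.append(ok)
        snr += -step_down_db if ok else step_up_db  # harder after a hit, easier after a miss
    return np.array(track), np.array(correct)

def make_simulated_listener(midpoint_db=-10.0, slope_db=2.0, seed=0):
    """Hypothetical two-alternative listener: 50% correct at very low SNR,
    100% at very high SNR, 75% at midpoint_db."""
    rng = np.random.default_rng(seed)
    def respond(snr_db):
        p_correct = 0.5 + 0.5 / (1.0 + np.exp(-(snr_db - midpoint_db) / slope_db))
        return rng.random() < p_correct
    return respond
```

For example, `weighted_up_down(make_simulated_listener())` returns the SNR track and the per-trial correctness; averaging the second array gives the overall proportion correct for the simulated run.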

Summary

METHODS

Reviewed by: Shanqing Cai, Boston University, USA; Nima Mesgarani, Columbia University, USA.

This statistical model offers a rigorous framework for dealing with non-Gaussian noise, as is often the case in the auditory modality, and limits the amount of noise in the estimated template by enforcing smoother solutions. By applying this technique to a specific two-alternative forced-choice experiment between the stimuli “aba” and “ada” in noise with an adaptive SNR, we confirm that the second formantic transition is key for classifying phonemes into /b/ or /d/ in noise, and that its estimation by the auditory system is a relative measurement across spectral bands and in relation to the perceived height of the second formant in the preceding syllable.

INTRODUCTION

MATERIALS AND METHODS

RESULTS

DISCUSSION

CONCLUSIONS


Journal: Frontiers in Human Neuroscience
Publication Date: Jan 1, 2013
Citations: 24
License type: CC BY


