Predictive power in models of audiovisual integration of speech

Tobias Søren Andersen

doi:10.1163/187847612x647379

Abstract

Seeing the talking face can influence the phoneme perceived from the voice. This facilitates speech perception in the natural case where the face and voice are congruent and can cause the McGurk illusion when they are not. The classical example of the McGurk illusion is when acoustic /aba/ is perceived as /ada/ when dubbed onto a face articulating /aga/. In order to fully understand the underlying process of integrating information across the senses we need a computational account with predictive power. The Fuzzy Logical Model of Perception is one computational account of audiovisual integration in speech perception. Here we describe alternative accounts in which integration is based on an early continuous internal representation on which the phonetic classes fall. We show that these alternative accounts can provide just as good a fit when corrected for the number of free parameters. We also show, using cross-validation, that they have greater, but not great, predictive power. Finally, we show that introducing a regularization term can amend the lack of predictive power. With regularization, models based on continuous representations have the highest predictive power.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Predictive power in models of audiovisual integration of speech

Abstract

Talk to us

Similar Papers

More From: Seeing and Perceiving

Lead the way for us

Journal: Seeing and Perceiving	Publication Date: Jan 1, 2012
Citations: 1

Similar Papers

Regularization improves models of audiovisual integration in speech perception
Tobias S Andersen
-
Tobias S AndersenTobias S Andersen
01 Jan 2013
01 Jan 2013

The early maximum likelihood estimation model of audiovisual integration in speech perception.
Tobias S Andersen
The Journal of the Acoustical Society of America | VOL. 137
Tobias S AndersenTobias S Andersen
01 May 2015
The Journal of the Acoustical Society of America | VOL. 137

Regularized models of audiovisual integration of speech with predictive power for sparse behavioral data
Tobias S Andersen ... Ole Winther
Journal of Mathematical Psychology | VOL. 98
Tobias S Andersen, et. al.Tobias S Andersen ... Ole Winther
25 Jul 2020
Journal of Mathematical Psychology | VOL. 98

Testing between the TRACE model and the fuzzy logical model of speech perception
Dominic W Massaro
Cognitive Psychology | VOL. 21
Dominic W MassaroDominic W Massaro
01 Jul 1989
Cognitive Psychology | VOL. 21

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Predictive power in models of audiovisual integration of speech

Abstract

Talk to us

Similar Papers

More From: Seeing and Perceiving