Abstract
Causal inference—the process of deciding whether two incoming signals come from the same source—is an important step in audiovisual (AV) speech perception. This research explored causal inference and perception of incongruent AV English consonants. Nine adults were presented auditory, visual, congruent AV, and incongruent AV consonant-vowel syllables. Incongruent AV stimuli included auditory and visual syllables with matched vowels, but mismatched consonants. Open-set responses were collected. For most incongruent syllables, participants were aware of the mismatch between auditory and visual signals (59.04%) or reported the auditory syllable (33.73%). Otherwise, participants reported the visual syllable (1.13%) or some other syllable (6.11%). Statistical analyses were used to assess whether visual distinctiveness and place, voice, and manner features predicted responses. Mismatch responses occurred more when the auditory and visual consonants were visually distinct, when place and manner differed across auditory and visual consonants, and for consonants with high visual accuracy. Auditory responses occurred more when the auditory and visual consonants were visually similar, when place and manner were the same across auditory and visual stimuli, and with consonants produced further back in the mouth. Visual responses occurred more when voicing and manner were the same across auditory and visual stimuli, and for front and middle consonants. Other responses were variable, but typically matched the visual place, auditory voice, and auditory manner of the input. Overall, results indicate that causal inference and incongruent AV consonant perception depend on salience and reliability of auditory and visual inputs and degree of redundancy between auditory and visual inputs. A parameter-free computational model of incongruent AV speech perception based on unimodal confusions, with a causal inference rule, was applied. Data from the current study present an opportunity to test and improve the generalizability of current AV speech integration models.
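The abstract mentions a parameter-free computational model based on unimodal confusions combined with a causal inference rule, but does not spell out the algorithm here. The following Python sketch illustrates one plausible form of such a model under stated assumptions: unimodal confusion rows are multiplied and renormalized to produce a fused percept, and a "mismatch" report is made when the auditory and visual distributions overlap too little to support a common cause. The consonant set, the confusion data, and the threshold are all hypothetical placeholders, not values from the study.

```python
import numpy as np

# Hypothetical consonant inventory and unimodal confusion matrices
# (rows: presented consonant, columns: reported consonant). In the study,
# these would be estimated from auditory-only and visual-only trials.
CONSONANTS = ["b", "d", "g", "p", "t", "k"]
rng = np.random.default_rng(0)

def normalize(m):
    """Row-normalize a confusion matrix so each row is a probability distribution."""
    return m / m.sum(axis=1, keepdims=True)

# Placeholder confusion data: auditory identification is assumed more accurate
# (stronger diagonal) than visual identification.
aud_conf = normalize(rng.random((len(CONSONANTS), len(CONSONANTS))) + 5 * np.eye(len(CONSONANTS)))
vis_conf = normalize(rng.random((len(CONSONANTS), len(CONSONANTS))) + 2 * np.eye(len(CONSONANTS)))

def incongruent_response(aud_idx, vis_idx, mismatch_threshold=0.5):
    """Predict the response to an incongruent AV token.

    Fusion step: multiply the unimodal report probabilities and renormalize.
    Causal-inference step: if the unimodal distributions overlap very little,
    treat the signals as coming from separate causes and report a mismatch.
    The threshold is illustrative, not a fitted parameter from the paper.
    """
    fused = aud_conf[aud_idx] * vis_conf[vis_idx]
    if fused.sum() == 0:
        return "mismatch"
    fused /= fused.sum()
    # Overlap between unimodal report distributions: low overlap is evidence
    # that the auditory and visual consonants came from different sources.
    overlap = np.sum(np.minimum(aud_conf[aud_idx], vis_conf[vis_idx]))
    if overlap < mismatch_threshold:
        return "mismatch"
    return CONSONANTS[int(np.argmax(fused))]

# Example: classic McGurk-style pairing of auditory /b/ with visual /g/.
print(incongruent_response(CONSONANTS.index("b"), CONSONANTS.index("g")))
```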
Highlights
Mismatch responses were more common for visual consonants associated with high visual accuracy
When participants could accurately identify visual consonants, they were more likely to notice that what they saw and heard did not match. These results suggest stimulus uncertainty governs causal inference in AV speech perception
The same relationship was not observed for auditory consonant accuracy, though this analysis was limited by the overall high identification accuracy for most auditory consonants
Summary
In face-to-face communication, we automatically combine speech information from the face and voice. Perception of incongruent audiovisual English consonants is famously demonstrated by the popular McGurk illusion [1]. When presented with an auditory /bɑbɑ/ paired with a visual /gɑgɑ/, participants often perceive a fused /dɑdɑ/ that is not present in either modality. The McGurk illusion demonstrates that the brain integrates signals from across modalities into a single perceptual representation.