Abstract
Prior work has shown that the mouth area can yield articulatory features of speech segments and durational information (Navarra et al., 2010), while pitch and speech amplitude are cued by the eyebrows and other head movements (Hamarneh et al., 2019). It has also been reported that adults look more at the mouth when evaluating speech information in a non-native language (Barenholtz et al., 2016). In the present study, we ask how listeners' visual scanning of a talking face is affected by task demands that specifically target prosodic and segmental information, a question not examined in prior work. Twenty-five native English speakers heard two audio sentences in English (the native language) or Mandarin (the non-native language) that could differ in segmental information, prosodic information, or both, and then saw a silent video of a talking face. Their task was to judge whether the video matched the first or the second audio sentence (or whether the two sentences were the same). The results show that although looking was generally weighted towards the mouth, reflecting task demands, increased looking to the mouth predicted correct responses only on Mandarin trials. This effect was more pronounced in the Prosody and Both conditions than in the Segment condition (p < 0.05). The results suggest a link between mouth-looking and the extraction of speech-relevant information at both prosodic and segmental levels, but only under high cognitive load.