Quantitative analysis of multimodal speech data

Samantha Gordon Danner,Adriano Vilela Barbosa,Louis Goldstein

doi:10.1016/j.wocn.2018.09.007

Samantha Gordon Danner, Adriano Vilela Barbosa + Show 1 more

Open Access

https://doi.org/10.1016/j.wocn.2018.09.007

Copy DOI

Export

Save

Cite

Abstract
Full-Text
Similar Papers

Abstract

Listen

This study presents techniques for quantitatively analyzing coordination and kinematics in multimodal speech using video, audio and electromagnetic articulography (EMA) data. Multimodal speech research has flourished due to recent improvements in technology, yet gesture detection/annotation strategies vary widely, leading to difficulty in generalizing across studies and in advancing this field of research. We describe how FlowAnalyzer software can be used to extract kinematic signals from basic video recordings; and we apply a technique, derived from speech kinematic research, to detect bodily gestures in these kinematic signals. We investigate whether kinematic characteristics of multimodal speech differ dependent on communicative context, and we find that these contexts can be distinguished quantitatively, suggesting a way to improve and standardize existing gesture identification/annotation strategy. We also discuss a method, Correlation Map Analysis (CMA), for quantifying the relationship between speech and bodily gesture kinematics over time. We describe potential applications of CMA to multimodal speech research, such as describing characteristics of speech-gesture coordination in different communicative contexts. The use of the techniques presented here can improve and advance multimodal speech and gesture research by applying quantitative methods in the detection and description of multimodal speech.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Phonetics	Publication Date: Oct 19, 2018
Citations: 20	License type: cc-by-nc-nd

R Discovery Prime

Quantitative analysis of multimodal speech data

Abstract

Published Version

Talk to us

Similar Papers

More From: Journal of Phonetics

Lead the way for us

Similar Papers

Towards an Intrinsic Interpretability Approach for Multimodal Hate Speech Detection
Pengfei Du ... Xiaoyong Li
International Journal of Pattern Recognition and Artificial Intelligence | VOL. 36
Pengfei Du, et. al.Pengfei Du ... Xiaoyong Li
28 Sep 2022
International Journal of Pattern Recognition and Artificial Intelligence | VOL. 36

Assessing multimodal speech-gesture coordination: correlation map analysis for signal comparison
Samantha G Danner
The Journal of the Acoustical Society of America | VOL. 144
Samantha G DannerSamantha G Danner
01 Sep 2018
The Journal of the Acoustical Society of America | VOL. 144

Spatial and temporal alignment of multimodal human speech production data: Real time imaging, flesh point tracking and audio
Jangwon Kim ... Adam Lammert
-
Jangwon Kim, et. al.Jangwon Kim ... Adam Lammert
01 May 2013
01 May 2013

Concluding Remarks
Gale Stam ... Kimberly (Buescher) Urbanski
-
Gale Stam, et. al.Gale Stam ... Kimberly (Buescher) Urbanski
03 Aug 2022
03 Aug 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Quantitative analysis of multimodal speech data

Abstract

Published Version

Talk to us

Similar Papers

More From: Journal of Phonetics