Predicting multimodal presentation skills based on instance weighting domain adaptation

Yutaro Yagi,Sota Sugimura,Shota Shiobara,Shogo Okada

doi:10.1007/s12193-021-00367-x

Abstract

Presentation skills assessment is one of the central challenges of multimodal modeling. Presentation skills are composed of verbal and nonverbal skill components, but because people demonstrate their presentation skills in a variety of manners, the observed multimodal features vary widely. Due to the differences in features, when test data samples are generated on different training data sample distributions, in many cases, the prediction accuracy of the skills degrades. In machine learning theory, this problem in which training (source) data are biased is known as instance selection bias or covariate shift. To solve this problem, this paper presents an instance weighting adaptation method that is applied to estimate the presentation skills of each participant from multimodal (verbal and nonverbal) features. For this purpose, we collect a novel multimodal presentation dataset that includes audio signal data, body motion sensor data, and text data of the speech content for participants observed in 58 presentation sessions. The dataset also includes both verbal and nonverbal presentation skills, which are assessed by two external experts from a human resources department. We extract multimodal features, such as spoken utterances, acoustic features, and the amount of body motion, to estimate the presentation skills. We propose two approaches, early fusing and late fusing, for the regression models based on multimodal instance weighting adaptation. The experimental results show that the early fusing regression model with instance weighting adaptation achieved $$\rho =0.39$$ for the Pearson correlation, which presents the regression accuracy for the clarity of presentation goal elements. In the maximum case, the accuracy (correlation coefficient) is improved from $$-0.34$$ to +0.35 by instance weighting adaptation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Predicting multimodal presentation skills based on instance weighting domain adaptation

Abstract

Talk to us

Similar Papers

More From: Journal on Multimodal User Interfaces

Lead the way for us

Journal: Journal on Multimodal User Interfaces	Publication Date: Feb 18, 2021
Citations: 5

Similar Papers

Reflections on Blended Learning: A Case Study at the Open University of Hong Kong
Anna Wing Bo Tso
Asian Association of Open Universities Journal | VOL. 10
Anna Wing Bo TsoAnna Wing Bo Tso
01 Jun 2015
Asian Association of Open Universities Journal | VOL. 10

Integrating acoustic and lexical features in topic segmentation of Chinese broadcast news using maximum entropy approach
Lei Xie ... Zihan Liu
-
Lei Xie, et. al.Lei Xie ... Zihan Liu
01 Nov 2010
01 Nov 2010

Real-Time Robotic Presentation Skill Scoring Using Multi-Model Analysis and Fuzzy Delphi-Analytic Hierarchy Process.
Rafeef Fauzi Najim Alshammari ... Haslina Arshad
Sensors (Basel, Switzerland) | VOL. 23
Rafeef Fauzi Najim Alshammari, et. al.Rafeef Fauzi Najim Alshammari ... Haslina Arshad
05 Dec 2023
Sensors (Basel, Switzerland) | VOL. 23

Lq regularization for fair artificial intelligence robust to covariate shift
Seonghyeon Kim ... Kunwoong Kim
Statistical Analysis and Data Mining: The ASA Data Science Journal | VOL. 16
Seonghyeon Kim, et. al.Seonghyeon Kim ... Kunwoong Kim
22 Feb 2023
Statistical Analysis and Data Mining: The ASA Data Science Journal | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Predicting multimodal presentation skills based on instance weighting domain adaptation

Abstract

Talk to us

Similar Papers

More From: Journal on Multimodal User Interfaces