Human Conversation Analysis Using Attentive Multimodal Networks with Hierarchical Encoder-Decoder.

Yue Gu, Xinyu Li, Shiyu Fu, Shuhong Chen, Kangning Yang, Moliang Zhou, Ivan Marsic, Kaixiang Huang

doi:10.1145/3240508.3240714

Abstract

Human conversation analysis is challenging because meaning can be expressed through words, intonation, or even body language and facial expressions. We introduce a hierarchical encoder-decoder structure with an attention mechanism for conversation analysis. The hierarchical encoder learns word-level features from video, audio, and text data, which are then formulated into conversation-level features. The corresponding hierarchical decoder is able to predict different attributes at given time instances. To integrate multiple sensory inputs, we introduce a novel fusion strategy with modality attention. We evaluated our system on published emotion recognition, sentiment analysis, and speaker trait analysis datasets. Our system outperformed previous state-of-the-art approaches in both classification and regression tasks on three datasets. We also outperformed previous approaches in generalization tests on two commonly used datasets. Using the proposed model instead of multiple individual models, we achieved comparable performance in predicting co-existing labels. In addition, the easily visualized modality and temporal attention demonstrated that the proposed attention mechanism helps feature selection and improves model interpretability.
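The abstract does not give the fusion formula, so the following is only a generic sketch of what attention over modalities can look like, not the authors' method. It assumes each modality has already been encoded into a feature vector of the same length, and it scores each modality with a dot product against a hypothetical learned vector `query`; the function name `modality_attention_fuse` and all values are illustrative assumptions.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def modality_attention_fuse(features, query):
    """Fuse per-modality feature vectors with attention weights.

    features: dict mapping a modality name to a feature vector
              (all vectors share one length; an assumption of this sketch).
    query:    hypothetical learned scoring vector of the same length.
    Returns the fused vector and the per-modality attention weights.
    """
    names = list(features)
    # Score each modality by its dot product with the query vector.
    scores = [sum(q * x for q, x in zip(query, features[n])) for n in names]
    weights = softmax(scores)
    # The fused vector is the attention-weighted sum of modality vectors.
    fused = [
        sum(w * features[n][i] for w, n in zip(weights, names))
        for i in range(len(query))
    ]
    return fused, dict(zip(names, weights))


# Toy inputs (made up for illustration only).
features = {"text": [1.0, 0.0], "audio": [0.0, 1.0], "video": [0.5, 0.5]}
query = [1.0, 0.0]
fused, weights = modality_attention_fuse(features, query)
```

Because the weights sum to one and are attached to named modalities, they can be inspected or plotted directly, which is the kind of interpretability the abstract attributes to modality attention.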

Journal: Proceedings of the ... ACM International Conference on Multimedia, with co-located Symposium & Workshops. ACM International Conference on Multimedia
Publication Date: Oct 15, 2018
Citations: 25

Similar Papers

Sentiment Analysis and Emotion Detection with Healthcare Perspective
Sathish Kumar ... Selvakumar Samuel
01 Jan 2021

When Homecoming is not Coming: 2021 Homecoming Ban Sentiment Analysis on Twitter Data Using Support Vector Machine Algorithm
Lidia Sandra ... Ford Lumbangaol
02 Aug 2021

Feature Selection for Highly Skewed Sentiment Analysis Tasks
Can Liu ... Ning Yu
01 Jan 2014

Japanese Political Interviews: The Integration of Conversation Analysis and Facial Expression Analysis
31 Aug 2020
