Abstract

Multi-modal sentiment and emotion analysis has emerged as a prominent field at the intersection of natural language processing, deep learning, machine learning, computer vision, and speech processing. A sentiment and emotion prediction model identifies the attitude of a speaker or writer towards a discussion, debate, event, document, or topic. This attitude can be expressed in different ways, such as the words spoken, the energy and tone of delivery, and the accompanying facial expressions and gestures. Moreover, related and similar tasks generally depend on each other and are predicted more accurately when solved within a joint framework. In this paper, we present a multi-task gated contextual cross-modal attention framework that considers all three modalities (viz. text, acoustic, and visual) and multiple utterances jointly for sentiment and emotion prediction. We evaluate our proposed approach on the CMU-MOSEI dataset for sentiment and emotion prediction. Evaluation results show that our proposed approach captures the correlation among the three modalities and improves over the previous state-of-the-art models.
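
To make the core idea concrete, below is a minimal, illustrative sketch of gated cross-modal attention feeding two task heads (sentiment and emotion), written in PyTorch. It is not the authors' exact architecture: the module names, feature dimension, pairwise text-to-acoustic / text-to-visual attention pattern, and head shapes are all assumptions made for illustration only.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class GatedCrossModalAttention(nn.Module):
    """Pairwise cross-modal attention with a sigmoid gate (illustrative).

    One modality (e.g. text) attends over another (e.g. acoustic) across the
    utterances of a video; a learned gate controls how much of the attended
    signal is mixed back into the source representation.
    """

    def __init__(self, dim):
        super().__init__()
        self.gate = nn.Linear(2 * dim, dim)

    def forward(self, src, ctx):
        # src, ctx: (batch, num_utterances, dim)
        scores = torch.matmul(src, ctx.transpose(1, 2))        # (B, U, U)
        attn = F.softmax(scores, dim=-1)
        attended = torch.matmul(attn, ctx)                     # (B, U, dim)
        g = torch.sigmoid(self.gate(torch.cat([src, attended], dim=-1)))
        return g * attended + (1.0 - g) * src                  # gated fusion


class MultiTaskFusionModel(nn.Module):
    """Fuses text/acoustic/visual utterance features and emits two heads:
    a sentiment score and multi-label emotion logits (multi-task)."""

    def __init__(self, dim=128, num_emotions=6):
        super().__init__()
        self.text_to_acoustic = GatedCrossModalAttention(dim)
        self.text_to_visual = GatedCrossModalAttention(dim)
        self.sentiment_head = nn.Linear(3 * dim, 1)
        self.emotion_head = nn.Linear(3 * dim, num_emotions)

    def forward(self, text, acoustic, visual):
        fused = torch.cat([text,
                           self.text_to_acoustic(text, acoustic),
                           self.text_to_visual(text, visual)], dim=-1)
        return self.sentiment_head(fused), self.emotion_head(fused)


if __name__ == "__main__":
    model = MultiTaskFusionModel(dim=128)
    B, U, D = 2, 10, 128                    # 2 videos, 10 utterances each
    text, acoustic, visual = (torch.randn(B, U, D) for _ in range(3))
    sentiment, emotions = model(text, acoustic, visual)
    print(sentiment.shape, emotions.shape)  # (2, 10, 1) (2, 10, 6)
```

Sharing the fused representation between the two heads is what makes the setup multi-task: both sentiment and emotion losses back-propagate through the same gated cross-modal attention layers.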
