Abstract

Emotion recognition in conversations is a challenging task that has recently gained popularity due to its potential applications. Until now, however, a large-scale multimodal multi-party emotional conversational database containing more than two speakers per dialogue has been missing. Thus, we propose the Multimodal EmotionLines Dataset (MELD), an extension and enhancement of EmotionLines. MELD contains about 13,000 utterances from 1,433 dialogues from the TV series Friends. Each utterance is annotated with emotion and sentiment labels, and encompasses audio, visual, and textual modalities. We propose several strong multimodal baselines and show the importance of contextual and multimodal information for emotion recognition in conversations. The full dataset is available for use at http://affective-meld.github.io.
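
As a rough illustration of how the per-utterance annotations described above might be consumed, the minimal sketch below loads one split of MELD and groups utterances into dialogues. The file name train_sent_emo.csv and the column names (Dialogue_ID, Utterance, Speaker, Emotion, Sentiment) are assumptions about the released format, not details stated in the abstract.

    # Minimal sketch, assuming MELD ships per-split CSVs with one row per
    # utterance and the (assumed) columns used below.
    import pandas as pd

    train = pd.read_csv("train_sent_emo.csv")  # assumed file name for the training split

    # Group utterances into dialogues so a model can exploit conversational context.
    for dialogue_id, dialogue in train.groupby("Dialogue_ID"):
        utterances = dialogue["Utterance"].tolist()
        speakers = dialogue["Speaker"].tolist()      # several speakers per dialogue
        emotions = dialogue["Emotion"].tolist()      # e.g. joy, anger, sadness, neutral, ...
        sentiments = dialogue["Sentiment"].tolist()  # positive / negative / neutral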

Highlights

  • With the rapid growth of Artificial Intelligence (AI), multimodal emotion recognition has become a major research topic, primarily due to its potential applications in many challenging tasks, such as dialogue generation, user behavior understanding, multimodal interaction, and others.

  • The remainder of the paper is organized as follows: Section 2 illustrates the EmotionLines dataset; we present the Multimodal EmotionLines Dataset (MELD) in Section 3; strong baselines and experiments are elaborated in Section 4; future directions and applications of MELD are covered in Sections 5 and 6, respectively; Section 7 concludes the paper.

  • We introduced MELD, a multimodal multi-party conversational emotion recognition dataset.


Introduction

With the rapid growth of Artificial Intelligence (AI), multimodal emotion recognition has become a major research topic, primarily due to its potential applications in many challenging tasks, such as dialogue generation, user behavior understanding, multimodal interaction, and others. A conversational emotion recognition system can be used to generate appropriate responses by analyzing user emotions (Zhou et al., 2017; Rashkin et al., 2018). Recent work proposes solutions based on multimodal memory networks (Hazarika et al., 2018), but these are mostly limited to dyadic conversations and do not scale to emotion recognition in conversations (ERC) with multiple interlocutors. This calls for a multi-party conversational data resource that can encourage research in this direction.
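
To make the scalability point concrete, the sketch below (an illustration only, not the method of Hazarika et al., 2018, and with hypothetical class and variable names) keeps one recurrent state per speaker, so the same loop handles two or more interlocutors.

    import torch
    import torch.nn as nn

    class MultiPartyContextTracker(nn.Module):
        """Toy per-speaker context tracker: one GRU cell shared across speakers,
        one hidden state per speaker, so the party size is not hard-coded."""

        def __init__(self, utt_dim: int, hidden_dim: int, num_emotions: int):
            super().__init__()
            self.cell = nn.GRUCell(utt_dim, hidden_dim)
            self.classifier = nn.Linear(hidden_dim, num_emotions)
            self.hidden_dim = hidden_dim

        def forward(self, utterance_feats, speakers):
            # utterance_feats: (seq_len, utt_dim) features for one dialogue
            # speakers: list of speaker names, one per utterance
            states = {}  # speaker -> hidden state; grows with the number of speakers
            logits = []
            for feat, spk in zip(utterance_feats, speakers):
                prev = states.get(spk, torch.zeros(1, self.hidden_dim))
                states[spk] = self.cell(feat.unsqueeze(0), prev)
                logits.append(self.classifier(states[spk]))
            return torch.cat(logits, dim=0)  # (seq_len, num_emotions)

Under this design the number of speaker states grows with the party size instead of being fixed at two, which is the scalability gap noted above.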


