Abstract

Modeling multimodal language is a core research area in natural language processing. While languages such as English have relatively large multimodal language resources, other widely spoken languages across the globe have few or no large-scale datasets in this area. This disproportionately affects native speakers of languages other than English. As a step towards building more equitable and inclusive multimodal systems, we introduce the first large-scale multimodal language dataset for Spanish, Portuguese, German and French. The proposed dataset, called CMU-MOSEAS (CMU Multimodal Opinion Sentiment, Emotions and Attributes), is the largest of its kind with 40,000 total labelled sentences. It covers a diverse set of topics and speakers, and carries supervision of 20 labels including sentiment (and subjectivity), emotions, and attributes. Our evaluations on a state-of-the-art multimodal model demonstrate that CMU-MOSEAS enables further research for multilingual studies in multimodal language.
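To make the kinds of supervision concrete, the sketch below models one labelled example as a Python dataclass. The field names, the `[-3, 3]` sentiment range, and the record layout are assumptions for illustration only; the abstract specifies just the categories of labels (sentiment/subjectivity, emotions, attributes) and the four languages, not the dataset's actual schema.

```python
from dataclasses import dataclass, field
from typing import List

# Hypothetical sketch of one CMU-MOSEAS-style labelled sentence.
# Field names and value ranges are illustrative assumptions, not
# the dataset's real format.

@dataclass
class MoseasExample:
    sentence: str             # transcribed spoken sentence
    language: str             # one of "es", "pt", "de", "fr"
    sentiment: float          # e.g. a score in [-3, 3] (assumed range)
    subjectivity: float       # degree of subjectivity (assumed scale)
    emotions: List[str] = field(default_factory=list)    # e.g. ["happiness"]
    attributes: List[str] = field(default_factory=list)  # other labelled attributes

ex = MoseasExample(
    sentence="Me encantó la película.",  # "I loved the movie."
    language="es",
    sentiment=2.0,
    subjectivity=1.0,
    emotions=["happiness"],
)
print(ex.language, ex.sentiment)
```

A real loader would read such records from the released files together with the aligned audio and visual features, but the per-sentence, multi-label structure would look much like this.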

Highlights

  • Humans use a coordinated multimodal signal to communicate with each other

  • As Artificial Intelligence (AI) increasingly blends into everyday life across the globe, there is a genuine need for intelligent entities capable of understanding multimodal language in different cultures

  (Published in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, pages 1801–1812, November 16–20, 2020. © 2020 Association for Computational Linguistics.)

  • We believe that data of this scale presents a step towards learning human communication at a more fine-grained level, with the long-term goal of building more equitable and inclusive NLP systems

Summary

Introduction

Humans use a coordinated multimodal signal to communicate with each other. While languages such as English, Chinese, and Spanish have resources for computational analysis of multimodal language (focusing on analysis of sentiment, subjectivity, or emotions (Yu et al., 2020; Poria et al., 2019; Zadeh et al., 2018b; Park et al., 2014; Wollmer et al., 2013; Poria et al., 2020)), other commonly spoken languages across the globe lag behind. We introduce a large-scale dataset for four languages: Spanish, Portuguese, German and French. The dataset, called CMU-MOSEAS (CMU Multimodal Opinion Sentiment, Emotions and Attributes), contains 10,000 annotated sentences from across a wide variety of speakers and topics. We believe that data of this scale presents a step towards learning human communication at a more fine-grained level, with the long-term goal of building more equitable and inclusive NLP systems. We experiment with a state-of-the-art multimodal language model, and demonstrate that CMU-MOSEAS presents new challenges to the NLP community.

Related Resources
Computational Models of Multimodal Language
Acquisition and Verification
Labels
Privacy and Ethics
Annotator Selection
Label Statistics
Multimodal Feature Extraction
Findings
Experimental Baselines
