Abstract

Appropriate embedding transformations of sentences can aid downstream natural language processing (NLP) tasks such as emotion and behavior analysis. Such efforts evolved from word vectors that were trained in an unsupervised manner using large-scale corpora. Recent research, however, has shown that sentence embeddings trained on in-domain data or with supervised techniques, often through multitask learning, outperform unsupervised ones. Representations have also been shown to be applicable across multiple tasks, especially when training incorporates multiple information sources. In this work we aim to combine the simplicity of using abundant unsupervised data with transfer learning by introducing an online multitask objective. We present a multitask paradigm for unsupervised learning of sentence embeddings that simultaneously addresses domain adaptation. We show that embeddings generated through this process improve performance on subsequent domain-relevant tasks. We evaluate on the affective tasks of emotion recognition and behavior analysis and compare our results with state-of-the-art general-purpose supervised sentence embeddings. Our unsupervised sentence embeddings outperform the alternative universal embeddings both in identifying behaviors within couples therapy and in emotion recognition.
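To make the online multitask objective concrete, the sketch below is a minimal illustration, not the paper's implementation: the model names, dimensions, toy data, and the 0.5 loss weight are assumptions. It shows the general pattern of sharing one sentence encoder between an unsupervised word-prediction loss and an auxiliary domain-prediction loss, with the two losses optimized jointly in a single step.

```python
# Hedged sketch of a joint unsupervised + domain-adaptation objective (illustrative only).
import torch
import torch.nn as nn

class MultitaskSentenceEncoder(nn.Module):
    """Toy encoder: mean-pooled word embeddings form the sentence vector."""
    def __init__(self, vocab_size, emb_dim, num_domains):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.decoder = nn.Linear(emb_dim, vocab_size)        # unsupervised head: predict a context word
        self.domain_head = nn.Linear(emb_dim, num_domains)   # auxiliary head: predict the domain

    def forward(self, token_ids):
        return self.embed(token_ids).mean(dim=1)              # (batch, emb_dim) sentence vector

# Hypothetical batch: sentence token ids, a context word to predict, and a domain label.
model = MultitaskSentenceEncoder(vocab_size=1000, emb_dim=64, num_domains=2)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
ce = nn.CrossEntropyLoss()

tokens = torch.randint(0, 1000, (8, 12))      # 8 sentences, 12 tokens each
context_word = torch.randint(0, 1000, (8,))   # unsupervised target
domain = torch.randint(0, 2, (8,))            # in-domain vs. out-of-domain label

sent_vec = model(tokens)
loss_unsup = ce(model.decoder(sent_vec), context_word)   # unsupervised contextual objective
loss_domain = ce(model.domain_head(sent_vec), domain)    # domain-adaptation objective
loss = loss_unsup + 0.5 * loss_domain                    # weighted online multitask loss
opt.zero_grad()
loss.backward()
opt.step()
```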

Highlights

  • Representation learning has become a crucial tool for obtaining superior results in many machine learning tasks (Bengio, Courville & Vincent, 2013)

  • We evaluated the effectiveness of our sentence embeddings in emotion recognition using the Interactive Emotional Dyadic Motion Capture database (IEMOCAP) (Busso et al., 2008)

  • We evaluated the performance of our unsupervised multitask sentence embeddings on the task of behavior annotation in the Couples Therapy Corpus, as well as emotion recognition on the IEMOCAP dataset


Introduction

Representation learning has become a crucial tool for obtaining superior results in many machine learning tasks (Bengio, Courville & Vincent, 2013). Word embeddings exploit regularities in language use by learning semantic relations from the context of neighboring words. This form of contextual learning is unsupervised, which allows training on large-scale corpora and is the main reason for the performance gains it brings to many tasks such as constituency parsing (Tai, Socher & Manning, 2015), sentiment analysis (Dos Santos & Gatti, 2014; Severyn & Moschitti, 2015), natural language inference (Parikh et al., 2016), and video/image captioning (Karpathy & Fei-Fei, 2015; Venugopalan et al., 2016).
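As an illustration of this unsupervised contextual learning, the following minimal sketch trains skip-gram word vectors with gensim. The toy corpus and hyperparameters are assumptions chosen for demonstration, not the setup used in the cited works.

```python
# Minimal skip-gram sketch: word vectors learned from neighboring-word context, no labels.
from gensim.models import Word2Vec

corpus = [
    ["she", "feels", "anxious", "about", "the", "session"],
    ["he", "responds", "with", "warmth", "and", "acceptance"],
]

# sg=1 selects the skip-gram objective: predict the words within `window` positions
# of each center word, which is what induces the semantic regularities.
model = Word2Vec(corpus, vector_size=50, window=3, min_count=1, sg=1, epochs=10)
print(model.wv["anxious"].shape)  # (50,) vector learned purely from context
```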
