Abstract
To give more humanized responses in Voice Dialogue Applications (VDAs), inferring the emotional state behind users' queries can play an important role. However, VDAs serve a tremendous number of users and generate massive amounts of unlabeled data with high-dimensional features drawn from multimodal information, which challenges traditional speech emotion recognition methods. In this paper, to better infer emotion from conversational voice data, we propose a semi-supervised multi-path generative neural network. Specifically, we first build a novel supervised multi-path deep neural network framework. To avoid a high-dimensional input, raw features are trained in groups by local classifiers; the high-level features of each local classifier are then concatenated as the input of a global classifier. These two kinds of classifiers are trained simultaneously through a single objective function to achieve more effective and discriminative emotion inference. To further address the scarcity of labeled data, we extend the multi-path deep neural network to a generative model based on a semi-supervised variational autoencoder (semi-VAE), which can be trained on labeled and unlabeled data simultaneously. Experiments on a real-world dataset of 24,000 samples collected from Sogou Voice Assistant (SVAD13) and on the benchmark IEMOCAP dataset show that our method significantly outperforms existing state-of-the-art results.
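To make the multi-path idea concrete, the sketch below shows one plausible reading of the supervised framework: each feature group is handled by its own local classifier, the groups' high-level features are concatenated and fed to a global classifier, and both kinds of classifiers are optimized through a single objective. This is a minimal illustration in PyTorch; the layer widths, feature-group sizes, and loss weighting `alpha` are assumptions for illustration, not the authors' exact architecture.

```python
# Minimal sketch of a multi-path network with local and global classifiers.
# All dimensions and the loss weighting are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class LocalPath(nn.Module):
    """One path: classifies a single feature group and exposes its hidden features."""
    def __init__(self, in_dim, hidden_dim, n_classes):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(in_dim, hidden_dim), nn.ReLU())
        self.head = nn.Linear(hidden_dim, n_classes)

    def forward(self, x):
        h = self.encoder(x)           # high-level features of this group
        return h, self.head(h)        # features + local emotion logits

class MultiPathNet(nn.Module):
    """Local classifiers per feature group, plus a global classifier over
    the concatenated high-level features of all groups."""
    def __init__(self, group_dims, hidden_dim, n_classes):
        super().__init__()
        self.paths = nn.ModuleList(
            LocalPath(d, hidden_dim, n_classes) for d in group_dims)
        self.global_head = nn.Linear(hidden_dim * len(group_dims), n_classes)

    def forward(self, groups):
        feats, local_logits = zip(*(p(x) for p, x in zip(self.paths, groups)))
        return self.global_head(torch.cat(feats, dim=1)), local_logits

def joint_loss(global_logits, local_logits, y, alpha=0.5):
    # Single objective: global loss plus down-weighted local losses,
    # so both kinds of classifiers are trained simultaneously.
    loss = F.cross_entropy(global_logits, y)
    for logits in local_logits:
        loss = loss + alpha * F.cross_entropy(logits, y)
    return loss

# Usage with hypothetical feature groups, e.g. prosodic (64-d) and spectral (128-d):
model = MultiPathNet(group_dims=[64, 128], hidden_dim=32, n_classes=4)
groups = [torch.randn(8, 64), torch.randn(8, 128)]   # batch of 8 utterances
y = torch.randint(0, 4, (8,))
global_logits, local_logits = model(groups)
joint_loss(global_logits, local_logits, y).backward()
```

The semi-supervised extension described in the abstract would replace this purely discriminative objective with a semi-VAE objective in the style of Kingma et al. (2014), adding a reconstruction term for unlabeled data; that part is omitted here.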