Abstract

Human emotions play a significant role in everyday life, and automatic emotion recognition has many applications in medicine, e-learning, monitoring, marketing, etc. In this paper, a method and neural network architecture for real-time human emotion recognition from audio-visual data are proposed. Deep neural networks, namely convolutional and recurrent networks, are used to classify one of seven emotions. Visual information is represented by a sequence of 16 frames of 96 × 96 pixels, and audio information by 140 features for each of a sequence of 37 temporal windows. An autoencoder is used to reduce the number of audio features. Combining audio information with visual information is shown to increase recognition accuracy by up to 12%. The developed system is undemanding in computing resources and is flexible in the selection of parameters, in reducing or increasing the number of emotion classes, and in the ability to easily add, accumulate, and use information from other external devices to further improve classification accuracy.
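To make the input dimensions concrete, the two streams described above can be sketched as follows. This is an illustrative sketch, not the authors' code: the 16 × 96 × 96 video tensor and the 37 × 140 audio tensor follow the abstract, while the encoder weights are random placeholders standing in for a trained autoencoder's encoder, and the code size of 64 is an assumption (the paper does not state it here).

```python
import numpy as np

rng = np.random.default_rng(0)

# Shapes from the paper: 16 grayscale frames of 96 x 96 pixels,
# and 140 audio features for each of 37 temporal windows.
video = rng.random((16, 96, 96))
audio = rng.random((37, 140))

# Hypothetical autoencoder encoder: reduces the 140 audio features
# per window to a smaller code before fusion with the visual stream.
W_enc = rng.standard_normal((140, 64)) * 0.1
audio_codes = np.tanh(audio @ W_enc)   # shape (37, 64)

print(video.shape, audio.shape, audio_codes.shape)
```

In the full system these two streams would feed a convolutional network (per frame) and a recurrent network (over the temporal dimension), respectively, before a joint classification over the seven emotion classes.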

Highlights

  • Artificial intelligence technologies are actively used in everyday life, including a wide range of software and hardware systems that can change their behavior during operation

  • Feature extraction based on deep learning often solves a given problem more effectively than a human, especially with large amounts of data, where it is necessary to detect non-linear relationships between variables

  • To use a video channel for emotion recognition, one must select a sequence of images where a face is present in each image, and all images are of good quality

Introduction

Artificial intelligence technologies are actively used in everyday life, including a wide range of software and hardware systems that can change their behavior during operation. Feature extraction based on deep learning often solves a given problem more effectively than a human, especially with large amounts of data, where it is necessary to detect non-linear relationships between variables. One such example is the recognition of faces and their individual details in images. In this paper, a system based on deep neural networks has been developed that recognizes human emotions in real time, with limited computing resources, from a video sequence containing both visual and speech information. An advantage of the system is its robustness, since it was trained mainly on raw examples rather than on video selected and prepared in a special way.

Related work
Proposed method
Image selection from video sequence
Collecting a dataset of blurry and clear images
Image type determination
Audio features selection
Parameter covariance
Autoencoder
The structure of neural networks
Analysis of the results
Conclusion
Findings
References
