Speech-based human emotion recognition

Talieh Seyed Tabtabae

doi:10.32920/ryerson.14651964

Abstract

Automatic Emotion Recognition (AER) is an emerging research area in the Human-Computer Interaction (HCI) field. As Computers are becoming more and more popular every day, the study of interaction between humans (users) and computers is catching more attention. In order to have a more natural and friendly interface between humans and computers, it would be beneficial to give computers the ability to recognize situations the same way a human does. Equipped with an emotion recognition system, computers will be able to recognize their users' emotional state and show the appropriate reaction to that. In today's HCI systems, machines can recognize the speaker and also content of the speech, using speech recognition and speaker identification techniques. If machines are equipped with emotion recognition techniques, they can also know "how it is said" to react more appropriately, and make the interaction more natural. One of the most important human communication channels is the auditory channel which carries speech and vocal intonation. In fact people can perceive each other's emotional state by the way they talk. Therefore in this work the speech signals are analyzed in order to set up an automatic system which recognizes the human emotional state. Six discrete emotional states have been considered and categorized in this research: anger, happiness, fear, surprise, sadness, and disgust. A set of novel spectral features are proposed in this contribution. Two approaches are applied and the results are compared. In the first approach, all the acoustic features are extracted from consequent frames along the speech signals. The statistical values of features are considered to constitute the features vectors. Suport Vector Machine (SVM), which is a relatively new approach in the field of machine learning is used to classify the emotional states. In the second approach, spectral features are extracted from non-overlapping logarithmically-spaced frequency sub-bands. In order to make use of all the extracted information, sequence discriminant SVMs are adopted. The empirical results show that the employed techniques are very promising.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speech-based human emotion recognition

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Speech-based human emotion recognition
Talieh Seyed Tabtabae
-
Talieh Seyed TabtabaeTalieh Seyed Tabtabae
08 Jun 2021
08 Jun 2021

Emotional speech Recognition using CNN and Deep learning techniques
C Hema ... Fausto Pedro Garcia Marquez
Applied Acoustics | VOL. 211
C Hema, et. al.C Hema ... Fausto Pedro Garcia Marquez
28 Jun 2023
Applied Acoustics | VOL. 211

A Hybrid Speech Emotion Recognition System Based on Spectral and Prosodic Features
Yu Zhou ... Junfeng Li
IEICE Transactions on Information and Systems | VOL. E93-D
Yu Zhou, et. al.Yu Zhou ... Junfeng Li
01 Jan 2009
IEICE Transactions on Information and Systems | VOL. E93-D

A Deep Learning Approach for Human Facial Expression Recognition using Residual Network – 101
Ranjana Kumari ... Javed Wasim
Journal of Current Science and Technology | VOL. 13
Ranjana Kumari, et. al.Ranjana Kumari ... Javed Wasim
30 Aug 2023
Journal of Current Science and Technology | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speech-based human emotion recognition

Abstract

Talk to us

Similar Papers