Abstract

We present a contribution to the Open Performance sub-challenge of the INTERSPEECH 2009 Emotion Challenge. We evaluate the feature extraction and classifier of EmoVoice, our framework for real-time emotion recognition from voice, on the challenge database and achieve competitive results. Furthermore, we explore the benefits of discretizing numeric acoustic features and find it beneficial in a multi-class task.

Index Terms: speech emotion recognition, discretization of features

1. Introduction

Emotion recognition from speech has made considerable advances in recent years. The number of research studies of emotional speech databases has grown, and the first applications and prototypes have been developed [1, 2]. Large EU projects (e.g. Callas and Semaine) push real-time emotion recognition. The real-time recognition of emotion in speech is also our goal, for which we have developed EmoVoice, our framework for real-time emotion recognition from voice [3], which has already been integrated into a number of prototypes and showcases. However, real-time processing sometimes requires accepting lower recognition accuracies compared to offline research systems. In this contribution to the Open Performance sub-challenge of the INTERSPEECH 2009 Emotion Challenge, we evaluate our methodology to assess whether it is competitive. Since our main focus lies on the acoustic features, we also explore whether a discretization of numeric acoustic features can make the classification problem easier. Though promising, discretization has not been investigated extensively so far. For instance, Casale and colleagues [4] achieved an improvement through feature discretization on two small databases with acted emotions. Here, we study the effects of discretization on a large database with spontaneous emotions, namely the Challenge database.

The rest of this paper is organized as follows: first, we briefly characterize the challenge database. Next we present our methodology for speech emotion recognition, which includes feature extraction, feature selection and classification. Afterwards, we present and discuss our results on the database with two pre-processing strategies.
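The introduction does not specify which discretization scheme is used; as one illustration of the general idea, the following sketch bins a numeric acoustic feature (here a hypothetical per-utterance mean-pitch statistic) into a small number of equal-width intervals, turning a continuous attribute into a categorical one that a classifier can treat symbolically. The function name, the bin count, and the example values are assumptions for illustration only, not the method evaluated in the paper.

```python
import numpy as np

def discretize_equal_width(values, n_bins=4):
    """Map each numeric feature value to a discrete bin index
    (0 .. n_bins-1) using equal-width binning over the observed range."""
    values = np.asarray(values, dtype=float)
    lo, hi = values.min(), values.max()
    if hi == lo:
        # Constant feature: everything falls into a single bin
        return np.zeros(len(values), dtype=int)
    edges = np.linspace(lo, hi, n_bins + 1)
    # Digitizing against the interior edges yields indices 0 .. n_bins-1
    return np.digitize(values, edges[1:-1])

# Hypothetical per-utterance mean pitch values in Hz
pitch_mean = [110.0, 145.0, 200.0, 310.0, 305.0]
print(discretize_equal_width(pitch_mean, n_bins=4).tolist())  # → [0, 0, 1, 3, 3]
```

Supervised alternatives (e.g. entropy-based binning, which chooses cut points to maximize class purity) are common in the discretization literature and may behave quite differently from this unsupervised equal-width variant.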
