Abstract

As a general means of expression, audio has attracted much attention for analysis and recognition owing to its wide range of real-world applications. Audio emotion recognition (AER) attempts to infer the emotional state of a speaker from a given utterance signal, and has been studied broadly for the development of friendly human–machine interfaces. Although several state-of-the-art auditory methods have been devised for audio recognition, most of them focus on the discriminative use of acoustic features, while the efficiency demanded by recognition feedback is ignored. This limits the practical application of AER, for which rapid learning of emotion patterns is desired. To make the prediction of audio emotion feasible, speaker-dependent patterns of audio emotions are learned through multiresolution analysis, and fractal dimension (FD) features are computed for acoustic feature extraction. The method is thus able to learn the intrinsic characteristics of auditory emotions efficiently, with the utterance features derived from the FDs of each sub-band. Experimental results show that the proposed method provides comparable performance for AER.
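
As an illustration of the sub-band FD feature extraction described above, the sketch below decomposes an utterance into wavelet sub-bands and estimates one fractal dimension per band. The abstract does not specify which multiresolution analysis or FD estimator the authors use; the db4 wavelet decomposition (via PyWavelets) and the Higuchi FD estimator here are assumptions chosen only for illustration.

    import numpy as np
    import pywt  # assumed multiresolution analysis backend, not named in the paper

    def higuchi_fd(x, k_max=8):
        """Estimate the Higuchi fractal dimension of a 1-D signal (illustrative estimator)."""
        x = np.asarray(x, dtype=float)
        n = len(x)
        lk = []
        for k in range(1, k_max + 1):
            lengths = []
            for m in range(k):
                idx = np.arange(m, n, k)
                if len(idx) < 2:
                    continue
                # normalized curve length for this offset and scale k
                length = np.sum(np.abs(np.diff(x[idx]))) * (n - 1) / ((len(idx) - 1) * k) / k
                lengths.append(length)
            lk.append(np.mean(lengths))
        # FD is the slope of log(L(k)) versus log(1/k)
        slope, _ = np.polyfit(np.log(1.0 / np.arange(1, k_max + 1)), np.log(lk), 1)
        return slope

    def subband_fd_features(signal, wavelet="db4", level=4, k_max=8):
        """Decompose the signal into wavelet sub-bands and compute one FD per band."""
        bands = pywt.wavedec(signal, wavelet, level=level)
        return np.array([higuchi_fd(band, k_max) for band in bands])

    # toy usage: a 1-second synthetic "utterance" sampled at 16 kHz
    fs = 16000
    t = np.linspace(0, 1, fs, endpoint=False)
    utterance = np.sin(2 * np.pi * 220 * t) + 0.3 * np.random.randn(fs)
    print(subband_fd_features(utterance))  # one FD value per sub-band

The resulting per-band FD vector would then serve as the acoustic feature input to whatever speaker-dependent emotion classifier is trained downstream.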
