Abstract

Speech emotion recognition is an important part of human–computer interaction, and extracting speech emotion features that allow computers to analyze emotions with high recognition rates is a key step. We applied the Fractional Fourier Transform (FrFT) to extract MFCC features and combined them with a deep learning method for speech emotion recognition. Since the performance of the FrFT depends on the transform order p, we used an ambiguity function to determine the optimal order for each frame of speech, and the MFCCs were extracted under that optimal order. Finally, the extracted features were fed to an LSTM network for speech emotion recognition. Our experiments were conducted on the RAVDESS dataset, and detailed confusion matrices and accuracies are given for analysis. The MFCCs extracted using the FrFT showed better performance than those based on the ordinary Fourier transform, and the proposed model achieved a weighted accuracy of 79.86%.
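
The classification stage pairs frame-level MFCC sequences with an LSTM. Below is a minimal PyTorch sketch of such a classifier; the layer sizes, dropout, 13 MFCC coefficients per frame, and the 8-class output (matching the RAVDESS emotion labels) are illustrative assumptions rather than the authors' exact configuration, and the FrFT-based MFCC extraction is treated as an upstream step that produces the input tensor.

```python
import torch
import torch.nn as nn

class EmotionLSTM(nn.Module):
    """LSTM classifier over a sequence of frame-level MFCC vectors."""
    def __init__(self, n_mfcc: int = 13, hidden: int = 128, n_classes: int = 8):
        super().__init__()
        # Stacked LSTM reads the utterance frame by frame.
        self.lstm = nn.LSTM(input_size=n_mfcc, hidden_size=hidden,
                            num_layers=2, batch_first=True, dropout=0.3)
        # Linear head maps the final hidden state to emotion-class scores.
        self.fc = nn.Linear(hidden, n_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, frames, n_mfcc) -> out: (batch, frames, hidden)
        out, _ = self.lstm(x)
        # Use the last time step as the utterance-level representation.
        return self.fc(out[:, -1, :])

# Example: a batch of 4 utterances, 200 frames each, 13 MFCCs per frame
# (in practice these would be the FrFT-based MFCCs, not random values).
model = EmotionLSTM()
mfcc_batch = torch.randn(4, 200, 13)
logits = model(mfcc_batch)  # shape (4, 8): one score per emotion class
```

Taking only the final LSTM output as the utterance representation is one common design choice; pooling over all time steps is an equally plausible alternative.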
