Fuzzy-based voiced-unvoiced segmentation for emotion recognition using spectral feature fusions

Yusnita Mohd Ali,Zuhaila Mat Yassin,Mohamad Helmy Ramlan,Alhan Farhanah Abd Rahim,Nor Fadzilah Mokhtar,Emilia Noorsal

doi:10.11591/ijeecs.v19.i1.pp196-206

Abstract

Despite abundant growth in automatic emotion recognition system (ERS) studies using various techniques in feature extractions and classifiers, scarce sources found to improve the system via pre-processing techniques. This paper proposed a smart pre-processing stage using fuzzy logic inference system (FIS) based on Mamdani engine and simple time-based features i.e. zero-crossing rate (ZCR) and short-time energy (STE) to initially identify a frame as voiced (V) or unvoiced (UV). Mel-frequency cepstral coefficients (MFCC) and linear prediction coefficients (LPC) were tested with K-nearest neighbours (KNN) classifiers to evaluate the proposed FIS V-UV segmentation. We also introduced two feature fusions of MFCC and LPC with formants to obtain better performance. Experimental results of the proposed system surpassed the conventional ERS which yielded a rise in accuracy rate from 3.7% to 9.0%. The fusion of LPC and formants named as SFF LPC-fmnt indicated a promising result between 1.3% and 5.1% higher accuracy rate than its baseline features in classifying between neutral, angry, happy and sad emotions. The best accuracy rates yielded for male and female speakers were 79.1% and 79.9% respectively using SFF MFCC-fmnt fusion technique.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Fuzzy-based voiced-unvoiced segmentation for emotion recognition using spectral feature fusions

Abstract

Talk to us

Similar Papers

More From: Indonesian Journal of Electrical Engineering and Computer Science

Lead the way for us

Journal: Indonesian Journal of Electrical Engineering and Computer Science	Publication Date: Jul 1, 2020
License type: CC BY-NC 4.0

Similar Papers

Detecting depression in speech: Comparison and combination between different speech types
Hailiang Long ... Bin Hu
-
Hailiang Long, et. al.Hailiang Long ... Bin Hu
01 Nov 2017
01 Nov 2017

Speech Emotion Recognition Using Convolutional Neural Networks on Spectrograms and Mel-frequency Cepstral Coefficients Images
Sambhavi Mukherjee ... Ankit Mundra
-
Sambhavi Mukherjee, et. al.Sambhavi Mukherjee ... Ankit Mundra
01 Jan 2023
01 Jan 2023

Performance Analysis of Malayalam Language Speech Emotion Recognition System Using ANN/SVM
T.M Rajisha ... K.S Riyas
Procedia Technology | VOL. 24
T.M Rajisha, et. al.T.M Rajisha ... K.S Riyas
01 Jan 2015
Procedia Technology | VOL. 24

Computer aided recognition of pathological voice
Manal Abdel Wahed
-
Manal Abdel WahedManal Abdel Wahed
01 Apr 2014
01 Apr 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Fuzzy-based voiced-unvoiced segmentation for emotion recognition using spectral feature fusions

Abstract

Talk to us

Similar Papers

More From: Indonesian Journal of Electrical Engineering and Computer Science