An Encapsulation of Vital Non-Linear Frequency Features for Various Speech Applications

S Lalitha, Deepa Gupta

doi:10.1166/jctn.2020.8666

Abstract

Mel frequency cepstral coefficients (MFCCs) and perceptual linear prediction coefficients (PLPCs) are widely used nonlinear vocal parameters in most speaker identification, speaker recognition, and speech recognition techniques, as well as in emotion recognition. Since the 1980s, significant effort has gone into improving these features. Factors such as the choice of frequency estimation approach, the design of the filter bank, and the selection of preferred features play a vital part in the robustness of models that employ them. This article presents an overview of MFCC and PLPC features for different speech applications. Aspects such as accuracy metrics, background environment, type of data, and feature size are examined and summarized with the corresponding key references. In addition, the advantages and shortcomings of these features are discussed. This background work will hopefully contribute a first step toward enhancing MFCC and PLPC features with respect to novelty, higher accuracy, and lower complexity.
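As a rough illustration of the non-linear frequency warping and filter-bank placement mentioned above, the following minimal sketch spaces triangular-filter centre frequencies evenly on the mel scale. It is not taken from the article: the 2595/700 mel formula is one common convention among several, and the function names, the filter count of 10, and the 0 to 8000 Hz range are assumptions chosen only for illustration.

```python
import math


def hz_to_mel(f_hz):
    """Convert Hz to mel using one common convention (2595 * log10(1 + f/700))."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel under the same convention."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_center_frequencies(n_filters, f_min, f_max):
    """Centre frequencies (Hz) of n_filters filters spaced evenly in mel.

    The band [f_min, f_max] is divided into n_filters + 1 equal mel steps;
    the interior points are the filter centres.
    """
    mel_min = hz_to_mel(f_min)
    mel_max = hz_to_mel(f_max)
    step = (mel_max - mel_min) / (n_filters + 1)
    return [mel_to_hz(mel_min + step * (i + 1)) for i in range(n_filters)]


# Hypothetical settings: 10 filters between 0 Hz and 8000 Hz.
centers = mel_center_frequencies(10, 0.0, 8000.0)

# Spacing in Hz between neighbouring centres; it widens with frequency,
# which is the non-linear resolution the mel scale is meant to provide.
gaps = [b - a for a, b in zip(centers, centers[1:])]

if __name__ == "__main__":
    for c in centers:
        print(round(c, 1))
```

Equal steps in mel translate into progressively wider steps in Hz, so low frequencies receive finer resolution than high ones. A full MFCC pipeline would additionally apply these filters to a power spectrum, take logarithms, and apply a discrete cosine transform; those stages are omitted here.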

Journal: Journal of Computational and Theoretical Nanoscience
Publication Date: Jan 1, 2020
Citations: 3

Similar Papers

Non-intrusive objective speech quality assessment using a combination of MFCC, PLP and LSF features
Rajesh Kumar Dubey ... Arun Kumar
01 Dec 2013

Effects of the Dynamic and Energy Based Feature Extraction on Hindi Speech Recognition
Shobha Bhatt ... Amita Dev
Recent Advances in Computer Science and Communications | VOL. 14
30 Aug 2021

Multiple windowed spectral features for emotion recognition
Yazid Attabi ... Douglas O'Shaughnessy
01 May 2013

Text-Independent Speaker Identification Through Feature Fusion and Deep Neural Network
Rashid Jahangir ... Nisar Ahmed Memon
IEEE Access | VOL. 8
01 Jan 2020
