Bark Frequency Cepstral Coefficients Research Articles

In the current study, we explore the factors underlying the well-known difficulty in acoustic classification of front nonsibilant fricatives (Maniwa, Jongman & Wade 2009, McMurray & Jongman 2011) by applying a novel classification method to the production of Greek speakers. The Greek fricative inventory [f v θ ð s z ç ʝ x ɣ] includes voiced and voiceless segments from five distinct places of articulation. Our corpus contains all of the Greek fricatives produced by 29 monolingual speakers, but our focus is on the distinction between the front nonsibilant fricatives [f v θ ð]. For comparison, we also discuss the other places of articulation where relevant. We apply a relatively novel classification method based on cepstral coefficients, previously successful in categorizing English obstruent bursts (Bunnell, Polikoff & McNicholas 2004), English vowels (Ferragne & Pellegrino 2010), Romanian fricatives (Spinu & Lilley 2016), and Russian fricatives (Spinu, Kochetov & Lilley 2018). For this study, fricative boundaries were automatically aligned using Hidden Markov Models (HMMs) and then manually checked. Six Bark-frequency cepstral coefficients (c0–c5) were extracted from 20-millisecond Hann windows. HMMs were used to divide the fricatives and adjacent vowels into three regions of internally minimized variance. A multinomial logistic regression analysis then used the mean cepstral coefficients from each region as predictors for classification by consonant identity. Our method yields highly successful classification rates, exceeding the performance of previous methods. We discuss these results in light of the differences of the phonemic distributions of fricatives between English and Greek.

Read full abstract

Speaker verification (SV) systems involve mainly two individual stages: feature extraction and classification. In this paper, we explore these two modules with the aim of improving the performance of a speaker verification system under noisy conditions. On the one hand, the choice of the most appropriate acoustic features is a crucial factor for performing robust speaker verification. The acoustic parameters used in the proposed system are: Mel Frequency Cepstral Coefficients, their first and second derivatives (Deltas and Delta–Deltas), Bark Frequency Cepstral Coefficients, Perceptual Linear Predictive, and Relative Spectral Transform Perceptual Linear Predictive. In this paper, a complete comparison of different combinations of the previous features is discussed. On the other hand, the major weakness of a conventional support vector machine (SVM) classifier is the use of generic traditional kernel functions to compute the distances among data points. However, the kernel function of an SVM has great influence on its performance. In this work, we propose the combination of two SVM-based classifiers with different kernel functions: linear kernel and Gaussian radial basis function kernel with a logistic regression classifier. The combination is carried out by means of a parallel structure approach, in which different voting rules to take the final decision are considered. Results show that significant improvement in the performance of the SV system is achieved by using the combined features with the combined classifiers either with clean speech or in the presence of noise. Finally, to enhance the system more in noisy environments, the inclusion of the multiband noise removal technique as a preprocessing stage is proposed.

Read full abstract

Bark Frequency Cepstral Coefficients Research Articles

Related Topics

Articles published on Bark Frequency Cepstral Coefficients

Bark frequency cepstral coefficient based sadness emotion level recognition system

Comparative study of respiratory sounds classification methods based on cepstral analysis and artificial neural networks

End-to-end Multi-modal Low-resourced Speech Keywords Recognition Using Sequential Conv2D Nets

Using multi-audio feature fusion for android malware detection

BarkDroid: Android Malware Detection Using Bark Frequency Cepstral Coefficients

Spectral features and optimal Hierarchical attention networks for pulmonary abnormality detection from the respiratory sound signals

Voice Analysis and Classification System Based on Perturbation Parameters and Cepstral Presentation in Psychoacoustic Scales

Speech stress recognition using semi-eager learning

Exploring the front fricative contrast in Greek: A study of acoustic variability based on cepstral coefficients

Real-Time Speech Enhancement Algorithm Based on Attention LSTM

Infant cry language analysis and recognition: an experimental approach

Heart Sound Diagnose System with BFCC, MFCC, and Backpropagation Neural Network

Enhanced speech emotion detection using deep neural networks

Voice Verification System Based on Bark-frequency Cepstral Coefficient

Enhancement of a text-independent speaker verification system by using feature combination and parallel structure classifiers

Comparative Analysis of LPCC, MFCC and BFCC for the Recognition of Hindi Words using Artificial Neural Networks

A FRAMEWORK FOR MULTILINGUAL TEXT- INDEPENDENT SPEAKER IDENTIFICATION SYSTEM

Investigation of distance effect on Gaussian Mixture Models in Speaker Identification

Automated Speaker Recognition for Home Service Robots Using Genetic Algorithm and Dempster–Shafer Fusion Technique

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Bark Frequency Cepstral Coefficients Research Articles

Related Topics

Articles published on Bark Frequency Cepstral Coefficients

Bark frequency cepstral coefficient based sadness emotion level recognition system

Comparative study of respiratory sounds classification methods based on cepstral analysis and artificial neural networks

End-to-end Multi-modal Low-resourced Speech Keywords Recognition Using Sequential Conv2D Nets

Using multi-audio feature fusion for android malware detection

BarkDroid: Android Malware Detection Using Bark Frequency Cepstral Coefficients

Spectral features and optimal Hierarchical attention networks for pulmonary abnormality detection from the respiratory sound signals

Voice Analysis and Classification System Based on Perturbation Parameters and Cepstral Presentation in Psychoacoustic Scales

Speech stress recognition using semi-eager learning

Exploring the front fricative contrast in Greek: A study of acoustic variability based on cepstral coefficients

Real-Time Speech Enhancement Algorithm Based on Attention LSTM

Infant cry language analysis and recognition: an experimental approach

Heart Sound Diagnose System with BFCC, MFCC, and Backpropagation Neural Network

Enhanced speech emotion detection using deep neural networks

Voice Verification System Based on Bark-frequency Cepstral Coefficient

Enhancement of a text-independent speaker verification system by using feature combination and parallel structure classifiers

Comparative Analysis of LPCC, MFCC and BFCC for the Recognition of Hindi Words using Artificial Neural Networks

A FRAMEWORK FOR MULTILINGUAL TEXT- INDEPENDENT SPEAKER IDENTIFICATION SYSTEM

Investigation of distance effect on Gaussian Mixture Models in Speaker Identification

Automated Speaker Recognition for Home Service Robots Using Genetic Algorithm and Dempster–Shafer Fusion Technique