Robust Front-End Based on MVA and HEQ Post-processing for Arabic Speech Recognition Using Hidden Markov Model Toolkit (HTK)

Elhem Techini,Zied Sakka,Medsalim Bouhlel

doi:10.1109/aiccsa.2017.180

Abstract

This paper describes a study of a set of features based on cepstral mean and variance normalization (CMVN) plus auto regressive moving average (ARMA) filtering technique which is called MVA and on histogram equalization (HEQ) for robust speech recognition. First, we use MVA then HEQ in combination with CMVN and ARMA filtering as a post-processing module to mel frequency cepstral coefficients (MFCC), Relative Spectral-Perceptual linear prediction (RASTA-PLP) and power normalized cepstral coefficients (PNCC) features to improve the performance of the automatic speech recognition (ASR) system. The results on the Arabic database task have shown that both methods MVA and HEQ+ARMA improves the success rate for all features compared to the baseline system however HEQ was not found to perform better than MVA. The results also provide that RASTA-PLP outperforms PNCC and MFCC features.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Robust Front-End Based on MVA and HEQ Post-processing for Arabic Speech Recognition Using Hidden Markov Model Toolkit (HTK)

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Robust front-end based on MVA processing for Arabic speech recognition
Elhem Techini ... Medsalim Bouhlel
-
Elhem Techini, et. al.Elhem Techini ... Medsalim Bouhlel
01 May 2017
01 May 2017

Chapter 7 - Closed-set speaker identification system based on MFCC and PNCC features combination with different fusion strategies
Musab T.S Al-Kaltakchi ... Satnam S Dlay
Applied Speech Processing | VOL. -
Musab T.S Al-Kaltakchi, et. al.Musab T.S Al-Kaltakchi ... Satnam S Dlay
01 Jan 2020
Applied Speech Processing | VOL. -

Study of fusion strategies and exploiting the combination of MFCC and PNCC features for robust biometric speaker identification
M.T.S Al-Kaltakchi ... J A Chambers
-
M.T.S Al-Kaltakchi, et. al.M.T.S Al-Kaltakchi ... J A Chambers
01 Mar 2016
01 Mar 2016

Detecting keywords in Persian conversational telephony speech using a discriminative English keyword spotter
Akram Shokri ... Ahmad Akbari
-
Akram Shokri, et. al.Akram Shokri ... Ahmad Akbari
01 Dec 2013
01 Dec 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Robust Front-End Based on MVA and HEQ Post-processing for Arabic Speech Recognition Using Hidden Markov Model Toolkit (HTK)

Abstract

Talk to us

Similar Papers