Moving average multi directional local features for speaker recognition

Awais Mahmood,Esam M Asem Othman,Ghulam Muhammad,Habib Dhahri,Mohammed Faisal,Mansour Alsulaiman

doi:10.1007/s10586-018-2030-5

Abstract

A new speech feature extraction technique called moving average multi directional local features (MA-MDLF) is presented in this paper. This method is based on linear regression (LR) and moving average (MA) in the time–frequency plane. Three-point LR is taken along time axis and frequency axis, and 3 points MA is taken along 45° and 135° in the time–frequency plane. The LR captures the voice onset\offset, formant contour, while the moving average captures the dynamics on time–frequency axes which can be seen as voiceprints. The MA-MDLF performance is compared to commonly used speech features in speaker recognition. The comparison is performed in a speaker recognition system (SRS) for three different conditions, namely clean speech, mobile speech, and cross channel. MA-MDLF has shown better performance than the baseline MFCC, RASTA-PLP and LPCC. In clean and mobile speech, MA-MDLF feature performs the best and also in the cross channel task MA-MDLF performed excellent. We also evaluated the MA-MDLF using three speech databases, namely KSU, LDC Babylon and TIMITdatabases, and found that MA-MDLF outperformed the other commonly used features with speech from all the three databases. The first and second databases are for Arabic speech while third is for English speech.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Moving average multi directional local features for speaker recognition

Abstract

Talk to us

Similar Papers

More From: Cluster Computing

Lead the way for us

Journal: Cluster Computing	Publication Date: Feb 23, 2018
Citations: 4

Similar Papers

Multidirectional Local Feature for Speaker Recognition
Awais Mahmood ... Mansour Alsulaiman
-
Awais Mahmood, et. al.Awais Mahmood ... Mansour Alsulaiman
01 Feb 2012
01 Feb 2012

Analyzing Noise Robustness of Cochleogram and Mel Spectrogram Features in Deep Learning Based Speaker Recognition
Wondimu Lambamo ... Ramasamy Srinivasagan
Applied Sciences | VOL. 13
Wondimu Lambamo, et. al.Wondimu Lambamo ... Ramasamy Srinivasagan
31 Dec 2022
Applied Sciences | VOL. 13

Boosting Localized Features for Speaker and Speech Recognition

-

01 Jan 2010
01 Jan 2010

Speaker Recognition with VAD
Jian Ling ... Jianwei Zhu
-
Jian Ling, et. al.Jian Ling ... Jianwei Zhu
01 Jun 2009
01 Jun 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Moving average multi directional local features for speaker recognition

Abstract

Talk to us

Similar Papers

More From: Cluster Computing