A perceptually-motivated low-complexity instantaneous linear channel normalization technique applied to speaker verification

Victor Poblete, Felipe Espic, Simon King, Richard M Stern, Fernando Huenupán, Josué Fredes, Nestor Becerra Yoma

doi:10.1016/j.csl.2014.10.006

Abstract

This paper proposes a new set of speech features called Locally-Normalized Cepstral Coefficients (LNCC) that are based on Seneff's Generalized Synchrony Detector (GSD). First, an analysis of the GSD frequency response is provided to show that it generates spurious peaks at harmonics of the detected frequency. Then, the GSD frequency response is modeled as a quotient of two filters centered at the detected frequency. The numerator is a triangular band-pass filter centered on a particular frequency, similar to the ordinary Mel filters. The denominator is a filter that responds maximally to frequency components on either side of the numerator filter. As a result, a local normalization is performed without the spurious peaks of the original GSD. Speaker verification results demonstrate that the proposed LNCC features are of low computational complexity and compensate for spectral tilt far more effectively than ordinary MFCC coefficients. LNCC features do not require the computation and storage of a moving average of the feature values, and they provide relative reductions in Equal Error Rate (EER) as high as 47.7%, 34.0%, and 25.8% when compared with MFCC, MFCC+CMN, and MFCC+RASTA, respectively, in one case of variable spectral tilt.
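The quotient-of-two-filters idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the denominator shape in `side_filter`, the band centers, the half-width, and the toy spectrum are all assumptions chosen for illustration, and the final cepstral transform is left out. The sketch only shows why dividing a band's energy by the energy of a neighbouring-frequency filter cancels a channel gain within the same frame, with no moving average.

```python
import numpy as np

def triangular_filter(freqs, center, half_width):
    """Numerator: triangular band-pass weights, 1 at `center`, 0 beyond `half_width`."""
    return np.clip(1.0 - np.abs(freqs - center) / half_width, 0.0, None)

def side_filter(freqs, center, half_width):
    """Denominator (assumed shape): weights grow toward the band edges, 0 outside the band."""
    distance = np.abs(freqs - center) / half_width
    return np.where(distance <= 1.0, distance, 0.0)

def plain_log_energies(power, freqs, centers, half_width):
    """Ordinary log filter-bank energies, for comparison."""
    return np.array([np.log(np.sum(triangular_filter(freqs, c, half_width) * power))
                     for c in centers])

def locally_normalized_log_energies(power, freqs, centers, half_width):
    """Log of the numerator/denominator energy ratio in each band."""
    feats = []
    for c in centers:
        num = np.sum(triangular_filter(freqs, c, half_width) * power)
        den = np.sum(side_filter(freqs, c, half_width) * power)
        feats.append(np.log(num / den))
    return np.array(feats)

# Toy power spectrum (random but strictly positive) on a 0-8 kHz grid.
rng = np.random.default_rng(0)
freqs = np.linspace(0.0, 8000.0, 257)
power = rng.uniform(0.5, 1.5, size=freqs.size)
centers = np.array([1000.0, 2000.0, 3000.0])  # hypothetical band centers in Hz
half_width = 400.0                            # hypothetical band half-width in Hz

gain = 10.0  # a flat channel gain, the simplest linear channel
plain_shift = (plain_log_energies(gain * power, freqs, centers, half_width)
               - plain_log_energies(power, freqs, centers, half_width))
local_shift = (locally_normalized_log_energies(gain * power, freqs, centers, half_width)
               - locally_normalized_log_energies(power, freqs, centers, half_width))
```

In this toy setting `plain_shift` equals log(10) in every band, while `local_shift` is zero to within rounding, because the gain multiplies numerator and denominator alike. A channel response that is only roughly constant across each band, such as a gentle spectral tilt, would cancel approximately rather than exactly.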

Journal: Computer Speech & Language
Publication Date: Nov 7, 2014
Citations: 8

Similar Papers

Speaker verification based on fusion of acoustic and articulatory information
Ming Li ... Vikram Ramanarayanan
25 Aug 2013

Development of TEO phase for speaker recognition
Hemant A Patil ... Keshab K Parhi
01 Jul 2010

Emotion attribute projection for speaker recognition on emotional speech
Huanjun Bao ... Thomas Fang Zheng
27 Aug 2007

Sentence‐HMM state‐based i‐vector/PLDA modelling for improved performance in text dependent single utterance speaker verification
Osman Büyük
IET Signal Processing | VOL. 10
01 Oct 2016
