A Speaker Verification Method Based on TDNN–LSTMP

Hui Liu,Longlian Zhao

doi:10.1007/s00034-019-01092-3

Abstract

In speaker recognition, a robust recognition method is essential. This paper proposes a speaker verification method that is based on the time-delay neural network (TDNN) and long short-term memory with recurrent project layer (LSTMP) model for the speaker modeling problem in speaker verification. In this work, we present the application of the fusion of TDNN and LSTMP to the i-vector speaker recognition system that is based on the Gaussian mixture model-universal background model. By using a model that can establish long-term dependencies to create a universal background model that contains a larger amount of speaker information, it is possible to extract more feature parameters, which are speaker dependent, from the speech signal. We conducted experiments with this method on four corpora: two in Chinese and two in English. The equal error rate, minimum detection cost function and detection error tradeoff curve are used as criteria for system performance evaluation. The experimental results show that the TDNN–LSTMP/i-vector speaker recognition method outperforms the baseline system on both Chinese and English corpora and has better robustness.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Speaker Verification Method Based on TDNN–LSTMP

Abstract

Talk to us

Similar Papers

More From: Circuits, Systems, and Signal Processing

Lead the way for us

Journal: Circuits, Systems, and Signal Processing	Publication Date: Mar 20, 2019
Citations: 7

Similar Papers

Multi-task deep cross-attention networks for far-field speaker verification and keyword spotting
Xingwei Liang ... Ruifeng Xu
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2023
Xingwei Liang, et. al.Xingwei Liang ... Ruifeng Xu
01 Jul 2023
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2023

Discriminative Universal Background Model Training for Speaker Recognition
Wei-Qiang Zhang ... Jia Liu
-
Wei-Qiang Zhang, et. al.Wei-Qiang Zhang ... Jia Liu
21 Jun 2011
21 Jun 2011

Wavelet based dynamic Mel Frequency Cepstral Coefficients (MFCC) and block truncation techniques for efficient speaker identification under narrowband noise conditions
...
International Journal of the Physical Sciences | VOL. 8
, et. al. ...
23 Sep 2013
International Journal of the Physical Sciences | VOL. 8

Influence of G729 Speech Coding on Automatic Speaker Recognition in VoIP Applications
Dalila Yessad ... Mohamed Debyeche
-
Dalila Yessad, et. al.Dalila Yessad ... Mohamed Debyeche
10 Dec 2011
10 Dec 2011

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Speaker Verification Method Based on TDNN–LSTMP

Abstract

Talk to us

Similar Papers

More From: Circuits, Systems, and Signal Processing