Phoneme dependent inter-session variability reduction for speaker verification

Haoze Lu,Yasuo Horiuchi,Shingo Kuroiwa,Wenbin Zhang

doi:10.1504/ijbm.2015.070922

Abstract

GMM-UBM super-vectors will potentially lead to worse modelling for speaker verification due to the inter-session variability, especially when a small amount of training utterances were available. In this study, we propose a phoneme dependent method to suppress the inter-session variability. A speaker's model can be represented by several various phoneme Gaussian mixture models. Each of them covers an individual phoneme whose inter-session variability can be constrained in an inter-session independent subspace constructed by principal component analysis PCA, and it uses corpus uttered by a single speaker that has been recorded over a long period. SVM-based experiments performed using a large corpus, constructed by the National Research Institute of Police Science NRIPS to evaluate Japanese speaker recognition, and demonstrate the improvements gained from the proposed method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Phoneme dependent inter-session variability reduction for speaker verification

Abstract

Talk to us

Similar Papers

More From: International Journal of Biometrics

Lead the way for us

Similar Papers

Experiments in Speaker Adaptation for Factor Analysis Based Speaker Verification
Shou-Chun Yin ... Richard Rose
-
Shou-Chun Yin, et. al.Shou-Chun Yin ... Richard Rose
01 Jun 2006
01 Jun 2006

Speaker Model and Decision Threshold Updating in Speaker Verification
M Mehdi Homayounpour
-
M Mehdi HomayounpourM Mehdi Homayounpour
01 Jan 2002
01 Jan 2002

Model compression for GMM based speaker recognition systems
Douglas A Reynolds
-
Douglas A ReynoldsDouglas A Reynolds
01 Sep 2003
01 Sep 2003

Combining evidences from magnitude and phase information using VTEO for person recognition using humming
Hemant A Patil ... Maulik C Madhavi
Computer Speech & Language | VOL. 52
Hemant A Patil, et. al.Hemant A Patil ... Maulik C Madhavi
15 Sep 2017
Computer Speech & Language | VOL. 52

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Phoneme dependent inter-session variability reduction for speaker verification

Abstract

Talk to us

Similar Papers

More From: International Journal of Biometrics