Abstract

We propose a practical, feature-level fusion approach for combining acoustic and articulatory information in the speaker verification task. We find that concatenating articulatory features obtained from measured speech production data with conventional Mel-frequency cepstral coefficients (MFCCs) improves overall speaker verification performance. However, since access to measured articulatory data is impractical for real-world speaker verification applications, we also experiment with estimated articulatory features obtained using an acoustic-to-articulatory inversion technique. Specifically, we show that augmenting MFCCs with articulatory features obtained from a subject-independent acoustic-to-articulatory inversion technique also significantly enhances speaker verification performance. This performance boost could be due to the information about inter-speaker variation present in the estimated articulatory features, especially at the mean and variance level. Experimental results on the Wisconsin X-Ray Microbeam database show that the proposed acoustic-estimated-articulatory fusion approach significantly outperforms the traditional acoustic-only baseline, providing up to a 10% relative reduction in Equal Error Rate (EER). We further show that we can achieve an additional 5% relative reduction in EER after score-level fusion.

Index Terms: speech production, speaker verification, articulation features, acoustic-to-articulatory inversion, biometrics
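To make the feature-level fusion concrete, the sketch below shows frame-wise concatenation of MFCCs with estimated articulatory trajectories, assuming the two streams are already frame-aligned. The dimensions, the dummy data, and the function name are illustrative assumptions, not the authors' exact pipeline.

```python
# Minimal sketch of feature-level acoustic-articulatory fusion: concatenate
# frame-aligned MFCCs with articulatory trajectories estimated by a
# (hypothetical) subject-independent acoustic-to-articulatory inversion model.
import numpy as np

def fuse_features(mfcc: np.ndarray, artic: np.ndarray) -> np.ndarray:
    """Concatenate acoustic and articulatory features frame by frame.

    mfcc  : (T, D_mfcc) MFCC matrix, e.g. 13 static coefficients plus deltas
    artic : (T, D_art)  estimated articulatory trajectories per frame
    returns (T, D_mfcc + D_art) fused feature matrix
    """
    assert mfcc.shape[0] == artic.shape[0], "feature streams must be frame-aligned"
    return np.hstack([mfcc, artic])

# Example with dummy data: 300 frames of 39-dim MFCCs and 12-dim articulatory
# estimates (e.g., x/y positions of six articulator points).
mfcc = np.random.randn(300, 39)
artic = np.random.randn(300, 12)
fused = fuse_features(mfcc, artic)
print(fused.shape)  # (300, 51)
```

The fused matrix would then feed the verification back end in place of the MFCC-only features; score-level fusion, as mentioned above, would instead combine the scores of separately trained acoustic and articulatory systems.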
