Abstract

Articulatory movement features (AMFs) capture the position and motion of articulators such as the tongue, jaw, and lips during speech production. This paper investigates the use of AMFs for short-duration text-dependent speaker verification. AMFs directly characterize the relative motion trajectories of an individual speaker's articulators and are largely unaffected by the external acoustic environment. We therefore expect AMFs to be better suited than traditional acoustic features, such as mel-frequency cepstral coefficients (MFCCs), to characterizing the identity differences between speakers. Speaker similarity scores computed with the dynamic time warping (DTW) algorithm are used to make the verification decisions. Experimental results show that AMFs yield significant performance gains over traditional MFCC features on the short-duration text-dependent speaker verification task.
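
The abstract names DTW as the scoring method but gives no implementation details. The sketch below is a minimal illustration, not the paper's implementation, of how a length-normalized DTW cost could be computed between an enrollment and a test feature sequence (frame-level AMF or MFCC vectors) and thresholded into an accept/reject decision; the function names, the Euclidean frame distance, and the threshold value are assumptions made for illustration.

```python
# Minimal sketch: DTW-based similarity scoring between two feature sequences,
# e.g. frame-level MFCC or articulatory movement features (AMFs).
import numpy as np


def dtw_cost(ref: np.ndarray, test: np.ndarray) -> float:
    """Accumulated DTW alignment cost between two (num_frames, feat_dim)
    sequences, normalized by the combined sequence length."""
    n, m = len(ref), len(test)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = np.linalg.norm(ref[i - 1] - test[j - 1])  # Euclidean frame distance (assumed)
            cost[i, j] = d + min(cost[i - 1, j],       # insertion
                                 cost[i, j - 1],       # deletion
                                 cost[i - 1, j - 1])   # match
    return cost[n, m] / (n + m)


def verify(ref_feats: np.ndarray, test_feats: np.ndarray, threshold: float) -> bool:
    """Accept the claimed identity when the normalized DTW cost is below the
    threshold, i.e. the two utterances of the fixed phrase are similar enough."""
    return dtw_cost(ref_feats, test_feats) < threshold


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    enroll = rng.normal(size=(120, 13))  # enrollment utterance, e.g. 13-dim MFCC frames
    trial = rng.normal(size=(110, 13))   # test utterance of the same fixed phrase
    print("DTW cost:", dtw_cost(enroll, trial))
    print("accepted:", verify(enroll, trial, threshold=4.0))  # threshold is illustrative
```

In practice the decision threshold would be tuned on a development set (e.g. at the equal error rate), and the same scoring procedure applies whether the frame vectors are AMFs or MFCCs.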
