Abstract
Text-dependent speaker verification systems can be deployed for more and more applications of security authorization, in which both speaker identity and lexical content should be verified together. There are several methods to model the utterances with short duration, such as Gaussian mixture model-universal background model (GMM-UBM), tied mixture model-UBM(TMM-UBM) and GMM-support vector machines(GMM-SVM). In this paper, we build an efficient i-vector based system and compare it with other systems based on the RSR2015 and King-ASR-L-057 corpus. Experimental results showed that the EER value of imposter-correct(IC) could be reduced by 40.3% for King-ASR-L-057 and 63.8% for RSR2015 in contrast to the traditional GMM-UBM system.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have