Abstract

This work presents experimental results on speaker recognition with voice activity detection (VAD). A VAD algorithm based on a finite state machine is first introduced. The algorithm is then incorporated into two speaker recognition (SR) systems. Mel-frequency cepstral coefficients (MFCCs) are adopted as the speech feature parameters in both systems. Vector quantization (VQ) and the Gaussian mixture model (GMM) are the classifiers of the two SR systems, respectively. The experimental results show that VAD improves the performance of both SR systems when the speech database is small. However, as the speech database grows, the performance of both SR systems with VAD progressively degrades relative to the same systems without VAD. The reason for this phenomenon is analyzed in detail.
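
To make the idea of a finite-state-machine VAD concrete, the following is a minimal sketch and not the algorithm evaluated in this work. The abstract does not specify the states, features, or thresholds, so everything here is an assumption made for illustration: the four states (`SILENCE`, `ONSET`, `SPEECH`, `HANGOVER`), the use of short-time frame energy as the only feature, and the parameter names `threshold`, `onset_frames`, and `hangover_frames`.

```python
from enum import Enum


class State(Enum):
    # Hypothetical state set; the paper's actual states are not given in the abstract.
    SILENCE = 0
    ONSET = 1
    SPEECH = 2
    HANGOVER = 3


def frame_energies(samples, frame_len):
    """Mean squared amplitude of each non-overlapping frame."""
    return [
        sum(x * x for x in samples[i:i + frame_len]) / frame_len
        for i in range(0, len(samples) - frame_len + 1, frame_len)
    ]


def fsm_vad(energies, threshold, onset_frames=2, hangover_frames=3):
    """Label each frame as speech (True) or non-speech (False).

    A frame is "loud" when its energy exceeds `threshold` (an assumed,
    fixed value). Speech is declared only after `onset_frames` consecutive
    loud frames, and is held for `hangover_frames` quiet frames before
    the machine returns to silence.
    """
    state = State.SILENCE
    count = 0
    labels = []
    for e in energies:
        loud = e > threshold
        if state is State.SILENCE:
            if loud:
                state, count = State.ONSET, 1
        elif state is State.ONSET:
            if loud:
                count += 1
                if count >= onset_frames:
                    state = State.SPEECH
            else:
                state = State.SILENCE  # isolated burst, treated as noise
        elif state is State.SPEECH:
            if not loud:
                state, count = State.HANGOVER, 1
        elif state is State.HANGOVER:
            if loud:
                state = State.SPEECH
            elif count >= hangover_frames:
                state = State.SILENCE
            else:
                count += 1
        # Frames in SPEECH or HANGOVER are kept for feature extraction.
        labels.append(state in (State.SPEECH, State.HANGOVER))
    return labels
```

In a pipeline like the one described, only frames labeled `True` would be passed on to MFCC extraction. The onset and hangover counters trade off rejecting short noise bursts against clipping word beginnings and endings, and their values here are placeholders rather than tuned settings.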
