Abstract

This paper describes the implementation of a speaker identification system with reference to Assamese language. The database consists of speech samples that were collected from 15 (fifteen) speakers for ten Assamese words representing the Assamese digits from 0 (shounyo) to 9 (no). Mel Frequency Cepstral Coefficients (MFCC) are used as features for this study. Two independent speaker identification systems have been built in this paper using Vector Quantization (VQ) and I-vector technique. The system built using the I-vector technique obtains comparatively better identification accuracy for speakers when compared with the system developed using VQ technique. Three different systems have been built for both the techniques based on variable feature size. A maximum accuracy of 92.38% is achieved using I-vector technique with 39 MFCC features.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call