Abstract
The use of feature vectors obtained by concatenation of different features for text independent speaker identification from clean and telephone speech is studied. The composite feature vectors are examined with GMM and VQ models used to classify speakers. Linear discriminant analysis (LDA), a statistical tool designed to select a reduced set of features for best classification, is applied to enhance performance. The use of LDA for reducing the size of composite feature vector was found satisfactory for clean speech but not for telephone speech. On the other hand, using LDA in the not conventional manner — as a nonsingular transformation (i.e. without size reduction) — improved the performance of composite features in both clean and the telephone speaker identification experiments.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.