Abstract
This paper motivates the use of RASTA-MFCC (RelAtive SpecTrA-Mel Frequency Cepstral Coefficients) feature and GMM-UBM modeling for text independent voice based students' attendance system under noisy environment. MFCC has been identified as an efficient feature for identifying the speaker because it extracts speaker specific information. The performance of even best speaker identification system with MFCC feature degrades in uncontrolled communication environment. RASTA processing of speech improves the performance of identification system even in the presence of convolutional and additive noise. This paper combines the best of these two processes to yield RASTA-MFCC feature which is robust to noise and also contributes speaker dependent information to identify the speaker efficiently. GMM-UBM (Gaussian Mixture Model-Universal Background Model) modeling technique is used for its faster training and relatively easier updating of new speakers. Experimental result of 93.2% accuracy for Triangular filter bank and 94.5% accuracy for Gaussian filter bank are obtained for 50 speakers of MEPCO speech database in presence of additive and convolutive noise in the context of voice based students' attendance entry.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.