Abstract
This paper proposes an automated speaker recognition system for home service robots. In an uncontrolled environment, a speech classifier should adapt to different users and be robust to noise. Specific features and classifiers are usually observed to suit some parts of the problem domain better than others; therefore, we propose a self-optimizing approach in which multiple feature extraction and classification techniques are considered simultaneously. The system uses a genetic algorithm to jointly select features and classifiers, and the results from multiple classifiers are then combined using Dempster-Shafer theory. The feature extractors considered are linear-prediction coefficients, linear-prediction cepstral coefficients, mel-frequency cepstral coefficients, and bark-frequency cepstral coefficients; the classifiers considered are the Gaussian mixture model, support vector machines, the C4.5 decision tree, k-nearest neighbors, and a multilayer perceptron neural network. The WEVER-R2 home service robot is used in a typical Korean home environment to collect speech signals for evaluating the proposed system on gender and age classification. Classification results show that the proposed method consistently outperforms the individual classifiers.
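As an illustration of the fusion step, the sketch below applies Dempster's rule of combination to the outputs of two classifiers on a gender task. It is a minimal example under stated assumptions, not the paper's implementation: the abstract does not say how classifier outputs are converted to mass functions, so the function name `combine_dempster` and the mass values are hypothetical.

```python
from itertools import product


def combine_dempster(m1, m2):
    """Combine two mass functions with Dempster's rule of combination.

    Each mass function maps a frozenset of class labels (a focal element)
    to its mass; the masses of one function are assumed to sum to 1.
    """
    combined = {}
    conflict = 0.0
    for (a, mass_a), (b, mass_b) in product(m1.items(), m2.items()):
        intersection = a & b
        if intersection:
            # Agreeing evidence: the product mass goes to the intersection.
            combined[intersection] = combined.get(intersection, 0.0) + mass_a * mass_b
        else:
            # Contradictory evidence: accumulate the conflict mass K.
            conflict += mass_a * mass_b
    norm = 1.0 - conflict
    if norm <= 1e-12:
        raise ValueError("Sources are in total conflict; the rule is undefined.")
    # Renormalize by 1 - K so the combined masses sum to 1 again.
    return {focal: mass / norm for focal, mass in combined.items()}


# Hypothetical mass assignments from two classifiers (illustrative values only).
MALE = frozenset({"male"})
FEMALE = frozenset({"female"})
EITHER = MALE | FEMALE  # mass on the whole frame expresses ignorance

classifier_a = {MALE: 0.6, FEMALE: 0.1, EITHER: 0.3}
classifier_b = {MALE: 0.5, FEMALE: 0.2, EITHER: 0.3}

fused = combine_dempster(classifier_a, classifier_b)
decision = max((MALE, FEMALE), key=lambda label: fused.get(label, 0.0))
```

Assigning mass to the whole frame lets a classifier express uncertainty instead of forcing a hard vote, which is one reason this rule is used for combining heterogeneous classifiers. More than two classifiers can be fused by applying the function repeatedly, since the rule is associative.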
More From: IEEE Transactions on Instrumentation and Measurement