Abstract
This paper presents comparison and optimization of acoustic features for source cell-phone recognition using recorded speech signals. Different acoustic feature extraction methods such as Mel-frequency, linear frequency and Bark frequency cepstral coefficients (MFCC, LFCC and BFCC) and linear prediction cepstral coefficients (LPCC) are considered. In addition to different feature sets, the effect of dynamic features, delta and double-delta coefficients (Δ and Δ2), and feature normalizations, cepstral mean normalization (CMN), cepstral variance normalization (CVN) and cepstral mean and variance normalization (CMVN) are also examined on the performance of source cell-phone recognition. The same support vector machine (SVM) classifier with fixed parameters and the same cell-phone dataset are used in the experiments in order to make a fair comparison of different features and feature normalization techniques.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.