Audio classification is a difficult task because of the issue of extracting and choosing the optimum audio features. To reduce the computational complication from existing methods, this study proposes a feature-selection method based on modified bacterial foraging optimisation algorithm (MBFOA) for classification of audio signals. Enhanced mel-frequency cepstral coefficient and enhanced power normalised cepstral coefficients with peak and pitch are estimated the signal feature and optimised using MBFOA with the fitness function. Using the probabilistic neural network, the audio signal is classified into music and speech signal. Then, if the signal is music, the signal is classified as cello, clarinet, flute etc. If the signal is detected as a speech, then it is again classified as male or female voice. This approach shows that it is possible to boost the classification accuracy by using different features and optimisation technique.
Read full abstract