Musical pitch estimation using a supervised single hidden layer feed-forward neural network

Pat Taweewat,Chai Wutiwiwatchai

doi:10.1016/j.eswa.2012.07.063

Abstract

Musical pitch estimation is used to find musical note pitch or the fundamental frequency (F0) of audio signal which can be applied to a pre-processing part of many applications such as sound separation, musical note transcription, etc. In this work, a method for the pitch estimation based on classification framework has been designed using a supervised single hidden layer feed-forward neural network. To make this method have good performances in terms of generalization, high-speed training and small network size, two main investigations have been done. First, we find the suitable feature vector by comparing different performances of feature generation methods using extreme learning machine (ELM) framework for training the network. Second, different input-weight fine tuning methods have been compared for reducing the network size. We evaluated the method using multiple-pitch multi-instrument signals generated from datasets of real musical instrument recordings. For feature generation method, the feature vector generated from combining pitch histogram and pitch-frequency scaled spectrum shows the best performance in the experiment. For the fine tuning method, we compare ELM framework with Cuckoo search and sign-based propagation tunings. After the network size is further reduced to 40%, we found that the network trained with sign-based propagation tuning shows a better performance than that trained by ELM framework for the unseen dataset.

Full Text