Abstract

While pitch detection has been the focal subject of numerous research efforts for several decades, it is still a challenging task in noisy conditions. In this article, we propose a method to improve the pitch detection accuracy of conventional pitch detection methods. The proposed pitch detection process starts with using the pitch value estimated by a conventional pitch detection method. Then, it extracts pitch candidates according to the most probable types of errors in the initial estimation of high-pitch and low-pitch frames classified by a Deep Convolutional Neural Network (DCNN). Next, a restrained selection procedure is run to find the true pitch value from the set of pitch candidates. In this procedure, we employ two features (harmonic summation and Euclidian deviation), the soft decision of the DCNN, the pitch smoothness feature in successive frames, and the effect of the initial estimation in a cost function. The pitch value which leads to the lowest cost value is chosen as the estimated pitch value. The simulations on CSTR and KEELE databases, in noisy environments with twelve types of noise, were performed. The results show the superiority of the proposed method over the state-of-the-art methods under different SNR conditions.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call