Abstract

The authors present an algorithm for pitch estimation including voiced/unvoiced decision in the case of a noisy speech and when two speakers are talking simultaneously. The approach is based on the spectral multi-scale product (SMP) analysis of the sound mixture. SMP is the spectrum of the product of three successive wavelet transform coefficients of the speech. The wavelet used for SMP analysis is the quadratic spline function. The proposed method is compared with other state-of-the-art algorithms. It is robust in the presence of a noise and permits the pitch estimation of the dominant speech and the concurrent one from the sound mixture with high accuracy.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call