Abstract

In this work, a new instantaneous fundamental frequency extraction method is presented, with the attention especially focused on its robustness for pathological voices processing. It is based on the Ensemble Empirical Mode Decomposition (EEMD) algorithm, which is a completely data-driven method for signal decomposition into a sum of AM - FM components, called Intrinsic Mode Functions (IMFs) or modes. Our results show that the speech fundamental frequency can be captured in a single IMF. We also propose an algorithm for selecting the mode where the fundamental frequency can be found, based on the logarithm of the power of the IMFs. The instantaneous frequency is then extracted by means of well-known techniques. The behaviour of the proposed method is compared with other two ones (Robust Algorithm for Pitch Tracking -RAPT- and auto-correlation based algorithms), both in normal and pathological sustained vowels.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call