Abstract
Current pitch detection algorithms run into difficulties when used on dysphonic voices. Two major sources of difficulty are the presence in the phonatory output of frictional, non-harmonic energy (in whispery voices), and microperturbatory fundamental frequency jitter and amplitude shimmer (in harsh and creaky voices). For adequate performance on dysphonic voices, pitch detection algorithms should have the following characteristics: 1. work on acoustic recordings from men, women and children 2. be noise resistant 3. work on continuous speech. Measures of pitch perturbation are defined. Three pitch detection algorithms were applied to the speech of dysphonic speakers as well as a control group of speakers. Two detectors work in the time domain (simplified inverse filter tracking (1) and a parallel processing method (2)), and one in the frequency domain (cepstral pitch detection (3)), Their comparative performance on perceptually rated clinical material is discussed.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.