Abstract
Detection of the pitch frequency, the basic characteristic of the voice, is one of the most important research subjects in voice analysis and synthesis. In this paper we study how the BPFP method using bandpass filter pairs and the BPFP‐NN method combining the BPFP banks with a neural network (NN), proposed by the authors, perform in voiced/unvoiced (U/V) detection and pitch extraction in the presence of pitch disturbance, waveform variation, and added noise, in comparison with the cepstrum method and the modified correlation method. According to the experimental results, the BPFP and BPFP‐NN methods exhibit more stable performance and more effective pitch extraction with U/V detection than the typical cepstrum and modified correlation methods. The effect is particularly evident at low frequencies, owing to the logarithmic spacing of the center frequencies of the BPFP banks. © 2003 Wiley Periodicals, Inc. Electron Comm Jpn Pt 3, 86(5): 24–35, 2003; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/ecjc.10045