Abstract
Vowel onset point (VOP) is the instant of time at which vowel region starts in a speech signal. VOP plays a vital role in different applications of speech processing, such as syllable detection, speaker verification, duration modification, language identification etc. There are different existing algorithms for the detection of instance of VOP in a speech signal. The algorithm based on the combined evidences extracted from the source excitation, spectral peaks and modulation spectrum has been used as a baseline system for the present work. The baseline system performs well under clean speech data. However, under noisy conditions the performance of the baseline system degrades. The performance of the system degrades in terms of more number of spurious VOPs, which get detected under noisy speech conditions. According to the available literature, this degraded performance is due to the spectral broadening of the speech in the noisy environments. In this paper we have proposed a pre-processing technique on top of the baseline system to reduce this spectral broadening effect of noise. The noisy speech data are passed through the pre-processing algorithm in order to minimize the spectral broadening effect of speech signal. The pre-processed speech is then passed through the baseline system to detect the VOPs in the speech signal. Experiments were carried out under clean and different noisy speech signals. The results of the experiment show an improvement of 16–21% in terms of removal of spurious VOPs, over the existing baseline system under different noisy speech conditions. Further, the performance of the proposed method has been compared with two different best performing techniques for detection of VOP, and found that the proposed method gives a superior level of performance in terms of identification accuracy and identification rate.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.