Abstract

ABSTRACT In this paper, we propose a novel approach for accurate detection of the vowel end points (VEPs) in any mode of speech. VEP is the instant at which the vowel ends in the speech signal. In this study, we have considered three broad modes of speech, namely; conversation, extempore, and read. The existing methods were explored the VEP detection for read mode of speech, and it may not be appropriate for the VEP detection in extempore and conversation modes. This is due to the acoustic characteristic of read mode is very different from the modes as mentioned earlier. To handle this problem, we proposed a two-stage method for accurately detecting the VEPs, irrespective of modes. At the first stage, vowel onset points (VOPs) are detected in a speech signal using our recent method based on continuous wavelet transform and phone boundary. VOP represents the start of the vowel in the speech signal. At the second stage, phone boundaries are detected using spectral transition measure approach, and then the closest succeeding phone boundary for each detected VOP is considered as detected VEP. Experiments involve TIMIT and Bengali speech corpora. Performance of the proposed VEP detection method is compared with two state-of-the-art signal processing methods. The significance of the proposed method is shown by automatically detecting vowel regions from the TIMIT and Bengali speech corpora. The evaluation results report that the performance of the proposed method is significantly better than the existing methods.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call