Abstract

In this paper we develop a voice activity detection algorithm based on spectrum estimation of speech and non-speech segments using vector quantization method. In this method, we try to classify entry speech signal to speech and non-speech classes. Commonly, the performance of the voice activity detection (VAD) algorithms in non-stationary background noise is not so satisfying under low SNR, so we try to concentrate our study on this issue. The model of a non-speech is a codebook generated from noise and model of speech is several codebook generated from speech contaminated by noise in some different SNR. The labeling is performed by evaluating the distortions between the entry signal samples and the designed models. Our simulation results based on the Persian speech database show that the VQ based VAD is high performance in low SNR conditions (SNR<5 dB).

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.