Abstract

In this paper, a customized speaker verification system combined with noise-cancellation using blind source separation was proposed. This system is divided into two phases: the noise-cancellation phase and the speaker verification phase. In the noise-cancellation phase, a fast time-frequency mask technique based on Short Time Fourier Transform (STFT) was proposed for separating a mixture of two input sounds in a single signal. After obtaining the separated speech data, this input is processed to the wake-up word system. In the speaker verification phase, we use Mel-Frequency Cepstral Coefficients (MFCC) as the feature extraction module. Then we train the feature data into a voiceprint model and a state sequence model of the speaker using Gaussian mixture model (GMM) and hidden Markov model (HMM), respectively. An analysis is done on noisy speech signals corrupted by white noise at different angles. Based on the output SIR (Signal to Interference Ratio) and SDR (Signal to Distortion Ratio) analysis, the improved accuracy is derived in the proposed system. We have obtained promising results in the real experimental environment.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call