Customized Speaker Verification System with Noise-Cancellation using Blind Source Separation

Tsung-Han Tsai,Ping-Cheng Hao,Fong-Lin Tsai

doi:10.1109/icce-taiwan55306.2022.9869211

Tsung-Han Tsai, Ping-Cheng Hao + Show 1 more

https://doi.org/10.1109/icce-taiwan55306.2022.9869211

Copy DOI

Export

Save

Cite

Publication Date: Jul 6, 2022

Affiliation: Central University

Abstract
Full-Text
Similar Papers

Abstract

Listen

In this paper, a customized speaker verification system combined with noise-cancellation using blind source separation was proposed. This system is divided into two phases: the noise-cancellation phase and the speaker verification phase. In the noise-cancellation phase, a fast time-frequency mask technique based on Short Time Fourier Transform (STFT) was proposed for separating a mixture of two input sounds in a single signal. After obtaining the separated speech data, this input is processed to the wake-up word system. In the speaker verification phase, we use Mel-Frequency Cepstral Coefficients (MFCC) as the feature extraction module. Then we train the feature data into a voiceprint model and a state sequence model of the speaker using Gaussian mixture model (GMM) and hidden Markov model (HMM), respectively. An analysis is done on noisy speech signals corrupted by white noise at different angles. Based on the output SIR (Signal to Interference Ratio) and SDR (Signal to Distortion Ratio) analysis, the improved accuracy is derived in the proposed system. We have obtained promising results in the real experimental environment.

Full Text