Voice Liveness Detection using Constant-Q Transform-Based Features

Ankur T Patil, Kuldeep Khoria, Hemant A Patil

doi:10.23919/eusipco55093.2022.9909591

Abstract

In this work, we propose a Constant-Q transform (CQT)-based feature set for voice liveness detection (VLD), which can increase confidence in the authenticity of the speaker in an Automatic Speaker Verification (ASV) system. A live speaker can be characterized by the presence of pop noise in the speech signal. Pop noise appears as a burst and has low-frequency characteristics. We present a modified CQT-based approach as an alternative to the traditional Short-Time Fourier Transform (STFT)-based algorithm, which serves as the baseline for VLD. Experiments are performed on the recently released POp noise COrpus (POCO) dataset with statistical, discriminative, and deep learning-based classifiers, namely Gaussian Mixture Models (GMMs), Support Vector Machines (SVMs), Convolutional Neural Networks (CNNs), and Light-CNN (LCNN). The proposed CQT-based features show a significant improvement in performance over the STFT-based features. The best performance is obtained with the CQT-LCNN architecture, which achieves 81.93% accuracy on the evaluation set. Furthermore, we analyze the word-wise performance of the CNN- and LCNN-based VLD systems using the proposed CQT-based features against the STFT-based baseline features.
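
As a rough illustration of why a constant-Q analysis may suit low-frequency pop noise, the sketch below compares how many frequency bins each transform places in a low band. It uses the standard geometric spacing of CQT center frequencies, f_k = f_min * 2^(k/B), against the linear spacing of STFT bins. All numeric settings (16 kHz sampling rate, 512-point FFT, 10 Hz minimum frequency, 12 bins per octave, 100 Hz cutoff) and all function names are assumptions chosen for illustration; they are not taken from the paper, and the paper's modified CQT is not reproduced here.

```python
def cqt_center_frequencies(f_min, n_bins, bins_per_octave):
    """Geometrically spaced CQT center frequencies: f_k = f_min * 2^(k / B)."""
    return [f_min * 2.0 ** (k / bins_per_octave) for k in range(n_bins)]


def stft_center_frequencies(sample_rate, n_fft):
    """Linearly spaced STFT bin frequencies from 0 Hz up to Nyquist."""
    return [k * sample_rate / n_fft for k in range(n_fft // 2 + 1)]


def count_bins_below(freqs, cutoff_hz):
    """Count bins with a center frequency in (0, cutoff_hz]; the DC bin is excluded."""
    return sum(1 for f in freqs if 0.0 < f <= cutoff_hz)


# Hypothetical settings, chosen only for illustration (not from the paper).
SAMPLE_RATE = 16000   # Hz
N_FFT = 512           # STFT bin spacing is 16000 / 512 = 31.25 Hz
F_MIN = 10.0          # Hz, lowest CQT center frequency
BINS_PER_OCTAVE = 12
N_CQT_BINS = 96       # eight octaves above F_MIN
CUTOFF_HZ = 100.0     # assumed upper edge of the low band of interest

stft_freqs = stft_center_frequencies(SAMPLE_RATE, N_FFT)
cqt_freqs = cqt_center_frequencies(F_MIN, N_CQT_BINS, BINS_PER_OCTAVE)

stft_low = count_bins_below(stft_freqs, CUTOFF_HZ)
cqt_low = count_bins_below(cqt_freqs, CUTOFF_HZ)

print(f"STFT bins at or below {CUTOFF_HZ:.0f} Hz: {stft_low}")
print(f"CQT bins at or below {CUTOFF_HZ:.0f} Hz: {cqt_low}")
```

Under these assumed settings the CQT places many more bins in the low band than the STFT does, which is the general property that makes it a plausible fit for low-frequency bursts. The actual counts depend entirely on the parameters chosen.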

Similar Papers

On significance of constant-Q transform for pop noise detection
Kuldeep Khoria ... Hemant A Patil
Computer Speech & Language | VOL. 77
11 Jun 2022

Morse Wavelet Features for Pop Noise Detection
Priyanka Gupta ... Hemant A Patil
11 Jul 2022

Morse wavelet transform-based features for voice liveness detection
Priyanka Gupta ... Hemant A Patil
Computer Speech & Language | VOL. 84
29 Sep 2023

Modified Group Delay Function Using Different Spectral Smoothing Techniques for Voice Liveness Detection
Shrishti Singh ... Hemant A Patil
01 Jan 2020
