Combination of Amplitude and Frequency Modulation Features for Presentation Attack Detection

Madhu R Kamble,Hemant A Patil

doi:10.1007/s11265-020-01532-3

Abstract

In this paper, we propose the combination of Amplitude Modulation and Frequency Modulation (AM-FM) features for replay Spoof Speech Detection (SSD) task. The AM components are known to be affected by noise (in this case, due to replay mechanism). In particular, we exploit this damage in AM component to corresponding Instantaneous Frequency (IF) for SSD task. Thus, the novelty of proposed Amplitude Weighted Frequency Cepstral Coefficients (AWFCC) feature set lies in using frequency components along with squared weighted amplitude components that are degraded due to replay noise. The AWFCC feature set contains the information of both AM and FM components together and hence, gave discriminatory information in the spectral characteristics. The experiments were performed on publicly available ASVspoof 2017 challenge version 1.0 and 2.0 databases using AWFCC feature set. We have compared results of proposed feature set with the other state-of-the-art feature set, such as Constant Q Cepstral Coefficients (CQCC), Linear Frequency Cepstral Coefficients (LFCC), Mel Frequency Cepstral Coefficients (MFCC) and using a simple Gaussian Mixture Model (GMM) classifier. The individual performance of AWFCC feature set obtained lower % EER than the other feature sets on both version 1.0 and 2.0 databases. Furthermore, we used score-level fusion in order to obtain the possible complementary information of two feature sets to reduce the % EER further. To that effect, the score-level fusion of CQCC and AWFCC feature sets gave 5.75 % and 10.42 % EER on development and evaluation sets, respectively, of ASVspoof 2017 version 2.0 database. Moreover, for evaluation dataset, we have also studied the performance of proposed feature set on different Replay Configurations (RC), namely, acoustic environments, playback, and recording devices. For all the levels of threat conditions (i.e., low, medium, and high) to the ASV system, the proposed feature set performed better compared to the existing state-of-the-art feature sets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Combination of Amplitude and Frequency Modulation Features for Presentation Attack Detection

Abstract

Talk to us

Similar Papers

More From: Journal of Signal Processing Systems

Lead the way for us

Journal: Journal of Signal Processing Systems	Publication Date: Apr 15, 2020
Citations: 1

Similar Papers

Novel Amplitude Weighted Frequency Modulation Features for Replay Spoof Detection
Madhu R Kamble ... Hemant A Patil
-
Madhu R Kamble, et. al.Madhu R Kamble ... Hemant A Patil
01 Nov 2018
01 Nov 2018

Detection of replay spoof speech using teager energy feature cues
Madhu R Kamble ... Hemant A Patil
Computer Speech & Language | VOL. 65
Madhu R Kamble, et. al.Madhu R Kamble ... Hemant A Patil
14 Aug 2020
Computer Speech & Language | VOL. 65

Robustness of DAS Beamformer Over MVDR for Replay Attack Detection On Voice Assistants
Piyushkumar K Chodingala ... Ankur T Patil
-
Piyushkumar K Chodingala, et. al.Piyushkumar K Chodingala ... Ankur T Patil
11 Jul 2022
11 Jul 2022

Novel Demodulation-Based Features using Classifier-level Fusion of GMM and CNN for Replay Detection
Madhu R Kamble ... Hemant A Patil
-
Madhu R Kamble, et. al.Madhu R Kamble ... Hemant A Patil
01 Nov 2018
01 Nov 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Combination of Amplitude and Frequency Modulation Features for Presentation Attack Detection

Abstract

Talk to us

Similar Papers

More From: Journal of Signal Processing Systems