Dataset Artefacts in Anti-Spoofing Systems: A Case Study on the ASVspoof 2017 Benchmark

Bhusan Chettri,Emmanouil Benetos,Bob L T Sturm

doi:10.1109/taslp.2020.3036777

Abstract

The Automatic Speaker Verification Spoofing and Countermeasures Challenges motivate research in protecting speech biometric systems against a variety of different access attacks. The 2017 edition focused on replay spoofing attacks, and involved participants building and training systems on a provided dataset (ASVspoof 2017). More than 60 research papers have so far been published with this dataset, but none have sought to answer why countermeasures appear successful in detecting spoofing attacks. This article shows how artefacts inherent to the dataset may be contributing to the apparent success of published systems. We first inspect the ASVspoof 2017 dataset and summarize various artefacts present in the dataset. Second, we demonstrate how countermeasure models can exploit these artefacts to appear successful in this dataset. Third, for reliable and robust performance estimates on this dataset we propose discarding nonspeech segments and silence before and after the speech utterance during training and inference. We create speech start and endpoint annotations in the dataset and demonstrate how using them helps countermeasure models become less vulnerable from being manipulated using artefacts found in the dataset. Finally, we provide several new benchmark results for both frame-level and utterance-level models that can serve as new baselines on this dataset.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Dataset Artefacts in Anti-Spoofing Systems: A Case Study on the ASVspoof 2017 Benchmark

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing

Lead the way for us

Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing	Publication Date: Jan 1, 2020
Citations: 66

Similar Papers

ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech
Xin Wang ...
Computer Speech & Language | VOL. 64
Xin Wang, et. al.Xin Wang ...
20 May 2020
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech
Xin Wang ...

Replay spoofing attack detection using deep neural networks
Bekir Bakar ... Cemal Hanilçi
-
Bekir Bakar, et. al.Bekir Bakar ... Cemal Hanilçi
01 May 2018
01 May 2018

DNN controlled adaptive front-end for replay attack detection systems
Buddhi Wickramasinghe ... Ting Dang
Speech Communication | VOL. 154
Buddhi Wickramasinghe, et. al.Buddhi Wickramasinghe ... Ting Dang
20 Aug 2023
Speech Communication | VOL. 154

Recurrent Neural Networks for Automatic Replay Spoofing Attack Detection
Zhuxin Chen ... Zhifeng Xie
-
Zhuxin Chen, et. al.Zhuxin Chen ... Zhifeng Xie
01 Apr 2018
01 Apr 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Dataset Artefacts in Anti-Spoofing Systems: A Case Study on the ASVspoof 2017 Benchmark

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing