ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech

Xin Wang,Hirokazu Kameoka,Yu Tsao,Avashna Govender,Tomi Kinnunen,Fergus Henderson,Sébastien Le Maguer,Srikanth Ronanki,Massimiliano Todisco,Kong Aik Lee,Yi-Chiao Wu,Jean-François Bonastre,Junichi Yamagishi,Andreas Nautsch,Md Sahidullah,Zhen-Hua Ling,Rob Clark,Wen-Chin Huang,Nicholas Evans,Kou Tanaka,Takashi Kaneda,Lauri Juvela,Héctor Delgado,Ingmar Steiner,Yuan Jiang,Kai Onuma,Tomoki Toda,Yu Zhang,Quan Wang,Koji Mushika,Ville Vestman,Driss Matrouf,Hsin-Te Hwang,Paavo Alku,Markus C Becker ,Yuheng Jia ,Jingxuan Zhang ,Hsin‐Min Wang ,Yuhuai Peng ,Lijuan Liu

doi:10.1016/j.csl.2020.101114

Abstract

Automatic speaker verification (ASV) is one of the most natural and convenient means of biometric person recognition. Unfortunately, just like all other biometric systems, ASV is vulnerable to spoofing, also referred to as “presentation attacks.” These vulnerabilities are generally unacceptable and call for spoofing countermeasures or “presentation attack detection” systems. In addition to impersonation, ASV systems are vulnerable to replay, speech synthesis, and voice conversion attacks.The ASVspoof challenge initiative was created to foster research on anti-spoofing and to provide common platforms for the assessment and comparison of spoofing countermeasures. The first edition, ASVspoof 2015, focused upon the study of countermeasures for detecting of text-to-speech synthesis (TTS) and voice conversion (VC) attacks. The second edition, ASVspoof 2017, focused instead upon replay spoofing attacks and countermeasures. The ASVspoof 2019 edition is the first to consider all three spoofing attack types within a single challenge. While they originate from the same source database and same underlying protocol, they are explored in two specific use case scenarios. Spoofing attacks within a logical access (LA) scenario are generated with the latest speech synthesis and voice conversion technologies, including state-of-the-art neural acoustic and waveform model techniques. Replay spoofing attacks within a physical access (PA) scenario are generated through carefully controlled simulations that support much more revealing analysis than possible previously. Also new to the 2019 edition is the use of the tandem detection cost function metric, which reflects the impact of spoofing and countermeasures on the reliability of a fixed ASV system. This paper describes the database design, protocol, spoofing attack implementations, and baseline ASV and countermeasure results. It also describes a human assessment on spoofed data in logical access. It was demonstrated that the spoofing data in the ASVspoof 2019 database have varied degrees of perceived quality and similarity to the target speakers, including spoofed data that cannot be differentiated from bona fide utterances even by human subjects. It is expected that the ASVspoof 2019 database, with its varied coverage of different types of spoofing data, could further foster research on anti-spoofing.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computer Speech & Language	Publication Date: May 20, 2020
Citations: 201	License type: publisher-specific-oa

R Discovery Prime

R Discovery Prime

ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech

Abstract

Talk to us

Similar Papers

More From: Computer Speech & Language

Lead the way for us

Similar Papers

Voice Presentation Attacks Detection using Acoustic MLTP Features and BiLSTM
Sundas Ibrar ... Hafsa Ilyas
-
Sundas Ibrar, et. al.Sundas Ibrar ... Hafsa Ilyas
17 May 2023
17 May 2023

GANBA: Generative Adversarial Network for Biometric Anti-Spoofing
Alejandro Gomez-Alanis ... Antonio M Peinado
Applied Sciences | VOL. 12
Alejandro Gomez-Alanis, et. al.Alejandro Gomez-Alanis ... Antonio M Peinado
29 Jan 2022
Applied Sciences | VOL. 12

On Joint Optimization of Automatic Speaker Verification and Anti-Spoofing in the Embedding Space
Alejandro Gomez-Alanis ... Antonio M Peinado
IEEE Transactions on Information Forensics and Security | VOL. 16
Alejandro Gomez-Alanis, et. al.Alejandro Gomez-Alanis ... Antonio M Peinado
18 Nov 2020
IEEE Transactions on Information Forensics and Security | VOL. 16

Cross-Database Evaluation of Audio-Based Spoofing Detection Systems
Pavel Korshunov ... Sébastien Marcel
-
Pavel Korshunov, et. al.Pavel Korshunov ... Sébastien Marcel
08 Sep 2016
08 Sep 2016

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech

Abstract

Talk to us

Similar Papers

More From: Computer Speech &amp; Language

More From: Computer Speech & Language