End-to-End Voice Spoofing Detection Employing Time Delay Neural Networks and Higher Order Statistics

Jahangir Alam,Woo Hyun Kang,Abderrahim Fathan

doi:10.1007/978-3-030-87802-3_2

Abstract

Technological progress and proliferation of sophisticated software has made it easier than ever to spoof a person’s voice and audio in general. Like other biometrics, speaker verification is vulnerable to spoofing attacks. Detecting these attacks using the artifacts present in the recordings is a major challenge. Current trend in spoofing detection is to employ deep learning architectures to perform end-to-end detection by employing a pooling layer which aggregates the frame-level information into utterance-level embeddings. To do so, only the first or first and second order statistics are normally pooled across temporal dimension. In this paper, we investigate the influence of higher order statistics, such as third and fourth order moments, on spoofing detection performance. A Time Delay Neural Network (TDNN) architecture is used on the top of linear frequency cepstral coefficients for carrying out spoofing detection experiments on the ASVspoof2019 challenge logical access and physical access corpora. Experiments results, in terms of equal error rate (EER) and minimum tandem detection cost function (min-tDCF), show that inclusion of higher order statistics is accommodating for improving the performance of spoofing detection systems.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

End-to-End Voice Spoofing Detection Employing Time Delay Neural Networks and Higher Order Statistics

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A Speaker Verification Method Based on TDNN–LSTMP
Hui Liu ... Longlian Zhao
Circuits, Systems, and Signal Processing | VOL. 38
Hui Liu, et. al.Hui Liu ... Longlian Zhao
20 Mar 2019
Circuits, Systems, and Signal Processing | VOL. 38

Multi-task deep cross-attention networks for far-field speaker verification and keyword spotting
Xingwei Liang ... Ruifeng Xu
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2023
Xingwei Liang, et. al.Xingwei Liang ... Ruifeng Xu
01 Jul 2023
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2023

Robust deep feature for spoofing detection — the SJTU system for ASVspoof 2015 challenge
Nanxin Chen ... Bo Chen
-
Nanxin Chen, et. al.Nanxin Chen ... Bo Chen
06 Sep 2015
Robust deep feature for spoofing detection — the SJTU system for ASVspoof 2015 challenge
Nanxin Chen ... Bo Chen

Relative phase information for detecting human speech and spoofed speech
Longbiao Wang ... Seiichi Nakagawa
-
Longbiao Wang, et. al.Longbiao Wang ... Seiichi Nakagawa
06 Sep 2015
06 Sep 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

End-to-End Voice Spoofing Detection Employing Time Delay Neural Networks and Higher Order Statistics

Abstract

Talk to us

Similar Papers