Abstract
Synthesis artifacts that span scales from local to global are important cues for spoofing detection, yet few spoofing detection models exploit artifacts across different scales jointly. In this paper, we propose a spoofing detection system built on SincNet and Deep Layer Aggregation (DLA) that leverages speech representations at different levels to distinguish synthetic speech. DLA is fully convolutional with an iterative, tree-like structure. This topology allows speech features from convolution layers at different depths to be compounded, so that local and global speech representations are incorporated simultaneously. Moreover, SincNet serves as the front-end feature extractor, circumventing manual feature extraction and selection: it learns fine-grained features directly from the input speech waveform, making the proposed spoofing detection system end-to-end. The proposed system outperforms the baselines on the ASVspoof LA and DF datasets. Notably, our single model surpasses all competing systems in the ASVspoof DF competition with an equal error rate (EER) of 13.99%, demonstrating the importance of multi-scale information aggregation for synthetic speech detection.
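The core idea behind a SincNet-style front end is that each first-layer filter is a band-pass FIR kernel parameterized only by its two cutoff frequencies, rather than by freely learned taps. As a rough illustration of that parameterization (not the paper's implementation; all names, kernel length, and cutoff values here are illustrative), a band-pass kernel can be built as the difference of two windowed sinc low-pass filters:

```python
import numpy as np

def sinc_bandpass_kernel(f1_hz, f2_hz, kernel_len=101, sr=16000):
    """Band-pass FIR kernel defined only by its two cutoff frequencies,
    in the spirit of SincNet's sinc-convolution layer (illustrative sketch)."""
    n = np.arange(kernel_len) - (kernel_len - 1) / 2  # centred time axis
    f1, f2 = f1_hz / sr, f2_hz / sr                   # normalized cutoffs
    # Difference of two low-pass sinc filters yields a band-pass response.
    kernel = 2 * f2 * np.sinc(2 * f2 * n) - 2 * f1 * np.sinc(2 * f1 * n)
    kernel *= np.hamming(kernel_len)                  # smooth the truncation
    return kernel

# Filter a toy waveform with a 300-3400 Hz band-pass kernel.
rng = np.random.default_rng(0)
wave = rng.standard_normal(1600)                      # 0.1 s at 16 kHz
k = sinc_bandpass_kernel(300.0, 3400.0)
filtered = np.convolve(wave, k, mode="same")
```

In a trainable layer the two cutoffs per filter would be the learned parameters, which is what lets such a front end adapt fine-grained spectral bands directly from the raw waveform.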
Published in: EURASIP Journal on Audio, Speech, and Music Processing