AFP-Conformer: Asymptotic Feature Pyramid Conformer for Spoofing Speech Detection

Yida Huang, Qian Shen, Jianfen Ma

doi:10.1016/j.specom.2024.103149

Abstract

Most existing spoofing speech detection methods use either convolutional neural networks or Transformer architectures as their backbone. Neither adequately represents speech features during feature extraction, which results in poor detection and generalization performance. To address this limitation, we propose a novel spoofing speech detection method based on the Conformer architecture. The method integrates a convolutional module into the Transformer framework to enhance its capacity for local feature modeling, enabling it to extract both local and global information from speech signals simultaneously. In addition, to mitigate the loss or degradation of semantic information that occurs in traditional feature pyramid networks during feature fusion, we propose a feature fusion method based on the asymptotic feature pyramid network (AFPN) that fuses multi-scale features and improves generalization to unknown attacks. Experiments on the ASVspoof 2019 LA dataset show that the proposed method achieves an equal error rate (EER) of 1.61% and a minimum tandem detection cost function (min t-DCF) of 0.045, improving detection performance while enhancing generalization against unknown spoofing attacks. In particular, it yields a substantial performance improvement in detecting the most challenging A17 attack.
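
For readers unfamiliar with the EER metric quoted above, it is the operating point at which the false acceptance rate (spoofed speech accepted as bona fide) equals the false rejection rate (bona fide speech rejected). The following is a minimal sketch of one common way to approximate it, assuming each utterance receives a single score where higher means "more likely bona fide". The function name `compute_eer` and the toy scores are illustrative assumptions, not taken from the paper, and the paper's own evaluation code may differ.

```python
def compute_eer(bonafide_scores, spoof_scores):
    """Approximate the equal error rate by sweeping a decision threshold.

    Assumption: higher score = more likely bona fide. An utterance is
    accepted when its score is >= the threshold.
    """
    # Candidate thresholds: every observed score, plus one above the maximum
    # so that the "reject everything" operating point is also considered.
    thresholds = sorted(set(bonafide_scores) | set(spoof_scores))
    thresholds.append(thresholds[-1] + 1.0)

    best_gap, eer = float("inf"), None
    for t in thresholds:
        # False rejection rate: bona fide utterances scored below the threshold.
        frr = sum(s < t for s in bonafide_scores) / len(bonafide_scores)
        # False acceptance rate: spoofed utterances scored at or above it.
        far = sum(s >= t for s in spoof_scores) / len(spoof_scores)
        gap = abs(far - frr)
        if gap < best_gap:
            # Keep the threshold where the two error rates are closest.
            best_gap, eer = gap, (far + frr) / 2
    return eer


# Toy scores (hypothetical): one bona fide and one spoofed utterance
# fall on the wrong side of the best threshold.
toy_bonafide = [0.9, 0.8, 0.3, 0.6]
toy_spoof = [0.1, 0.2, 0.7, 0.4]
print(f"EER = {compute_eer(toy_bonafide, toy_spoof):.2%}")
```

With finitely many scores the two error rates rarely cross exactly, so this sketch reports their average at the closest threshold. Evaluation toolkits often interpolate along the error curve instead, which can give slightly different values.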

Journal: Speech Communication

Similar Papers

A2-FPN: Attention Aggregation based Feature Pyramid Network for Instance Segmentation
Miao Hu ... Lu Fang
01 Jun 2021

Latent Feature Pyramid Network for Object Detection
Jin Xie ... Jungong Han
IEEE Transactions on Multimedia | VOL. 25
01 Jan 2023

Adaptive learning feature pyramid for object detection
Fukoeng Wong ... Haifeng Hu
IET Computer Vision | VOL. 13
01 Dec 2019

A Non-Local Attention Feature Fusion Network for Multiscale Object Detection
Xuke Wu ... Bo Lu
IEEE Journal of Radio Frequency Identification | VOL. 6
01 Jan 2021
