Abstract

The rapid advancement of deep learning and large-scale AI models has simplified the creation and manipulation of deepfakes, which generate, edit, and replace faces in images and videos. This growing ease of use has made the malicious application of forged faces a significant threat and complicated the task of deepfake detection. Despite the notable success of current deepfake detection methods, which predominantly employ data-driven CNN classification models, these methods exhibit limited generalization and insufficient robustness on novel data unseen during training. To address these challenges, this paper introduces a novel detection framework, ReLAF-Net. The framework employs a restricted self-attention mechanism that applies self-attention to deep CNN features flexibly, enabling the learning of local relationships and inter-regional dependencies at both fine-grained and global levels. The attention module has a modular design and can be seamlessly integrated into CNN backbones to improve overall detection performance. Additionally, we propose an adaptive local frequency feature extraction algorithm that decomposes RGB images into fine-grained frequency components in a data-driven manner, effectively isolating forgery cues in the frequency domain. Moreover, an attention-based channel fusion strategy is developed to combine RGB and frequency information into a comprehensive facial representation. On the high-quality version of the FaceForensics++ dataset, our method attains a detection accuracy of 97.92%, outperforming comparable approaches. Cross-dataset evaluation on Celeb-DF, DFDC, and DFD confirms its strong generalizability, offering a new solution for detecting high-quality deepfake videos.
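To illustrate the general idea behind attention-based channel fusion of RGB and frequency features, here is a minimal NumPy sketch. The function name, the sigmoid gating, and the use of global average pooling are illustrative assumptions, not the paper's exact design.

```python
import numpy as np

def channel_attention_fusion(rgb_feat, freq_feat):
    """Fuse RGB and frequency feature maps (each of shape (C, H, W))
    with a simple channel-attention gate: concatenate along channels,
    global-average-pool each channel, squash the pooled values to
    (0, 1) weights, and reweight the concatenated features.

    NOTE: this is a hypothetical sketch of channel attention in the
    squeeze-and-excitation style, not ReLAF-Net's actual fusion module.
    """
    fused = np.concatenate([rgb_feat, freq_feat], axis=0)  # (2C, H, W)
    pooled = fused.mean(axis=(1, 2))                       # (2C,) channel descriptors
    weights = 1.0 / (1.0 + np.exp(-pooled))                # sigmoid gate per channel
    return fused * weights[:, None, None]                  # reweighted (2C, H, W)

# Usage: fuse two 4-channel 8x8 feature maps into one 8-channel map.
rgb = np.random.rand(4, 8, 8)
freq = np.random.rand(4, 8, 8)
out = channel_attention_fusion(rgb, freq)
```

In a full model, the per-channel gate would typically be produced by a small learned MLP rather than a fixed sigmoid of the pooled values; the sketch only shows the data flow.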
