Interactive Two-Stream Network Across Modalities for Deepfake Detection

Jianghao Wu,Zhaoyang Li,Guilin Pang,Jianping Fan,Zhu Teng,Baopeng Zhang

doi:10.1109/tcsvt.2023.3269841

Abstract

As face forgery techniques have become more mature, the proliferation of deepfakes may threaten the security of human society. Although existing deepfake detection methods achieve good performance for in-dataset evaluation, it remains to be improved in the generalization ability, where the representation of the imperceptible artifacts plays a significant role. In this paper, we propose an Interactive Two-Stream Network (ITSNet) to explore the discriminant inconsistency representation from the perspective of cross-modality. In particular, the patch-wise Decomposable Discrete Cosine Transform (DDCT) is adopted to extract fine-grained high-frequency clues, and information from different modalities communicates with each other via a designed interaction module. To perceive the temporal inconsistency, we first develop a Short-term Embedding Module (SEM) to refine subtle local inconsistency representation between adjacent frames, and then a Long-term Embedding Module (LEM) is designed to further refine the erratic temporal inconsistency representation from the long-range perspective. Extensive experimental results conducted on three public datasets show that ITSNet outperforms the state-of-the-art methods both in terms of in-dataset and cross-dataset evaluations.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Interactive Two-Stream Network Across Modalities for Deepfake Detection

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on circuits and systems for video technology : a publication of the Circuits and Systems Society

Lead the way for us

Journal: IEEE transactions on circuits and systems for video technology : a publication of the Circuits and Systems Society	Publication Date: Nov 1, 2023
Citations: 2

Similar Papers

Learning a deep dual-level network for robust DeepFake detection
Wenbo Pu ... Siwei Lyu
Pattern Recognition | VOL. 130
Wenbo Pu, et. al.Wenbo Pu ... Siwei Lyu
03 Jun 2022
Pattern Recognition | VOL. 130

Exposing Deepfake Videos with Spatial, Frequency and Multi-scale Temporal Artifacts
Yongjian Hu ... Zeqiong Yu
-
Yongjian Hu, et. al.Yongjian Hu ... Zeqiong Yu
01 Jan 2021
01 Jan 2021

Deepfake Video Detection Based on Spatial, Spectral, and Temporal Inconsistencies Using Multimodal Deep Learning
John K Lewis ... Helen Chen
-
John K Lewis, et. al.John K Lewis ... Helen Chen
13 Oct 2020
13 Oct 2020

Improving Generalization of Deepfake Detection with Domain Adaptive Batch Normalization
Zixin Yin ... Haotong Qin
-
Zixin Yin, et. al.Zixin Yin ... Haotong Qin
20 Oct 2021
20 Oct 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Interactive Two-Stream Network Across Modalities for Deepfake Detection

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on circuits and systems for video technology : a publication of the Circuits and Systems Society