Spatiotemporal Inconsistency Learning and Interactive Fusion for Deepfake Video Detection

Dengyong Zhang,Wenjie Zhu,Xin Liao,Feifan Qi,Gaobo Yang,Xiangling Ding

doi:10.1145/3664654

Abstract

With the rise of the metaverse, the rapid advancement of Deepfakes technology has become closely intertwined. Within the metaverse, individuals exist in digital form and engage in interactions, transactions, and communications through virtual avatars. However, the development of Deepfakes technology has led to the proliferation of forged information disseminated under the guise of users’ virtual identities, posing significant security risks to the metaverse. Hence, there is an urgent need to research and develop more robust methods for detecting deep forgeries to address these challenges. This paper explores deepfake video detection by leveraging the spatiotemporal inconsistencies generated by deepfake generation techniques, and thereby proposing the interactive spatioTemporal inconsistency learning and interactive fusion (ST-ILIF) detection method, which consists of phase-aware and sequence streams. The spatial inconsistencies exhibited in frames of deepfake videos are primarily attributed to variations in the structural information contained within the phase component of the Fourier domain. To mitigate the issue of overfitting the content information, a phase-aware stream is introduced to learn the spatial inconsistencies from the phase-based reconstructed frames. Additionally, considering that deepfake videos are generated frame-by-frame and lack temporal consistency between frames, a sequence stream is proposed to extract temporal inconsistency features from the spatiotemporal difference information between consecutive frames. Finally, through feature interaction and fusion of the two streams, the representation ability of intermediate and classification features is further enhanced. The proposed method, which was evaluated on four mainstream datasets, outperformed most existing methods, and extensive experimental results demonstrated its effectiveness in identifying deepfake videos. Our source code is available at https://github.com/qff98/Deepfake-Video-Detection

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Spatiotemporal Inconsistency Learning and Interactive Fusion for Deepfake Video Detection

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Multimedia Computing, Communications, and Applications

Lead the way for us

Journal: ACM Transactions on Multimedia Computing, Communications, and Applications	Publication Date: May 13, 2024
License type: mit

Similar Papers

DeepFake Videos Detection via Spatiotemporal Inconsistency Learning and Interactive Fusion
Xiangling Ding ... Dengyong Zhang
-
Xiangling Ding, et. al.Xiangling Ding ... Dengyong Zhang
20 Sep 2022
20 Sep 2022

Capturing the Persistence of Facial Expression Features for Deepfake Video Detection
Yiru Zhao ... Lei Zhao
-
Yiru Zhao, et. al.Yiru Zhao ... Lei Zhao
01 Jan 2020
01 Jan 2020

Detecting Compressed Deepfake Videos in Social Networks Using Frame-Temporality Two-Stream Convolutional Network
Wei Wang ... Zheng Qin
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 32
Wei Wang, et. al.Wei Wang ... Zheng Qin
01 Mar 2022
IEEE Transactions on Circuits and Systems for Video Technology | VOL. 32

Fully Unsupervised Deepfake Video Detection via Enhanced Contrastive Learning.
Tong Qiao ... Xiangyang Luo
IEEE transactions on pattern analysis and machine intelligence | VOL. PP
Tong Qiao, et. al.Tong Qiao ... Xiangyang Luo
01 Jan 2024
IEEE transactions on pattern analysis and machine intelligence | VOL. PP

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Spatiotemporal Inconsistency Learning and Interactive Fusion for Deepfake Video Detection

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Multimedia Computing, Communications, and Applications