Deepfake Detection Research Articles

Misinformation has become a pressing issue. Fake media, in both visual and textual forms, is widespread on the web. While various DeepFake detection and text fake news detection methods have been proposed, they are only designed for single-modality forgery based on binary classification, let alone analyzing and reasoning subtle forgery traces across different modalities. In this paper, we highlight a new research problem for multi-modal fake media, namely Detecting and Grounding Multi-Modal Media Manipulation (DGM 4). DGM 4 aims to not only detect the authenticity of multi-modal media, but also ground the manipulated content (i.e., image bounding boxes and text tokens), which requires deeper reasoning of multi-modal media manipulation. To support a large-scale investigation, we construct the first DGM 4 dataset, where image-text pairs are manipulated by various approaches, with rich annotation of diverse manipulations. Moreover, we propose a novel HierArchical Multi-modal Manipulation rEasoning tRansformer (HAMMER) to fully capture the fine-grained interaction between different modalities. HAMMER performs: 1) manipulation-aware contrastive learning between two uni-modal encoders as shallow manipulation reasoning and 2) modality-aware cross-attention by multi-modal aggregator as deep manipulation reasoning. Dedicated manipulation detection and grounding heads are integrated from shallow to deep levels based on the interacted multi-modal information. To exploit more fine-grained contrastive learning for cross-modal semantic alignment, we further integrate Manipulation-Aware Contrastive Loss with Local View and construct a more advanced model HAMMER++. Finally, we build an extensive benchmark and set up rigorous evaluation metrics for this new research problem. Comprehensive experiments demonstrate the superiority of HAMMER and HAMMER++; several valuable observations are also revealed to facilitate future research in multi-modal media manipulation.

The viral spread of massive deepfake videos over social networks has caused serious security problems. Despite the remarkable advancements achieved by existing deepfake detection algorithms, deepfake videos over social networks are inevitably influenced by compression factors. This causes deepfake detection performance to be limited by the following challenging issues: (a) interfering with compression artifacts, (b) loss of feature information, and (c) aliasing of feature distributions. In this paper, we analyze the common mechanism between compression artifacts and deepfake artifacts, revealing the structural similarity between them and providing a reliable theoretical basis for enhancing the robustness of deepfake detection models against compression. Firstly, based on the common mechanism between artifacts, we design a frequency domain adaptive notch filter to eliminate the interference of compression artifacts on specific frequency bands. Secondly, to reduce the sensitivity of deepfake detection models to unknown noise, we propose a spatial residual denoising strategy. Thirdly, to exploit the intrinsic correlation between feature vectors in the frequency domain branch and the spatial domain branch, we enhance deepfake features using an attention-based feature fusion method. Finally, we adopt a multi-task decision approach to enhance the discriminative power of the latent space representation of deepfakes, achieving deepfake detection with robustness against compression. Extensive experiments show that compared with the baseline methods, the detection performance of the proposed algorithm on compressed deepfake videos has been significantly improved. In particular, our model is resistant to various types of noise disturbances and can be easily combined with baseline detection models to improve their robustness.

Deepfake Detection Research Articles

Articles published on Deepfake Detection

Detecting and Grounding Multi-Modal Media Manipulation and Beyond.

PVASS-MDD: Predictive Visual-Audio Alignment Self-Supervision for Multimodal Deepfake Detection

Deep Learning Approach to Detect Fake Video on Raspberry PI

Real-Time Deepfake Video Detection Using Eye Movement Analysis with a Hybrid Deep Learning Approach

A New Deepfake Detection Method Based on Compound Scaling Dual-Stream Attention Network

Deep fake detection using an optimal deep learning model with multi head attention-based feature extraction scheme

Multi-domain awareness for compressed deepfake videos detection over social networks guided by common mechanisms between artifacts

Learning Self-distilled Features for Facial Deepfake Detection Using Visual Foundation Models: General Results and Demographic Analysis

A place for (socio)linguistics in audio deepfake detection and discernment: Opportunities for convergence and interdisciplinary collaboration

Optimizing CNN-based Deepfake detection with firefly algorithm: A Hybrid Approach

Predicting manipulated regions in deepfake videos using convolutional vision transformers

Face Warping Deepfake Detection and Localization in a Digital Video using Transfer Learning Approach

Multimodal Deepfake Detection

Deepfake Detection using Integrate-backward-integrate Logic Optimization Algorithm with CNN

Feature fusion Vision Transformers using MLP-Mixer for enhanced deepfake detection

AI Integrity Solutions for Deepfake Identification and Prevention

An Approach to Deepfake Video Detection Based on ACO-PSO Features and Deep Learning

JRC: Deepfake detection via joint reconstruction and classification

Deepfake video detection: challenges and opportunities

Deepfake forensics: a survey of digital forensic methods for multimodal deepfake identification on social media

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Deepfake Detection Research Articles

Articles published on Deepfake Detection

Detecting and Grounding Multi-Modal Media Manipulation and Beyond.

PVASS-MDD: Predictive Visual-Audio Alignment Self-Supervision for Multimodal Deepfake Detection

Deep Learning Approach to Detect Fake Video on Raspberry PI

Real-Time Deepfake Video Detection Using Eye Movement Analysis with a Hybrid Deep Learning Approach

A New Deepfake Detection Method Based on Compound Scaling Dual-Stream Attention Network

Deep fake detection using an optimal deep learning model with multi head attention-based feature extraction scheme

Multi-domain awareness for compressed deepfake videos detection over social networks guided by common mechanisms between artifacts

Learning Self-distilled Features for Facial Deepfake Detection Using Visual Foundation Models: General Results and Demographic Analysis

A place for (socio)linguistics in audio deepfake detection and discernment: Opportunities for convergence and interdisciplinary collaboration

Optimizing CNN-based Deepfake detection with firefly algorithm: A Hybrid Approach

Predicting manipulated regions in deepfake videos using convolutional vision transformers

Face Warping Deepfake Detection and Localization in a Digital Video using Transfer Learning Approach

Multimodal Deepfake Detection

Deepfake Detection using Integrate-backward-integrate Logic Optimization Algorithm with CNN

Feature fusion Vision Transformers using MLP-Mixer for enhanced deepfake detection

AI Integrity Solutions for Deepfake Identification and Prevention

An Approach to Deepfake Video Detection Based on ACO-PSO Features and Deep Learning

JRC: Deepfake detection via joint reconstruction and classification

Deepfake video detection: challenges and opportunities

Deepfake forensics: a survey of digital forensic methods for multimodal deepfake identification on social media