Toward comprehensive short utterances manipulations detection in videos

Abderrazzaq Moufidi,David Rousseau,Pejman Rasti

doi:10.1007/s11042-024-20284-x

Abstract

AbstractIn a landscape increasingly populated by convincing yet deceptive multimedia content generated through generative adversarial networks, there exists a significant challenge for both human interpretation and machine learning algorithms. This study introduces a shallow learning technique specifically tailored for analyzing visual and auditory components in videos, targeting the lower face region. Our method is optimized for ultra-short video segments (200-600 ms) and employs wavelet scattering transforms for audio and discrete cosine transforms for video. Unlike many approaches, our method excels at these short durations and scales efficiently to longer segments. Experimental results demonstrate high accuracy, achieving 96.83% for 600 ms audio segments and 99.87% for whole video sequences on the FakeAVCeleb and DeepfakeTIMIT datasets. This approach is computationally efficient, making it suitable for real-world applications with constrained resources. The paper also explores the unique challenges of detecting deepfakes in ultra-short sequences and proposes a targeted evaluation strategy for these conditions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Toward comprehensive short utterances manipulations detection in videos

Abstract

Talk to us

Similar Papers

More From: Multimedia Tools and Applications

Lead the way for us

Journal: Multimedia Tools and Applications	Publication Date: Oct 14, 2024
License type: CC BY 4.0

Similar Papers

GridHTM: Grid-Based Hierarchical Temporal Memory for Anomaly Detection in Videos.
Vladimir Monakhov ... Michael A Riegler
Sensors | VOL. 23
Vladimir Monakhov, et. al.Vladimir Monakhov ... Michael A Riegler
13 Feb 2023
Sensors | VOL. 23

Generative Adversarial Networks for Abnormal Event Detection in Videos Based on Self-Attention Mechanism
Weichao Zhang ... Hongyu Wang
IEEE Access | VOL. 9
Weichao Zhang, et. al.Weichao Zhang ... Hongyu Wang
01 Jan 2020
IEEE Access | VOL. 9

Notice of Violation of IEEE Publication Principles - Effective visual fire detection in video sequences using probabilistic approach
P Jenifer
-
P JeniferP Jenifer
01 Mar 2011
01 Mar 2011

Unsupervised learning approach for abnormal event detection in surveillance video by revealing infrequent patterns
Tushar Sandhan ... Jin Young Choi
-
Tushar Sandhan, et. al.Tushar Sandhan ... Jin Young Choi
01 Nov 2013
01 Nov 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Toward comprehensive short utterances manipulations detection in videos

Abstract

Talk to us

Similar Papers

More From: Multimedia Tools and Applications