Generation and detection of manipulated multimodal audiovisual content: Advances, trends and open challenges

Helena Liz-López,Mamadou Keita,Abdelmalik Taleb-Ahmed,Abdenour Hadid,Javier Huertas-Tato,David Camacho

doi:10.1016/j.inffus.2023.102103

Abstract

Generative deep learning techniques have invaded the public discourse recently. Despite the advantages, the applications to disinformation are concerning as the counter-measures advance slowly. As the manipulation of multimedia content becomes easier, faster, and more credible, developing effective forensics becomes invaluable. Other works have identified this need but neglect that disinformation is inherently multimodal. Overall in this survey, we exhaustively describe modern manipulation and forensic techniques from the lens of video, audio and their multimodal fusion. For manipulation techniques, we give a classification of the most commonly applied manipulations. Generative techniques can be exploited to generate datasets; we provide a list of current datasets useful for forensics. We have reviewed forensic techniques from 2018 to 2023, examined the usage of datasets, and given a comparative analysis of each modality. Finally, we give another comparison of end-to-end forensics tools for end-users. From our analysis clear trends are found with diffusion models, dataset granularity, explainability techniques, synchronisation improvements, and learning task diversity. We find a roadmap of deep challenges ahead, including multilinguality, multimodality, improving data quality (and variety), all in an adversarial ever-changing environment.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Generation and detection of manipulated multimodal audiovisual content: Advances, trends and open challenges

Abstract

Talk to us

Similar Papers

More From: Information Fusion

Lead the way for us

Journal: Information Fusion	Publication Date: Nov 2, 2023
Citations: 8

Similar Papers

EnsembleDet: ensembling against adversarial attack on deepfake detection
Himanshu Dutta ... Aditya Pandey
Journal of Electronic Imaging | VOL. 30
Himanshu Dutta, et. al.Himanshu Dutta ... Aditya Pandey
28 Dec 2022
Journal of Electronic Imaging | VOL. 30

Development and validation of a multi-modality fusion deep learning model for differentiating glioblastoma from solitary brain metastases.
Chunquan Li ... Ziye Yan
Zhong nan da xue xue bao. Yi xue ban = Journal of Central South University. Medical sciences | VOL. 49
Chunquan Li, et. al.Chunquan Li ... Ziye Yan
28 Jan 2024
Zhong nan da xue xue bao. Yi xue ban = Journal of Central South University. Medical sciences | VOL. 49

3D Character Animation and Asset Generation Using Deep Learning
Vlad-Constantin Lungu-Stan ... Irina Georgiana Mocanu
Applied Sciences | VOL. 14
Vlad-Constantin Lungu-Stan, et. al.Vlad-Constantin Lungu-Stan ... Irina Georgiana Mocanu
16 Aug 2024
Applied Sciences | VOL. 14

An Investigation of Leap Motion Based 3D Manipulation Techniques for Use in Egocentric Viewpoint
Giuseppe Caggianese ... Pietro Neroni
-
Giuseppe Caggianese, et. al.Giuseppe Caggianese ... Pietro Neroni
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Generation and detection of manipulated multimodal audiovisual content: Advances, trends and open challenges

Abstract

Talk to us

Similar Papers

More From: Information Fusion