Transformer-Auxiliary Neural Networks for Image Manipulation Localization by Operator Inductions

Zenan Shi, Dong Zhang, Haipeng Chen

doi:10.1109/tcsvt.2023.3251444

Abstract

Image manipulation localization (IML), which seeks to accurately segment tampered regions that have been artfully embedded in a normal image, is a fundamental yet challenging computer vision task. Although recent deep learning methods have achieved impressive results, they usually fail to capture subtle manipulation artifacts at different object scales, and therefore cannot generate a segmentation mask with complete and fine object structures. Coarse boundaries are also a frequent problem. To this end, we propose a Transformer-Auxiliary, operator-induced neural Network (TANet) to localize forged regions for IML. Specifically, a stacked multi-scale transformer (SMT) branch is first introduced to complement the feature representations of the mainstream convolutional neural network branch. SMT can detect structured abnormalities of the input image at multiple levels by operating on patches of different sizes. TANet then explicitly uses an operator induction module (OIM) to extract valuable boundary semantics related to the manipulated region and to guide the representation learning of the mainstream branch. The OIM encourages the network to generate features that highlight object structure, thereby promoting precise boundary localization of forged regions. We conduct extensive experiments on various datasets and settings to validate the effectiveness of TANet. Results show that TANet outperforms state-of-the-art methods by a large margin under widely used evaluation metrics.
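The abstract states that the SMT branch operates on patches of different sizes. The sketch below illustrates only that general idea, splitting one image into non-overlapping patches at several scales with NumPy. It is not the authors' implementation: the function names `split_into_patches` and `multi_scale_tokens`, the patch sizes (8, 16, 32), and the choice to crop edges that do not fill a whole patch are all assumptions made for illustration.

```python
import numpy as np


def split_into_patches(image, patch_size):
    """Split an (H, W, C) image into non-overlapping square patches.

    Returns an array of shape (num_patches, patch_size * patch_size * C).
    Edges that do not fill a whole patch are cropped (an assumption here).
    """
    h, w, c = image.shape
    rows, cols = h // patch_size, w // patch_size
    cropped = image[: rows * patch_size, : cols * patch_size]
    # Axes after reshape: (patch row, y in patch, patch col, x in patch, channel)
    patches = cropped.reshape(rows, patch_size, cols, patch_size, c)
    # Bring the two patch-index axes to the front, then flatten each patch.
    patches = patches.transpose(0, 2, 1, 3, 4)
    return patches.reshape(rows * cols, patch_size * patch_size * c)


def multi_scale_tokens(image, patch_sizes=(8, 16, 32)):
    """Return one flattened patch sequence per (hypothetical) patch size."""
    return {p: split_into_patches(image, p) for p in patch_sizes}


# Example input: a blank 64x64 RGB image.
image = np.zeros((64, 64, 3), dtype=np.float32)
tokens = multi_scale_tokens(image)
for size, seq in tokens.items():
    print(size, seq.shape)
```

Smaller patches give longer sequences of fine-grained tokens, while larger patches give shorter sequences covering wider context. Each such sequence could, in principle, be fed to a separate transformer stage.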

Journal: IEEE transactions on circuits and systems for video technology : a publication of the Circuits and Systems Society
Publication Date: Sep 1, 2023
Citations: 5

Similar Papers

Lung tumor segmentation in 4D CT images using motion convolutional neural networks.
Shadab Momin ... Zhen Tian
Medical physics | VOL. 48
13 Sep 2021

Autonomous UAV Safety by Visual Human Crowd Detection Using Multi-Task Deep Neural Networks
Christos Papaioannidis ... Ioannis Mademlis
30 May 2021

Satellite Image Super-Resolution via Multi-Scale Residual Deep Neural Network
Tao Lu ... Junjun Jiang
Remote sensing | VOL. 11
04 Jul 2019

Objective picture quality scale by neural networks (PQS‐NN)
Kazunori Kotani ... Makoto Miyahara
Electronics and Communications in Japan (Part I: Communications) | VOL. 74
01 Feb 1991
