Improved Object Detection with Content and Position Separation in Transformer

Yao Wang,Jong-Eun Ha

doi:10.3390/rs16020353

Abstract

In object detection, Transformer-based models such as DETR have exhibited state-of-the-art performance, capitalizing on the attention mechanism to handle spatial relations and feature dependencies. One inherent challenge these models face is the intertwined handling of content and positional data within their attention spans, potentially blurring the specificity of the information retrieval process. We consider object detection as a comprehensive task, and simultaneously merging content and positional information like before can exacerbate task complexity. This paper presents the Multi-Task Fusion Detector (MTFD), a novel architecture that innovatively dissects the detection process into distinct tasks, addressing content and position through separate decoders. By utilizing assumed fake queries, the MTFD framework enables each decoder to operate under a presumption of known ancillary information, ensuring more specific and enriched interactions with the feature map. Experimental results affirm that this methodical separation followed by a deliberate fusion not only simplifies the task difficulty of the detection process but also augments accuracy and clarifies the details of each component, providing a fresh perspective on object detection in Transformer-based architectures.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Remote Sensing	Publication Date: Jan 16, 2024
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Improved Object Detection with Content and Position Separation in Transformer

Abstract

Talk to us

Similar Papers

More From: Remote Sensing

Lead the way for us

Similar Papers

Improve object detection via a multi-feature and multi-task CNN model
Yingxin Lou ... Zhuqing Jiang
-
Yingxin Lou, et. al.Yingxin Lou ... Zhuqing Jiang
01 Dec 2017
01 Dec 2017

Comparative study on object detection in visual scenes using deep learning
Kapil Kumar ... Kamal Kant Verma
World Journal of Advanced Engineering Technology and Sciences | VOL. 10
Kapil Kumar, et. al. Kapil Kumar ... Kamal Kant Verma
30 Nov 2023
World Journal of Advanced Engineering Technology and Sciences | VOL. 10

A Novel Method for Scene Modeling to Detect Unusual Activity
...
Cumhuriyet Science Journal | VOL. 36
, et. al. ...
01 Jan 2015
Cumhuriyet Science Journal | VOL. 36

Multi-Scale Spatial and Channel-wise Attention for Improving Object Detection in Remote Sensing Imagery
Jie Chen ... Min Deng
IEEE Geoscience and Remote Sensing Letters | VOL. 17
Jie Chen, et. al.Jie Chen ... Min Deng
29 Aug 2019
IEEE Geoscience and Remote Sensing Letters | VOL. 17

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improved Object Detection with Content and Position Separation in Transformer

Abstract

Talk to us

Similar Papers

More From: Remote Sensing