Content‐augmented feature pyramid network with light linear spatial transformers for object detection

Yongxiang Gu,Xiaolin Qin,Yuncong Peng,Lu Li

doi:10.1049/ipr2.12575

Abstract

As one of the prevalent components, feature pyramid network (FPN) is widely used in current object detection models for improving multi-scale object detection performance. However, its feature fusion mode is still in a misaligned and local manner, thus limiting the representation power. To address the inherited defects of FPN, a novel architecture termed content-augmented feature pyramid network (CA-FPN) is proposed in this paper. Firstly, a global content extraction module (GCEM) is proposed to extract multi-scale context information. Secondly, lightweight linear spatial Transformer connections are added in the top-down pathway to augment each feature map with multi-scale features, where a linearized approximate self-attention function is designed for reducing model complexity. By means of the self-attention mechanism in Transformer, it is no longer needed to align feature maps during feature fusion, thus solving the misaligned defect. By setting the query scope to the entire feature map, the local defect can also be solved. Extensive experiments on COCO and PASCAL VOC datasets demonstrated that the CA-FPN outperforms other FPN-based detectors without bells and whistles and is robust in different settings.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IET Image Processing	Publication Date: Jul 5, 2022
Citations: 4	License type: CC BY-NC 4.0

R Discovery Prime

R Discovery Prime

Content‐augmented feature pyramid network with light linear spatial transformers for object detection

Abstract

Talk to us

Similar Papers

More From: IET Image Processing

Lead the way for us

Similar Papers

Multi-Scale Residual Aggregation Feature Pyramid Network for Object Detection
Hongyang Wang ... Tiejun Wang
Electronics | VOL. 12
Hongyang Wang, et. al.Hongyang Wang ... Tiejun Wang
26 Dec 2022
Electronics | VOL. 12

Adaptive multiscale feature for object detection
Xiaoyong Yu ... Guilong Gao
Neurocomputing | VOL. 449
Xiaoyong Yu, et. al.Xiaoyong Yu ... Guilong Gao
06 Apr 2021
Neurocomputing | VOL. 449

TFPN: Twin Feature Pyramid Networks for Object Detection
Yi Liang ... Huang Zhen
-
Yi Liang, et. al.Yi Liang ... Huang Zhen
01 Nov 2019
01 Nov 2019

Adaptive learning feature pyramid for object detection
Fukoeng Wong ... Haifeng Hu
IET Computer Vision | VOL. 13
Fukoeng Wong, et. al.Fukoeng Wong ... Haifeng Hu
01 Dec 2019
IET Computer Vision | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Content‐augmented feature pyramid network with light linear spatial transformers for object detection

Abstract

Talk to us

Similar Papers

More From: IET Image Processing