Bridging Multi-Scale Context-Aware Representation for Object Detection

Boying Wang,Libo Zhang,Ruyi Ji,Yanjun Wu

doi:10.1109/tcsvt.2022.3221755

Abstract

Feature Pyramid Network (FPN) exploits multi-scale fusion representation to deal with scale variances in object detection. However, it ignores the context information gap across different levels. In this paper, we develop a plug-and-play detector, the multi-scale context-aware feature pyramid network to unleash the power of feature pyramid representation. Based on the dilated feature map at the highest level of the backbone, we propose the cross-scale context aggregation block to make full use of context information in the feature pyramid. Moreover, we extract discriminative features among different levels by the adaptive context aggregation block for robust object detection. Comprehensive experiments on MS-COCO demonstrate the effectiveness and efficiency of the proposed network, where about 1.0 ~ 3.0 AP improvements are achieved compared with existing FPN-based methods. In addition, we also conduct extensive experiments on pixel-level prediction tasks, <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">i.e</i> ., instance segmentation, semantic segmentation, and panoptic segmentation, which further verify the effectiveness of the proposed method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Bridging Multi-Scale Context-Aware Representation for Object Detection

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology

Lead the way for us

Journal: IEEE Transactions on Circuits and Systems for Video Technology	Publication Date: May 1, 2023
Citations: 11

Similar Papers

Tripartite Feature Enhanced Pyramid Network for Dense Prediction.
Dongfang Liu ... James Liang
IEEE Transactions on Image Processing | VOL. 32
Dongfang Liu, et. al.Dongfang Liu ... James Liang
01 Jan 2023
IEEE Transactions on Image Processing | VOL. 32

Multi-task Network for Panoptic Segmentation in Automated Driving
Andra Petrovai ... Sergiu Nedevschi
-
Andra Petrovai, et. al.Andra Petrovai ... Sergiu Nedevschi
01 Oct 2019
01 Oct 2019

A2-FPN: Attention Aggregation based Feature Pyramid Network for Instance Segmentation
Miao Hu ... Lu Fang
-
Miao Hu, et. al.Miao Hu ... Lu Fang
01 Jun 2021
01 Jun 2021

Learning panoptic segmentation through feature discriminability
Tao Chu ... Qiong Liu
Pattern Recognition | VOL. 122
Tao Chu, et. al.Tao Chu ... Qiong Liu
18 Aug 2021
Pattern Recognition | VOL. 122

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Bridging Multi-Scale Context-Aware Representation for Object Detection

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Circuits and Systems for Video Technology