Object Detection Using Improved Bi-Directional Feature Pyramid Network

Tran Ngoc Quang, Seunghyun Lee, Byung Cheol Song

doi:10.3390/electronics10060746

Open Access

https://doi.org/10.3390/electronics10060746

Journal: Electronics
Publication Date: Mar 22, 2021
Citations: 10
License type: CC BY 4.0

Affiliation: Inha University

Abstract

Conventional single-stage object detectors use a feature pyramid network to detect objects of various sizes efficiently. However, because they aggregate feature maps in an overly simple manner, they suffer performance degradation due to information loss. To solve this problem, this paper proposes a new framework for single-stage object detection. The proposed aggregation scheme introduces two independent modules that extract global and local information. First, the global information extractor uses a non-local neural network (NLNN) so that each feature vector reflects information from the entire image. Next, the local information extractor aggregates the feature maps more effectively through an improved bi-directional network. By providing improved feature maps to the detection heads, the proposed method achieves better performance than existing single-stage object detection methods. For example, it achieves 1.6% higher average precision (AP) than the efficient featurized image pyramid network (EFIPNet) on the Microsoft Common Objects in Context (MS COCO) dataset.
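As a rough illustration of how a non-local operation lets each feature vector reflect information from the whole image, the following minimal sketch may help. It is not the authors' implementation: it assumes a feature map already flattened into a list of per-position vectors, uses plain dot-product similarity with a softmax, and omits the learned embedding and output projections that a non-local block normally includes. The names `softmax`, `non_local`, `feature_map`, and `enhanced` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def non_local(features):
    """Simplified non-local operation (assumption: no learned projections).

    features: list of N feature vectors, each a list of C floats,
              i.e. an H*W feature map flattened over spatial positions.
    Returns a list of N vectors in which every position has been
    augmented with a similarity-weighted sum over all positions.
    """
    out = []
    for x_i in features:
        # Similarity between position i and every position j.
        scores = [sum(a * b for a, b in zip(x_i, x_j)) for x_j in features]
        weights = softmax(scores)
        # Aggregate information from all positions, channel by channel.
        agg = [
            sum(w * x_j[c] for w, x_j in zip(weights, features))
            for c in range(len(x_i))
        ]
        # Residual connection keeps the original local feature.
        out.append([a + b for a, b in zip(x_i, agg)])
    return out


# Toy feature map with three spatial positions and two channels.
feature_map = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
enhanced = non_local(feature_map)
```

Each output vector depends on every input position, which is the property the global information extractor relies on; the cost grows quadratically with the number of positions.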

Full Text