Abstract

Object detection based on a single-level feature map is challenging because of the limited feature scale, so enriching the multiscale information of single-level features is a promising way to address this challenge. Although most existing methods attempt to augment the scale range of single-level features, detection performance remains unsatisfactory because these methods mine multiscale features from only one level of the feature hierarchy. To address this problem, we propose a multiple-in-single-out network (MiSoNet) that integrates multiscale information from multilevel feature maps into a single-level feature map. To achieve this, MiSoNet's key component comprises two cascaded modules: a multilevel feature integration module (MFIM) and a depthwise convolutional residual encoder (DWEncoder). Specifically, MFIM adaptively fuses features of inconsistent semantics and scales from multilevel feature maps. DWEncoder stacks several residual blocks with depthwise convolutions to extract multiscale contexts from the single feature map, further extending the scale range of the receptive fields. Extensive experiments are conducted on the Common Objects in Context (COCO) dataset, where MiSoNet achieves 41.0 AP, surpassing YOLOF by 1.4 AP with negligible computational overhead. Moreover, with fewer parameters and FLOPs, MiSoNet outperforms several advanced detectors based on the feature pyramid network.
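To make the DWEncoder idea more concrete, the following is a minimal sketch of a residual block built around a dilated depthwise convolution, with several such blocks stacked to widen the receptive-field range on a single-level feature map. The class name, layer ordering, channel width, and dilation choices here are illustrative assumptions, not the exact configuration described in the paper.

```python
import torch
import torch.nn as nn

class DWResidualBlock(nn.Module):
    """Hypothetical residual block using a depthwise convolution.

    A dilated depthwise conv enlarges the receptive field at low cost;
    stacking blocks with different dilations covers multiple scales.
    The actual DWEncoder structure in MiSoNet may differ.
    """

    def __init__(self, channels: int, dilation: int = 1):
        super().__init__()
        self.block = nn.Sequential(
            # depthwise 3x3 conv: groups == channels
            nn.Conv2d(channels, channels, kernel_size=3,
                      padding=dilation, dilation=dilation,
                      groups=channels, bias=False),
            nn.BatchNorm2d(channels),
            nn.ReLU(inplace=True),
            # pointwise 1x1 conv mixes information across channels
            nn.Conv2d(channels, channels, kernel_size=1, bias=False),
            nn.BatchNorm2d(channels),
        )
        self.act = nn.ReLU(inplace=True)

    def forward(self, x):
        # residual connection keeps the original single-level features
        return self.act(x + self.block(x))


# Toy encoder: stack blocks with growing dilations so the single-level
# feature map accumulates contexts at multiple receptive-field scales.
encoder = nn.Sequential(*[DWResidualBlock(256, d) for d in (1, 2, 4, 8)])
feat = torch.randn(1, 256, 32, 32)   # e.g. a fused single-level feature map
out = encoder(feat)                  # same shape, richer multiscale context
```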
