Lightweight Attention Pyramid Network for Object Detection and Instance Segmentation

Jiwei Zhang,Yanyu Yan,Zelei Cheng,Wendong Wang

doi:10.3390/app10030883

Abstract

Feature pyramids of convolutional neural networks (ConvNets)—from bottom to top—are used by most recent researchers for the improvement of object detection accuracy, but they seldom aim to address the correlation of each feature channel and the fusion of low-level features and high-level features. In this paper, an Attention Pyramid Network (APN) is proposed, which mainly contains the adaptive transformation module and feature attention block. The adaptive transformation module utilizes the multiscale feature fusion, and makes full use of the accurate target location information of low-level features and the semantic information of high-level features. Then, the feature attention block strengthens the features of important channels and weakens the features of unimportant channels through learning. By implementing the APN in a basic Mask R-CNN system, our method achieves state-of-the-art results on the MS COCO dataset and 2018 WAD database without bells and whistles. In addition, the structure of the APN makes the network parameters lighter, and runs at 4 ms on average, which is ignorable when compared to the inference time of the backbone of ConvNet.

Highlights

Along with the popularization of the artificial intelligence systems [1,2], IOT [3] and the accumulation of image data [4], automatic object detection is increasingly being widely used in video surveillance and robot vision
The performance of the proposed Attention Pyramid Network (APN) was evaluated via extensive simulations using the MS COCO and the WAD datasets; the results show the effectiveness of our approach
We comprehensively evaluated the APN using the MS COCO dataset and the 2018 WAD dataset, and our results outperform the baseline, i.e., the original feature pyramid networks (FPN)

Summary

Introduction

Along with the popularization of the artificial intelligence systems [1,2], IOT [3] and the accumulation of image data [4], automatic object detection is increasingly being widely used in video surveillance and robot vision. Object detection is a fundamental computer-vision task [5], and the existence of multiple scales and ratios is the most challenging problem in object detection. More and more attention is being paid to this problem, and various detection methods have emerged. In the image pyramid methods [6,7], as shown, images are generally resized to multiple scales and resized to the same ratio for training and inference. Because of the large number of images, the methods are computationally expensive

Objectives

Methods

Results

Conclusion

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Sciences	Publication Date: Jan 28, 2020
Citations: 15	License type: CC BY 4.0

R Discovery Prime

Lightweight Attention Pyramid Network for Object Detection and Instance Segmentation

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

PENet: Pre-Enhanced Network for Object Detection and Instance Segmentation
Yunda Shi ... Li Li
-
Yunda Shi, et. al.Yunda Shi ... Li Li
24 Feb 2023
24 Feb 2023

MFEFNet: A Multi-Scale Feature Information Extraction and Fusion Network for Multi-Scale Object Detection in UAV Aerial Images
Liming Zhou ... Xianyu Zuo
Drones | VOL. 8
Liming Zhou, et. al.Liming Zhou ... Xianyu Zuo
08 May 2024
Drones | VOL. 8

HQ-ISNet: High-Quality Instance Segmentation for Remote Sensing Imagery
Hao Su ... Xiaoling Zhang
Remote Sensing | VOL. 12
Hao Su, et. al.Hao Su ... Xiaoling Zhang
19 Mar 2020
Remote Sensing | VOL. 12

MFIL-FCOS: A Multi-Scale Fusion and Interactive Learning Method for 2D Object Detection and Remote Sensing Image Detection
Guoqing Zhang ... Ruixia Hou
Remote Sensing | VOL. 16
Guoqing Zhang, et. al.Guoqing Zhang ... Ruixia Hou
07 Mar 2024
Remote Sensing | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Lightweight Attention Pyramid Network for Object Detection and Instance Segmentation

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: Applied Sciences