Improving the Performance of Convolutional Neural Networks by Fusing Low-Level Features With Different Scales in the Preceding Stage

Xiaohong Yu,Lin Gao,Wei Long,Yanyan Li,Xiaoqiu Shi

doi:10.1109/access.2021.3077070

Abstract

The width of convolutional neural networks (CNNs) is crucial for improving performance. Many wide CNNs use a convolutional layer to fuse multiscale features or fuse the preceding features to subsequent features. However, these CNNs rarely use blocks, which consist of a series of successive convolutional layers, to fuse multiscale features. In this paper, we propose an approach for improving performance by fusing the low-level features extracted from different blocks. We utilize five different convolutions, including 3×3, 5×5, 7×7,5×3 ∪ 3×5 and 7×3 ∪ 3×7, to generate five low-level features, and we design two fusion strategies: low-level feature fusion (L-Fusion) and high-level feature fusion (H-Fusion). Experimental results show that the L-Fusion is more helpful for improving the performance of CNNs, and the 5×5 convolution is more suitable for multiscale feature fusion. We summarize the conclusion as a strategy that fuses multiscale features in the preceding stage of CNNs. Furthermore, we propose a new architecture to perceive the input of CNNs by using two self-governed blocks based on the strategy. Finally, we modify five off-the-shelf networks, DenseNet-BC (depth = 40), ALL-CNN-C (depth = 9), Darknet 19 (depth = 19), Resnet 18 (depth = 18) and Resnet 50 (depth = 50), by utilizing the proposed architecture to verify the conclusion, and these updated networks provide more competitive results.

Highlights

CNNs [1] were first presented in 1989, and they have demonstrated excellent performance in many visual tasks such as semantic segmentation [2], [3], image classification [4], and object detection [5], [6]
One of our purposes of this paper is to study the advantage of multiscale feature fusion, but we seek to answer whether large-scale feature or multiscale feature fusion increases performance more
In this paper, we divide a convolutional neural networks (CNNs) into different blocks according to the size of the features to obtain low-level and high-level features for feature fusion

Summary

Introduction

Ns (convolutional neural networks) [1] were first presented in 1989, and they have demonstrated excellent performance in many visual tasks such as semantic segmentation [2], [3], image classification [4], and object detection [5], [6]. As hardware has developed, the performance of CNNs has increased dramatically due to the higher computational capacity of the hardware. Some classic models have validated that the depth of a CNN is pivotal for its performance [11], [4]. Many visual recognition tasks have benefitted from very deep networks [12], [13]. A considerably deeper network achieves better results than a shallower network, and we can obtain a higher-quality model by increasing the depth.

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2021
Citations: 3	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Improving the Performance of Convolutional Neural Networks by Fusing Low-Level Features With Different Scales in the Preceding Stage

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

A Deeply Supervised Convolutional Neural Network for Pavement Crack Detection With Multiscale Feature Fusion.
Zhong Qu ... Dong-Yang Zhou
IEEE transactions on neural networks and learning systems | VOL. 33
Zhong Qu, et. al.Zhong Qu ... Dong-Yang Zhou
15 Mar 2021
IEEE transactions on neural networks and learning systems | VOL. 33

Deep Convolutional Neural Network with Feature Fusion for Image Super-Resolution
Furui Bai ... Wen Lu
-
Furui Bai, et. al.Furui Bai ... Wen Lu
01 Jan 2018
01 Jan 2018

Low - resolution vehicle recognition based on deep feature fusion
Lixia Xue ... Ronggui Wang
Multimedia Tools and Applications | VOL. 77
Lixia Xue, et. al.Lixia Xue ... Ronggui Wang
04 May 2018
Multimedia Tools and Applications | VOL. 77

Enhancing feature fusion for human pose estimation
Rui Wang ... Jiangwei Tong
Machine Vision and Applications | VOL. 31
Rui Wang, et. al.Rui Wang ... Jiangwei Tong
24 Sep 2020
Machine Vision and Applications | VOL. 31

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improving the Performance of Convolutional Neural Networks by Fusing Low-Level Features With Different Scales in the Preceding Stage

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access