Abstract

Many convolutional neural networks have been proposed for image classification in recent years. Most tend to decrease the plane size of feature maps stage-by-stage, such that the feature maps generated within each stage show the same plane size. This concept governs the design of most classification networks. However, it can also lead to semantic deficiency of high-resolution feature maps as they are always placed in the shallow layers of a network. Here, we propose a novel network architecture, named ScaleNet, which consists of stacked convolution-deconvolution blocks and a multipath residual structure. Unlike most current networks, ScaleNet extracts image features by a cascaded deconstruction-reconstruction process. It can generate scale-variable feature maps within each block and stage, thereby realizing multiscale feature extraction at any depth of the network. Based on the CIFAR-10, CIFAR-100, and ImageNet datasets, ScaleNet demonstrated competitive classification performance compared to state-of-the-art ResNet. In addition, ScaleNet exhibited a powerful ability to capture strong semantic and fine-grained features on its high-resolution feature maps. The code is available at https://github.com/zhjpqq/scalenet.

Highlights

  • With the development of deep convolutional neural networks, image classification has achieved considerable progress in recent years

  • We propose a multipath residual structure to replace the original single-path residual structure proposed in ResNet [9], which can further improve network feature forward-propagation and gradient back-propagation abilities

  • For the CIFAR-100 dataset, our deep ScaleNet with a substantial parameter reduction of 30% (1.2M parameters) performs better (24.13%) than ResNet (27.22%, 1.7M), SD-ResNet (24.58%, 1.7M), pre-ResNet (24.33%,1.7M), Exponential Linear Unit (ELU)-ResNet (26.55%, 1.7M), Parametric Exponential Linear Unit (PELU)-ResNet (25.04%, 1.7M), and FitResNet (27.66%, 2.5M)

Read more

Summary

Introduction

With the development of deep convolutional neural networks, image classification has achieved considerable progress in recent years. Different from ResNet [9] and DenseNet [10], in which feature size remains unchanged within each stage, ScaleNet can generate scale-variable

Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call