Adaptive Modular Convolutional Neural Network for Image Recognition.

Wenbo Wu,Yun Pan

doi:10.3390/s22155488

Wenbo Wu, Yun Pan

Open Access

PDF Available

https://doi.org/10.3390/s22155488

Copy DOI

Export

Save

Cite

Journal: Sensors (Basel, Switzerland)	Publication Date: Jul 22, 2022
Citations: 10	License type: CC BY 4.0

Affiliation: Communication University of China

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

Image recognition has long been one of the research hotspots in computer vision tasks. The development of deep learning is rapid in recent years, and convolutional neural networks usually need to be designed with fixed resources. If sufficient resources are available, the model can be scaled up to achieve higher accuracy, for example, VggNet, ResNet, GoogLeNet, etc. Although the accuracy of large-scale models has been improved, the following problems will occur with the expansion of model scale: (1) There may be over-fitting; (2) increasing model parameters; (3) slow model convergence. This paper proposes a design method for a modular convolutional neural network model which solves the problem of over-fitting and large model parameters by connecting multiple modules in parallel. Moreover, each module contains several submodules (three submodules in this paper) and fuses the features extracted from the submodules. The model convergence can be accelerated by using the fused features (the fused features contain more image information). In this study, we add a gate unit based on the attention mechanism to the model, which aims to optimize the structure of the model (select the optimal number of modules), allowing the model to select an optimum network structure by learning and dynamically reducing FLOPs (floating-point operations per second) of the model. Compared to VggNet, ResNet, and GoogLeNet, the structure of the model proposed in this paper is simple and the parameters are small. The proposed model achieves good results in the Kaggle datasets Cats-vs.-Dogs (99.3%), 10-Monkey Species (99.26%), and Birds-400 (99.13%).

Full Text