Abstract

Hypercomplex convolutional neural networks (HCNNs) have recently been used to improve deep learning architectures because of their ability to share weights across input channels, which improves the cohesiveness of the learned representations within the layers. Much work has been done to study the effect of hypercomplex representations in CNNs. However, to date, none of these models have used fully hypercomplex architectures, meaning that some of the layers in these earlier networks used conventional real-valued calculations that did not engage cross-channel weight sharing. Here, we introduce and study fully hypercomplex CNNs by ensuring that all layers perform hypercomplex calculations. In the earlier HCNNs, the real-valued layers are found (I) in the front end of the networks, (II) in the back end of the networks, and (III) in the residual blocks of the networks. The present research examines the performance of HCNNs when the modules mentioned above are replaced with hypercomplex equivalents. These representational networks outperform previous hypercomplex models and achieve state-of-the-art results on some specific datasets. A disadvantage of these representational networks is their high computational cost. To reduce this cost, novel separable hypercomplex networks (SHNNs) are proposed, created by factoring a quaternion convolutional module into two consecutive separable vectormap convolutional modules. Because the successive layers of deep HCNNs perform local hierarchical grouping on increasingly abstract features, these groupings are responsible for the long-distance interaction problems found in the dense layers. To handle these problems, researchers have applied attention mechanisms in hypercomplex space. This paper offers a perspective on the basic concepts of representational networks and attention-based representational networks, with a focus on (1) reviewing deep fully hypercomplex CNNs, including hypercomplex-based fully connected dense layers; (2) analyzing the hypercomplex-based separability concept; (3) describing how the hypercomplex spatial layer of the residual bottleneck block is replaced with an attention layer; and (4) providing a comprehensive summary comparing the performance of recent deep HCNNs, deep convolutional neural networks, and original attention-based networks. We compare the performance of these networks in terms of accuracy and number of trainable parameters on several image classification datasets.
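
To make the cross-channel weight sharing concrete, the sketch below shows a minimal quaternion convolution in PyTorch. This is a hypothetical illustration of the standard Hamilton-product construction, not the authors' implementation: the four kernel banks w_r, w_i, w_j, w_k are reused across all four quaternion components of the output.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class QuaternionConv2d(nn.Module):
    """Minimal quaternion convolution sketch. The four kernel banks
    (w_r, w_i, w_j, w_k) are shared across all four quaternion
    components via the Hamilton product, which is the cross-channel
    weight sharing referred to in the abstract."""

    def __init__(self, in_channels, out_channels, kernel_size, padding=0):
        super().__init__()
        assert in_channels % 4 == 0 and out_channels % 4 == 0
        shape = (out_channels // 4, in_channels // 4, kernel_size, kernel_size)
        # Naive init for illustration; real implementations use a
        # quaternion-aware initialization scheme.
        self.w_r = nn.Parameter(torch.randn(shape) * 0.02)
        self.w_i = nn.Parameter(torch.randn(shape) * 0.02)
        self.w_j = nn.Parameter(torch.randn(shape) * 0.02)
        self.w_k = nn.Parameter(torch.randn(shape) * 0.02)
        self.padding = padding

    def forward(self, x):
        # Split the channel axis into the four quaternion components.
        r, i, j, k = torch.chunk(x, 4, dim=1)
        conv = lambda t, w: F.conv2d(t, w, padding=self.padding)
        # Hamilton product: every kernel bank touches every input component.
        out_r = conv(r, self.w_r) - conv(i, self.w_i) - conv(j, self.w_j) - conv(k, self.w_k)
        out_i = conv(r, self.w_i) + conv(i, self.w_r) + conv(k, self.w_j) - conv(j, self.w_k)
        out_j = conv(r, self.w_j) + conv(j, self.w_r) + conv(i, self.w_k) - conv(k, self.w_i)
        out_k = conv(r, self.w_k) + conv(k, self.w_r) + conv(j, self.w_i) - conv(i, self.w_j)
        return torch.cat([out_r, out_i, out_j, out_k], dim=1)

# Hypothetical usage: 16 input channels = 4 quaternion components x 4 channels each.
layer = QuaternionConv2d(16, 32, kernel_size=3, padding=1)
y = layer(torch.randn(2, 16, 28, 28))  # -> torch.Size([2, 32, 28, 28])
```

Because the four banks are shared, this layer holds 4 x (8 x 4 x 3 x 3) = 1,152 weights versus 16 x 32 x 3 x 3 = 4,608 for a real-valued convolution with the same channel counts, a 4x reduction. The SHNNs described above aim to reduce cost further by factoring such a module into two consecutive separable vectormap convolutional modules.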
