No routing needed between capsules

Adam Byerly, Tatiana Kalganova, Ian Dear

doi:10.1016/j.neucom.2021.08.064

Open Access

https://doi.org/10.1016/j.neucom.2021.08.064

Journal: Neurocomputing
Publication Date: Aug 19, 2021
Citations: 33
License type: cc-by

Affiliation: Brunel University London, Bradley University

Abstract

Most capsule network designs rely on traditional matrix multiplication between capsule layers and on computationally expensive routing mechanisms to deal with the capsule dimensional entanglement that the matrix multiplication introduces. By using Homogeneous Vector Capsules (HVCs), which use element-wise multiplication rather than matrix multiplication, the dimensions of the capsules remain unentangled. In this work, we study HVCs as applied to the highly structured MNIST dataset in order to produce a direct comparison to the capsule research direction of Geoffrey Hinton et al. In our study, we show that a simple convolutional neural network using HVCs performs as well as the prior best performing capsule network on MNIST using 5.5× fewer parameters, 4× fewer training epochs, no reconstruction sub-network, and no routing mechanism. The addition of multiple classification branches to the network establishes a new state of the art for the MNIST dataset with an accuracy of 99.87% for an ensemble of these models, as well as a new state of the art for a single model (99.83% accurate).
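The contrast the abstract draws between matrix multiplication and element-wise multiplication can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`matmul_capsule`, `hvc_capsule`, `hvc_layer`), the toy weights, and the choice to sum contributions over input capsules are assumptions made for illustration only. The sketch shows that, under element-wise multiplication, output dimension i depends only on input dimension i, whereas a weight matrix mixes every input dimension into every output dimension.

```python
# Illustrative sketch only; names, weights, and layer form are assumptions,
# not the authors' code.

def matmul_capsule(u, W):
    """Transform capsule u (length d) with a d x d weight matrix W.

    Each output dimension is a weighted sum over ALL input dimensions,
    so the dimensions of the capsule become entangled.
    """
    d = len(u)
    return [sum(W[i][j] * u[j] for j in range(d)) for i in range(d)]


def hvc_capsule(u, w):
    """Transform capsule u (length d) with a weight vector w (length d).

    Output dimension i depends only on input dimension i.
    """
    return [w_i * u_i for w_i, u_i in zip(w, u)]


def hvc_layer(capsules_in, weights):
    """Assumed layer form: weights[k][j] is the weight vector linking input
    capsule j to output capsule k; contributions are summed over j."""
    d = len(capsules_in[0])
    capsules_out = []
    for weights_k in weights:
        v = [0.0] * d
        for u, w in zip(capsules_in, weights_k):
            for i, x in enumerate(hvc_capsule(u, w)):
                v[i] += x
        capsules_out.append(v)
    return capsules_out


if __name__ == "__main__":
    u = [1.0, 2.0, 3.0]
    u_perturbed = [1.0, 2.0, 30.0]  # only the last dimension changes

    W = [[0.5, 0.1, 0.2],
         [0.3, 0.4, 0.1],
         [0.2, 0.2, 0.6]]
    w = [0.5, 0.4, 0.6]

    # Matrix multiplication: every output dimension shifts.
    print(matmul_capsule(u, W), matmul_capsule(u_perturbed, W))
    # Element-wise multiplication: only the last output dimension shifts.
    print(hvc_capsule(u, w), hvc_capsule(u_perturbed, w))
    # Weights needed per pair of d-dimensional capsules in this sketch.
    print("matrix:", len(W) * len(W[0]), "element-wise:", len(w))
```

In this sketch a pair of d-dimensional capsules needs d weights under element-wise multiplication and d × d under matrix multiplication. That per-pair count is a property of the toy example and should not be read as the source of the network-level 5.5× figure reported in the abstract.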

Highlights

Capsules have become a more active area of research since [1], which demonstrated near state of the art performance on MNIST [2] classification by using capsules and a routing algorithm to determine which capsules in a previous layer feed capsules in the subsequent layer
In [6], we proposed a capsule design that used element-wise multiplication between capsules in subsequent layers and relied on backpropagation to do the work that prior capsule designs were relying on routing mechanisms for
We proposed using a simple convolutional neural network and established design principles as a basis for a network architecture

Summary

Introduction

Capsules (vector-valued neurons) have become a more active area of research since [1], which demonstrated near state of the art performance on MNIST [2] classification (at 99.75%) by using capsules and a routing algorithm to determine which capsules in a previous layer feed capsules in the subsequent layer. In [6], we proposed a capsule design that used element-wise multiplication between capsules in subsequent layers and relied on backpropagation to do the work that prior capsule designs were relying on routing mechanisms for. We referred to this capsule design as homogeneous vector capsules (HVCs). We directly extend the work of [7,1] on capsules applied to MNIST by applying HVCs to MNIST. By using this capsule design, we avoid
