Why Capsule Neural Networks Do Not Scale: Challenging the Dynamic Parse-Tree Assumption

Matthias Mitterreiter, Sören Laue, Marcel Koch, Joachim Giesen

doi:10.1609/aaai.v37i8.26104

Abstract

Capsule neural networks replace simple, scalar-valued neurons with vector-valued capsules. They are motivated by the pattern recognition system in the human brain, where complex objects are decomposed into a hierarchy of simpler object parts. Such a hierarchy is referred to as a parse-tree. Conceptually, capsule neural networks have been defined to mimic this behavior. The capsule neural network (CapsNet) by Sabour, Frosst, and Hinton is the first actual implementation of the conceptual idea of capsule neural networks. CapsNets achieved state-of-the-art performance on simple image recognition tasks with fewer parameters and greater robustness to affine transformations than comparable approaches. This sparked extensive follow-up research. However, despite major efforts, no work was able to scale the CapsNet architecture to more reasonably sized datasets. Here, we provide a reason for this failure and argue that it is most likely not possible to scale CapsNets beyond toy examples. In particular, we show that the concept of a parse-tree, the main idea behind capsule neural networks, is not present in CapsNets. We also show theoretically and experimentally that CapsNets suffer from a vanishing gradient problem that results in the starvation of many capsules during training.
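
To make "vector-valued capsule" concrete, the following is a minimal sketch of the squash nonlinearity used in the CapsNet of Sabour, Frosst, and Hinton, which keeps the direction of a capsule's input vector and maps its length into the interval [0, 1). It is an illustration only and is not taken from this paper: the helper names `squash` and `length`, the `eps` guard against division by zero, and the sample vectors are all assumptions.

```python
import math


def squash(s, eps=1e-9):
    """Keep the direction of s and map its length into [0, 1).

    Computes (|s|^2 / (1 + |s|^2)) * (s / |s|); eps is an assumed guard
    that avoids dividing by zero for an all-zero input.
    """
    sq_norm = sum(x * x for x in s)
    norm = math.sqrt(sq_norm)
    scale = sq_norm / (1.0 + sq_norm) / (norm + eps)
    return [scale * x for x in s]


def length(v):
    """Euclidean length of a vector, read as the capsule's activation."""
    return math.sqrt(sum(x * x for x in v))


# Illustrative inputs: one long and one short capsule input vector.
long_capsule = squash([3.0, 4.0])      # input length 5
short_capsule = squash([0.03, 0.04])   # input length 0.05

print(length(long_capsule), length(short_capsule))
```

In this sketch a long input vector yields an output length close to 1, while a short input vector is shrunk to a length close to 0, so the output length can be read as how strongly the capsule is active.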

Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Jun 26, 2023
Citations: 2

Similar Papers

AN IMPROVED SEGMENTATION METHOD FOR BRAIN CANCER USING CAPSULE NEURAL NETWORKS
Kumar M ... Harsha B.K
ICTACT Journal on Image and Video Processing | VOL. 13
01 May 2023

Cognitive Consistency Routing Algorithm of Capsule-Network
Huayu Li ... Yihan Wang
24 Apr 2019

Comparison of deep learning-based models for detection of diseased trees using an image compression algorithm
Assiya Sarinova ... Yerassyl Omirtay
Eastern-European Journal of Enterprise Technologies | VOL. 5
30 Oct 2024

Local and interregional alpha EEG dynamics dissociate between memory for search and memory for recognition
Joram Van Driel ... Christian N.L Olivers
NeuroImage | VOL. 149
26 Jan 2017
