SVHN Datasets Research Articles

Conventional applications of convolutional neural networks (CNNs) to image classification and recognition assume that all target classes are equal (i.e. no hierarchy) and mutually exclusive (i.e. no overlap). CNN-based image classifiers built on this assumption therefore cannot take into account an innate hierarchy among target classes (e.g. cats and dogs in animal image classification) or additional information that can be easily derived from the data (e.g. numbers larger than five in the recognition of handwritten digits), which results in scalability issues when the number of target classes is large. Combining two related but slightly different ideas, hierarchical classification and logical learning by auxiliary inputs, we propose a new learning framework called hierarchical auxiliary learning, which not only addresses the scalability issues with a large number of classes but can also further reduce classification/recognition errors with a reasonable number of classes. In hierarchical auxiliary learning, target classes are semantically or non-semantically grouped into superclasses, which turns the original problem of mapping an image to its target class into a new problem of mapping an (image, superclass) pair to the target class. To take advantage of the superclass as a hint during the learning phase, we introduce an auxiliary block into a neural network, which generates auxiliary scores used as additional information for the final classification/recognition; in this paper, we add the auxiliary block between the last residual block and the fully-connected output layer of the ResNet. Experimental results show that the proposed hierarchical auxiliary learning reduces classification errors by up to 0.56, 1.6 and 3.56 percent on the MNIST, SVHN and CIFAR-10 datasets, respectively.
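As a rough illustration of the superclass grouping described in this abstract, the following minimal Python sketch groups the ten digit classes by the abstract's own example ("numbers larger than five") and rewrites (image, target) samples as ((image, superclass), target) pairs. The function names, the threshold parameter and the placeholder image strings are assumptions made for illustration; the sketch covers only the data regrouping, not the auxiliary block or the ResNet, whose details the abstract does not give.

```python
from typing import Dict, List, Tuple


def build_superclass_map(classes: List[int], threshold: int = 5) -> Dict[int, int]:
    """Hypothetical non-semantic grouping: superclass 1 if digit > threshold, else 0."""
    return {c: int(c > threshold) for c in classes}


def attach_superclass(
    samples: List[Tuple[str, int]], superclass_of: Dict[int, int]
) -> List[Tuple[Tuple[str, int], int]]:
    """Turn (image, target) pairs into ((image, superclass), target) pairs."""
    return [((image, superclass_of[target]), target) for image, target in samples]


digit_classes = list(range(10))
superclass_of = build_superclass_map(digit_classes)

# Placeholder strings stand in for image arrays in this sketch.
samples = [("img_a", 3), ("img_b", 7)]
paired = attach_superclass(samples, superclass_of)
```

Here the superclass is derived from the label, so it acts as a hint available during the learning phase, which matches the role the abstract assigns to it.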

Very deep networks are successful in various tasks, with reported results surpassing human performance. However, training such very deep networks is not trivial. Typically, the problems of learning the identity function and of feature reuse combine to hamper the optimization of very deep networks. In this paper, we propose a highway network with gate constraints that addresses these problems and thus alleviates the difficulty of training. Namely, we propose two variants of the highway network, HWGC and HWCC, employing feature summation and concatenation, respectively. The proposed highway networks, besides being more computationally efficient, are shown to have more interesting learning characteristics than the original highway network, such as natural learning of hierarchical and robust representations due to more effective use of model depth, fewer gates for successful learning, better generalization capacity and faster convergence. Experimental results show that our models outperform the original highway network and many state-of-the-art models. Importantly, we observe that our second model, with feature concatenation and compression, consistently outperforms our model with feature summation of similar depth, the original highway network, many state-of-the-art models and even ResNets on the CIFAR-10, CIFAR-100, Fashion-MNIST, SVHN and ImageNet-2012 (ILSVRC) benchmark datasets. Furthermore, the second proposed model is more computationally efficient than the state of the art in terms of training time, inference time and GPU memory, which strongly supports real-time applications. Using a similar number of model parameters on the CIFAR-10, CIFAR-100, Fashion-MNIST and SVHN datasets, the significantly shallower proposed model can surpass the performance of ResNet-110 and ResNet-164, which are roughly 6 and 8 times deeper, respectively. Similarly, on the ImageNet dataset, the proposed models surpass the performance of ResNet-101 and ResNet-152, which are roughly three times deeper.
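For readers unfamiliar with the baseline, the sketch below shows a single layer of the original highway network, in which a sigmoid transform gate T mixes a nonlinear transform H(x) with the unchanged input x: y = T(x) * H(x) + (1 - T(x)) * x. It is not the gate-constrained HWGC or HWCC variant proposed in the abstract, whose details are not given here. The tanh nonlinearity, the pure-Python list representation and all function and variable names are assumptions chosen to keep the example self-contained.

```python
import math
from typing import List

Vector = List[float]
Matrix = List[List[float]]


def sigmoid(z: float) -> float:
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def affine(weights: Matrix, bias: Vector, x: Vector) -> Vector:
    """Compute weights @ x + bias for plain Python lists."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def highway_layer(x: Vector, w_h: Matrix, b_h: Vector, w_t: Matrix, b_t: Vector) -> Vector:
    """Original highway layer: y = T(x) * H(x) + (1 - T(x)) * x."""
    h = [math.tanh(v) for v in affine(w_h, b_h, x)]  # transform branch H(x)
    t = [sigmoid(v) for v in affine(w_t, b_t, x)]    # transform gate T(x)
    return [ti * hi + (1.0 - ti) * xi for ti, hi, xi in zip(t, h, x)]


x = [0.5, -1.0]
identity = [[1.0, 0.0], [0.0, 1.0]]
zeros = [[0.0, 0.0], [0.0, 0.0]]

# A strongly negative gate bias closes the gate, so the layer passes x through.
carried = highway_layer(x, identity, [0.0, 0.0], zeros, [-50.0, -50.0])
# A strongly positive gate bias opens the gate, so the layer outputs H(x).
transformed = highway_layer(x, identity, [0.0, 0.0], zeros, [50.0, 50.0])
```

With the gate closed the layer behaves as the identity function, which is the behaviour the abstract describes as hard for very deep plain networks to learn.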

Articles published on SVHN Datasets

EncoDeep

Hierarchical Auxiliary Learning

Deep network compression with teacher latent subspace learning and LASSO

Incremental Concept Learning via Online Generative Memory Recall.

Structured feature sparsity training for convolutional neural network compression

Highlight Every Step: Knowledge Distillation via Collaborative Teaching

Controlling information capacity of binary neural network

Sketch-then-Edit Generative Adversarial Network

Progressive Learning of Low-Precision Networks for Image Classification

Utilizing Amari-Alpha Divergence to Stabilize the Training of Generative Adversarial Networks.

Enabling Spike-Based Backpropagation for Training Deep Neural Network Architectures.

Optimized training and scalable implementation of Conditional Deep Neural Networks with early exits for Fog-supported IoT applications

TDP: Two-dimensional perceptron for image recognition

Deterministic conversion rule for CNNs to efficient spiking convolutional neural networks

Improved Highway Network Block for Training Very Deep Neural Networks

An Incremental Self-Labeling Strategy for Semi-Supervised Deep Learning Based on Generative Adversarial Networks

Parametric Flatten-T Swish: An Adaptive Nonlinear Activation Function for Deep Learning

Adversarial Dual Network Learning With Randomized Image Transform for Restoring Attacked Images

Group Feedback Capsule Network

RS-CapsNet: An Advanced Capsule Network
