Caltech Dataset Research Articles

Deep neural networks (DNNs) have achieved the state of the art performance in numerous fields. However, DNNs need high computation times, and people always expect better performance in a lower computation. Therefore, we study the human somatosensory system and design a neural network (SpinalNet) to achieve higher accuracy with fewer computations. Hidden layers in traditional NNs receive inputs in the previous layer, apply activation function, and then transfer the outcomes to the next layer. In the proposed SpinalNet, each layer is split into three splits: 1) input split, 2) intermediate split, and 3) output split. Input split of each layer receives a part of the inputs. The intermediate split of each layer receives outputs of the intermediate split of the previous layer and outputs of the input split of the current layer. The number of incoming weights becomes significantly lower than traditional DNNs. The SpinalNet can also be used as the fully connected or classification layer of DNN and supports both traditional learning and transfer learning. We observe significant error reductions with lower computational costs in most of the DNNs. Traditional learning on the VGG-5 network with SpinalNet classification layers provided the state-of-the-art (SOTA) performance on QMNIST, Kuzushiji-MNIST, and EMNIST (Letters, Digits, and Balanced) datasets. Traditional learning with ImageNet pre-trained initial weights and SpinalNet classification layers provided the SOTA performance on STL-10, Fruits 360, Bird225, and Caltech-101 datasets. The scripts of the proposed SpinalNet training are available at the following link: <uri>https://github.com/dipuk0506/SpinalNet</uri> <i>Impact Statement</i>—Research in deep neural networks (DNNs) has gained significant attention from industries and academia due to their eye-catching performance. DNNs have enabled machines to perform myriad tasks with high accuracy that once only humans could do. Several researchers have recently proposed different types of NNs and have achieved high accuracy. The recent success of biologically inspired convolutional neural networks and the miraculous spinal architecture of humans has motivated us to develop a neural network with gradual inputs. We have achieved superior performance in several datasets. After the first online appearance of the initial version of this paper, several researchers have applied the proposed neural network in several new datasets and reported promising results. We may observe numerous novel applications of SpinalNet in upcoming years.

Read full abstract

The paperfocuses on the content-based image retrieval systems building. The main challengesin the construction of such sys-tems are considered, the components of such systems are reviewed, and a brief overview of the main methods and techniques that have been used in this area to implement the main components of image search systems is given.As one of the options for solving such a problem, an image retrievemethodology based on the binary space partitioning method and the perceptual hashing method is proposed. Space binary partition trees are a data structuresobtained as follows: the space is partitioned by a hyperplane into two half-spaces, and theneach half-space is recursively partitioned until each node contains only a trivial part of the input features. Perceptual hashing algorithms make it possible to represent an image as a 64-bit hash value, with similar images represented by similar hash values. As a metric for determining the distance between hash values, the Hamming distance is used, this counts the number of dis-tinct bits.To organize the base of hash values, a vp-tree is used, which is an implementation of the binary space partitioning struc-ture.For the experimental study of the methodology, the Caltech-256 data set was used, which contains 30607 images divided into 256 categories, the Difference Hash, P-Hash and Wavelet Hash algorithms were used as perceptual hashing algorithms, the study was carried out in the Google Colab environment.As part of an experimental study, the robustnessof hashing algorithms to modification, compression, blurring, noise, and image rotation was examined. In addition, a study was made of the process of building a vp-tree and the process of searching for images in the tree. As a result of experiments, it was found that each of the hashing algorithms has its own advantages and disadvantages. So, the hashing algorithm based on the difference inadjacentpixel values in the image turned out to be the fastest, but it turned out to be not very robustto modification and image rotation. The P-Hash algorithm, based on the use of the discrete cosine transform, showed better resistance to image blurring, but turned out to be sensitive to image compression. The W-Hash algorithm based on the Haar wavelet transform made it possible to construct the most efficient tree structure and provedto be resistant to image modification and compression.The proposed technique is not recommended for use in general-purpose image retrieval systems; however, it can be useful in searching for images in specialized databases. As ways to improve the methodology, one can note the improvement of the vp-tree structure, as well as the search for a more efficient method of image representation, in addition to perceptual hashing.

Read full abstract

Caltech Dataset Research Articles

Related Topics

Articles published on Caltech Dataset

Adaptive class token knowledge distillation for efficient vision transformer

SNN using color-opponent and attention mechanisms for object recognition

A Mighty Image Retrieval Descriptor Based on Machine Learning and Gaussian Derivative Filter

Survey of image classification models for transfer learning

Content-based image retrieval using multi-scale averaging local binary patterns

AdvRain: Adversarial Raindrops to Attack Camera-Based Smart Vision Systems

SpinalNet: Deep Neural Network With Gradual Input

The Weights Reset Technique for Deep Neural Networks Implicit Regularization

Privileged information learning with weak labels

Evolving Deep Multiple Kernel Learning Networks Through Genetic Algorithms

End-to-End Object Detection with Enhanced Positive Sample Filter

An Image Classification Method Based on Adaptive Attention Mechanism and Feature Extraction Network.

Deep Domain Adaptation Using Cascaded Learning Networks and Metric Learning

Curiosity-Driven Class-Incremental Learning via Adaptive Sample Selection

An Improved Probability Density Function (PDF) for Face Skin Detection

Graph Convolutional Network Combined with Semantic Feature Guidance for Deep Clustering

Image classification method based on local receptive field extended S-Mobilenet model

Fast and effective pedestrian detection based on low-level visual features combination

Methodology for image retrieval based on binary spacepartitioning and perceptual image hashing

PMAL: A Proxy Model Active Learning Approach for Vision Based Industrial Applications

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Caltech Dataset Research Articles

Related Topics

Articles published on Caltech Dataset

Adaptive class token knowledge distillation for efficient vision transformer

SNN using color-opponent and attention mechanisms for object recognition

A Mighty Image Retrieval Descriptor Based on Machine Learning and Gaussian Derivative Filter

Survey of image classification models for transfer learning

Content-based image retrieval using multi-scale averaging local binary patterns

AdvRain: Adversarial Raindrops to Attack Camera-Based Smart Vision Systems

SpinalNet: Deep Neural Network With Gradual Input

The Weights Reset Technique for Deep Neural Networks Implicit Regularization

Privileged information learning with weak labels

Evolving Deep Multiple Kernel Learning Networks Through Genetic Algorithms

End-to-End Object Detection with Enhanced Positive Sample Filter

An Image Classification Method Based on Adaptive Attention Mechanism and Feature Extraction Network.

Deep Domain Adaptation Using Cascaded Learning Networks and Metric Learning

Curiosity-Driven Class-Incremental Learning via Adaptive Sample Selection

An Improved Probability Density Function (PDF) for Face Skin Detection

Graph Convolutional Network Combined with Semantic Feature Guidance for Deep Clustering

Image classification method based on local receptive field extended S-Mobilenet model

Fast and effective pedestrian detection based on low-level visual features combination

Methodology for image retrieval based on binary spacepartitioning and perceptual image hashing

PMAL: A Proxy Model Active Learning Approach for Vision Based Industrial Applications