A CNN-RNN Hybrid Model with 2D Wavelet Transform Layer for Image Classification

Zihao Dong,Ruixun Zhang,Xiuli Shao

doi:10.1109/ictai.2019.00147

Abstract

Convolutional neural networks (CNNs) have recently achieved impressive performances in image processing tasks such as image classification and object recognition. However, CNNs only process images in the spatial domain whereas spectral analysis operates in the frequency domain. In this paper, we propose the 2D wavelet transform layer. The learned features from images are viewed as two-directional sequential data, and we use two LSTM layers that sweep both horizontally and vertically across the image to compress feature matrices. Based on this, the 2D wavelet transform decomposes the above feature matrices as a learned mixing of different harmonic functions, and therefore integrating the spectral analysis into CNNs. We also select 3 × 3 convolutional mixing style with Gaussian+LSM filter to mix the output of the 2D wavelet transform to generate new output features. Finally, we combine the sequential and spectral features to build our CNN-RNN architecture with skip layers and apply it to image classification. Our proposed network is evaluated on three widely-used benchmark datasets: CIFAR-10, CIFAR-100 and Tiny ImageNet. Experiments show that our CNN-RNN hybrid model achieves better accuracy in image classification tasks.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A CNN-RNN Hybrid Model with 2D Wavelet Transform Layer for Image Classification

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Learning sparse features with lightweight ScatterNet for small sample training
Zihao Dong ... Zengsheng Kuang
Knowledge-Based Systems | VOL. 205
Zihao Dong, et. al.Zihao Dong ... Zengsheng Kuang
25 Jul 2020
Knowledge-Based Systems | VOL. 205

Multi-level Dense Capsule Networks
Sai Samarth R Phaye ... Apoorva Sikka
-
Sai Samarth R Phaye, et. al.Sai Samarth R Phaye ... Apoorva Sikka
01 Jan 2019
01 Jan 2019

Competing ratio loss for discriminative multi-class image classification
Ke Zhang ... Tony X Han
Neurocomputing | VOL. 464
Ke Zhang, et. al.Ke Zhang ... Tony X Han
27 Aug 2021
Neurocomputing | VOL. 464

Beetle Antennae Search: Using Biomimetic Foraging Behaviour of Beetles to Fool a Well-Trained Neuro-Intelligent System.
Ameer Khan ... Shuai Li
Biomimetics | VOL. 7
Ameer Khan, et. al.Ameer Khan ... Shuai Li
23 Jun 2022
Biomimetics | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A CNN-RNN Hybrid Model with 2D Wavelet Transform Layer for Image Classification

Abstract

Talk to us

Similar Papers