Abstract

Fine-grained visual recognition is an important problem in pattern recognition applications. However, it is a challenging task due to subtle interclass differences and large intraclass variations. Recent visual attention models can automatically locate critical object parts and represent them robustly against appearance variations. However, because they do not account for spatial dependencies in discriminative feature learning, these methods underperform in classifying fine-grained objects. In this paper, we present a deep attention-based spatially recursive model that learns to attend to critical object parts and encode them into spatially expressive representations. Our network builds on bilinear pooling, which enables local pairwise feature interactions between the outputs of two different convolutional neural networks (CNNs) responsible for distinct region detection and relevant feature extraction. Spatial long short-term memory (LSTM) units are then introduced to generate spatially meaningful hidden representations by modeling long-range dependencies over all features in two dimensions. An attention model is placed between the bilinear outputs and the spatial LSTMs for dynamic selection over varied inputs. Our model, composed of two-stream CNN layers, bilinear pooling, and spatial recursive encoding with attention, is end-to-end trainable, serving as both part detector and feature extractor so that relevant features are localized, extracted, and encoded spatially for recognition. We demonstrate the superiority of our method on two typical fine-grained recognition tasks: fine-grained image classification and person re-identification.
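
As a rough illustration of the pipeline the abstract describes, the following minimal PyTorch sketch wires together two small CNN streams, bilinear pooling of their local features, a soft attention map over spatial locations, and a one-dimensional LSTM scan over the flattened grid as a simplified stand-in for the two-dimensional spatial LSTM. The `BilinearAttentiveEncoder` name, all layer sizes, and the single-direction scan are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn

class BilinearAttentiveEncoder(nn.Module):
    """Hypothetical sketch of the abstract's pipeline: two CNN streams,
    bilinear pooling, attention over locations, recurrent encoding.
    Sizes and structure are assumptions, not the paper's configuration."""

    def __init__(self, num_classes: int = 200, c: int = 64, hidden: int = 256):
        super().__init__()
        # Two CNN streams; the paper uses one for region detection and
        # one for feature extraction. Tiny stand-ins are used here.
        def stream():
            return nn.Sequential(
                nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
                nn.Conv2d(32, c, 3, stride=2, padding=1), nn.ReLU(),
            )
        self.stream_a, self.stream_b = stream(), stream()
        # Attention: one score per spatial location from its bilinear feature.
        self.attn = nn.Linear(c * c, 1)
        # A 1-D LSTM over the flattened grid, standing in for the
        # two-dimensional spatial LSTM units described in the abstract.
        self.lstm = nn.LSTM(input_size=c * c, hidden_size=hidden,
                            batch_first=True)
        self.classifier = nn.Linear(hidden, num_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        fa = self.stream_a(x)                 # (B, C, H, W)
        fb = self.stream_b(x)                 # (B, C, H, W)
        fa = fa.flatten(2).transpose(1, 2)    # (B, HW, C)
        fb = fb.flatten(2).transpose(1, 2)    # (B, HW, C)
        # Bilinear pooling: outer product of the two streams' features at
        # each location yields local pairwise channel interactions.
        bilinear = torch.einsum('bnc,bnd->bncd', fa, fb).flatten(2)  # (B, HW, C*C)
        # Soft attention over locations emphasizes discriminative parts.
        weights = torch.softmax(self.attn(bilinear), dim=1)          # (B, HW, 1)
        attended = bilinear * weights
        # The recurrent scan aggregates long-range spatial dependencies.
        _, (h, _) = self.lstm(attended)
        return self.classifier(h[-1])

# Smoke test on a dummy batch.
logits = BilinearAttentiveEncoder()(torch.randn(2, 3, 64, 64))
print(logits.shape)  # torch.Size([2, 200])
```

In this toy setup the full pipeline remains differentiable end to end, mirroring the abstract's claim that detection, extraction, and spatial encoding are trained jointly rather than as separate stages.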
