Abstract

Fine-grained recognition, which aims at distinguishing confusing categories such as bird species within a genus, is one of the most difficult topics in visual recognition. Part and bounding box annotations in fine-grained images are very important for improving performance. In real applications, however, such annotations may not be available, which makes fine-grained recognition a challenging problem. In this paper, we propose a jointly trained Convolutional Neural Network (CNN) architecture that solves the fine-grained recognition problem without using part or bounding box information. In this framework, we first detect part candidates by computing the gradients of the feature maps of a trained CNN model with respect to the input image, and then filter out unnecessary candidates by fusing two saliency detection methods. Meanwhile, two groups of global object locations are obtained from the saliency detection methods and a segmentation method. With the filtered part candidates and the approximate object locations as inputs, we construct a CNN architecture with local parts and global discrimination (LG-CNN), which consists of two CNN streams with shared weights. The upper stream of LG-CNN focuses on the part information of the input image, while the bottom stream focuses on the global input image. LG-CNN is jointly trained, with the two streams' loss functions guiding the updates of the shared weights. Experiments on three popular fine-grained datasets validate the effectiveness of the proposed LG-CNN architecture. Applying LG-CNN to generic object recognition datasets also yields performance superior to a directly fine-tuned CNN architecture by a large margin.
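The part-candidate detection step above relies on gradients of a trained CNN's output with respect to the input image, in the spirit of gradient-based saliency. The following is a minimal sketch of that idea in PyTorch, using a tiny stand-in network rather than the paper's trained model; the network, input size, and the channel-wise max reduction are illustrative assumptions, not the authors' exact procedure.

```python
import torch
import torch.nn as nn

# Hypothetical stand-in CNN; the paper uses a pre-trained model instead.
net = nn.Sequential(
    nn.Conv2d(3, 8, kernel_size=3, padding=1), nn.ReLU(),
    nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(8, 10),
)
net.eval()

# Input image tensor; requires_grad=True so gradients flow back to pixels.
img = torch.randn(1, 3, 64, 64, requires_grad=True)

score = net(img).max()   # score of the highest-scoring class
score.backward()         # backpropagate to get d(score)/d(pixel)

# Channel-wise max of absolute gradients gives a per-pixel saliency map;
# high-response regions serve as candidate part locations.
saliency = img.grad.abs().amax(dim=1)  # shape: (1, 64, 64)
```

Thresholding or peak-picking on such a saliency map would yield the raw part candidates that the paper then filters by fusing two saliency detection methods.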
