Multilayer feature descriptors fusion CNN models for fine‐grained visual recognition

Yong Hou,Jinye Peng,Hangzai Luo,Jun Wang,Wanqing Zhao,Xiang Zhang

doi:10.1002/cav.1897

Abstract

AbstractFine‐grained image classification is a challenging topic in the field of computer vision. General models based on first‐order local features cannot achieve acceptable performance because the features are not so efficient in capturing fine‐grained difference. A bilinear convolutional neural network (CNN) model exhibits that a second‐order statistical feature is more efficient in capturing fine‐grained difference than a first‐order local feature. However, this framework only considers the extraction of a second‐order feature descriptor, using a single convolutional layer. The potential effective classification features of other convolutional layers are ignored, resulting in loss of recognition accuracy. In this paper, a multilayer feature descriptors fusion CNN model is proposed. It fully considers the second‐order feature descriptors and the first‐order local feature descriptor generated by different layers. Experimental verification was carried out on fine‐grained classification benchmark data sets, CUB‐200‐2011, Stanford Cars, and FGVC‐aircraft. Compared with the bilinear CNN model, the proposed method has improved accuracy by 0.8%, 1.1%, and 5.5%. Compared with the compact bilinear pooling model, there is an accuracy increase of 0.64%, 1.63%, and 1.45%, respectively. In addition, the proposed model effectively uses multiple 1×1 convolution kernels to reduce dimension. The experimental results show that the multilayer low‐dimensional second‐order feature descriptors fusion model has comparable recognition accuracy of the original model.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multilayer feature descriptors fusion CNN models for fine‐grained visual recognition

Abstract

Talk to us

Similar Papers

More From: Computer Animation and Virtual Worlds

Lead the way for us

Journal: Computer Animation and Virtual Worlds	Publication Date: May 1, 2019
Citations: 4

Similar Papers

Bi-stream CNN Down Syndrome screening model based on genotyping array
Bing Feng ... William Hoskins
BMC Medical Genomics | VOL. 11
Bing Feng, et. al.Bing Feng ... William Hoskins
01 Nov 2018
BMC Medical Genomics | VOL. 11

Artificial intelligence: finding the intersection of predictive modeling and clinical utility
Karthik Ravi
Gastrointestinal Endoscopy | VOL. 93
Karthik RaviKarthik Ravi
07 Mar 2021
Gastrointestinal Endoscopy | VOL. 93

Prediction of Diabetic Retinopathy using Deep Learning with Preprocessing
S Balaji ... D Gokulakrishnan
EAI Endorsed Transactions on Pervasive Health and Technology | VOL. 10
S Balaji, et. al.S Balaji ... D Gokulakrishnan
22 Feb 2024
EAI Endorsed Transactions on Pervasive Health and Technology | VOL. 10

Facial Expression Analysis Based on Fusion Multi-Layer Convolutional Layer Feature Neural Network
Hao Meng ...
-
Hao Meng, et. al.Hao Meng ...
09 Nov 2020
09 Nov 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multilayer feature descriptors fusion CNN models for fine‐grained visual recognition

Abstract

Talk to us

Similar Papers

More From: Computer Animation and Virtual Worlds