Abstract

Classifying fine-grained categories (e.g., bird species, car types, and aircraft types) is a crucial problem in image understanding and is difficult due to large intra-class and small inter-class variance. Most existing fine-grained approaches exploit the parts and local information of objects individually to improve classification accuracy but neglect the fusion between the object (global) features and the object's part (local) features that reinforces fine-grained cues. In this paper, we present a novel framework, the object–part registration–fusion Net (OR-Net), which models the registration and fusion between an object's (global) features and its parts' (local) features for fine-grained classification. Our model learns fine-grained features from the global and local regions of the object and fuses them through a registration mechanism to reinforce each region's characteristics in the feature maps. Specifically, OR-Net consists of (1) a multi-stream feature extraction network, which generates features for the global object and for various local regions, and (2) a registration–fusion module, which computes the dimension and location relationships between the global (object) region and the local (part) regions to generate registration information and then fuses the local features into the global features according to that information to produce the fine-grained feature. Experiments run on symmetric GPU devices with symmetric mini-batches verify that OR-Net surpasses state-of-the-art approaches on the CUB-200-2011 (Birds), Stanford Cars, and Stanford Aircraft datasets.
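To make the registration–fusion idea concrete, here is a minimal, PyTorch-style sketch. The module name, the bounding-box format (x, y, width, height in feature-map coordinates), and the additive fusion followed by a 1×1 convolution are illustrative assumptions, not the authors' exact implementation:

```python
import torch
import torch.nn.functional as F
from torch import nn


class RegistrationFusion(nn.Module):
    """Registers a local (part) feature map into the global (object) feature map.

    Hypothetical sketch: the real OR-Net module may compute the registration
    information and fuse the features differently.
    """

    def __init__(self, channels: int):
        super().__init__()
        # 1x1 convolution to mix channels after fusion (assumed design choice).
        self.mix = nn.Conv2d(channels, channels, kernel_size=1)

    def forward(self, global_feat, part_feat, part_box):
        # global_feat: (B, C, H, W) features of the whole object (global stream).
        # part_feat:   (B, C, h, w) features from one part stream.
        # part_box:    (x, y, w, h) location/size of the part inside global_feat.
        x, y, w, h = part_box
        # "Registration": resize the part features to the region they occupy in
        # the global map, i.e., apply the dimension/location relationship.
        registered = F.interpolate(part_feat, size=(h, w),
                                   mode="bilinear", align_corners=False)
        fused = global_feat.clone()
        # "Fusion": reinforce the corresponding region of the global features.
        fused[:, :, y:y + h, x:x + w] = fused[:, :, y:y + h, x:x + w] + registered
        return self.mix(fused)


# Example with dummy tensors: a 256-channel global map and one part stream.
g = torch.randn(2, 256, 28, 28)
p = torch.randn(2, 256, 7, 7)
out = RegistrationFusion(256)(g, p, part_box=(4, 6, 10, 10))  # -> (2, 256, 28, 28)
```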

Highlights

  • Fine-grained classification is the branch of image classification that focuses on distinguishing objects in subordinate classes with subtle differences from the base classes

  • This study proposes a novel convolutional neural network, the object–part registration–fusion Net (OR-Net)

  • The whole-body stream and the parts streams capture the distinctive parts of the object, and their inputs are cropped from the original image to preserve more detail during feature extraction (a minimal sketch of this layout follows below)
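The multi-stream input layout from the highlights can be sketched as follows, assuming PyTorch/torchvision, a ResNet-50 backbone per stream, and externally supplied part boxes; all of these are illustrative choices rather than the paper's exact configuration:

```python
import torch
from torch import nn
from torchvision import models
from torchvision.transforms.functional import resized_crop


class MultiStreamExtractor(nn.Module):
    """Whole-body stream plus part streams whose inputs are cropped from the
    original image (hypothetical sketch of the layout described above)."""

    def __init__(self, num_part_streams: int = 2):
        super().__init__()

        def backbone():
            m = models.resnet50(weights=None)  # torchvision >= 0.13 API
            return nn.Sequential(*list(m.children())[:-2])  # keep the spatial map

        self.whole_body = backbone()
        self.part_streams = nn.ModuleList(backbone() for _ in range(num_part_streams))

    def forward(self, image, part_boxes):
        # image: (B, 3, H, W) original image; part_boxes: one (top, left, h, w)
        # box per part stream, given in original-image coordinates.
        global_feat = self.whole_body(image)
        part_feats = []
        for stream, (top, left, h, w) in zip(self.part_streams, part_boxes):
            # Crop the part from the full-resolution image so the stream sees
            # more detail than a downsampled global view would provide.
            crop = resized_crop(image, top, left, h, w, size=[224, 224])
            part_feats.append(stream(crop))
        return global_feat, part_feats
```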


Summary

Introduction

Fine-grained classification is the branch of image classification that focuses on distinguishing objects in subordinate classes that differ only subtly from their base classes. Some scholars collect optical images from surveillance cameras to recognize rainfall intensity [9,10], while others use satellite images for classification and prediction [11,12]. According to their CNN structures, we group these studies into categories such as the multi-stream and the attention-location/part-location approaches. Some studies take hand-made part annotations to provide part information for fine-grained image classification and utilize a multi-stream network to extract each part's (local) features from separate streams. Previous works design various convolutional neural networks around different factors of variation, such as multi-stream frameworks and part information, to generate discriminative feature descriptors for fine-grained image classification.
[Figure: (a) original image; (b) without registration–fusion features; (c) with registration–fusion features]

Methodology
Feature Registration–Fusion Module
Network Architecture
Procedure of the Proposed OR-Net
Experiment
Experimental Datasets and Implementation Details
Diagnostic Experiments
Ablation Experiments
Experimental Analysis on the Popular Datasets
Quantitative Evaluation
Method
Qualitative Evaluation
Findings
Conclusions
