Fine-grained Recognition with Learnable Semantic Data Augmentation.

Yifan Pu,Yizeng Han,Yulin Wang,Junlan Feng,Chao Deng,Gao Huang

doi:10.1109/tip.2024.3364500

Abstract

Fine-grained image recognition is a longstanding computer vision challenge that focuses on differentiating objects belonging to multiple subordinate categories within the same meta-category. Since images belonging to the same meta-category usually share similar visual appearances, mining discriminative visual cues is the key to distinguishing fine-grained categories. Although commonly used image-level data augmentation techniques have achieved great success in generic image classification problems, they are rarely applied in fine-grained scenarios, because their random editing-region behavior is prone to destroy the discriminative visual cues residing in the subtle regions. In this paper, we propose diversifying the training data at the feature-level to alleviate the discriminative region loss problem. Specifically, we produce diversified augmented samples by translating image features along semantically meaningful directions. The semantic directions are estimated with a covariance prediction network, which predicts a sample-wise covariance matrix to adapt to the large intra-class variation inherent in fine-grained images. Furthermore, the covariance prediction network is jointly optimized with the classification network in a meta-learning manner to alleviate the degenerate solution problem. Experiments on four competitive fine-grained recognition benchmarks (CUB-200-2011, Stanford Cars, FGVC Aircrafts, NABirds) demonstrate that our method significantly improves the generalization performance on several popular classification networks (e.g., ResNets, DenseNets, EfficientNets, RegNets and ViT). Combined with a recently proposed method, our semantic data augmentation approach achieves state-of-the-art performance on the CUB-200-2011 dataset. The source code will be released.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Fine-grained Recognition with Learnable Semantic Data Augmentation.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Image Processing

Lead the way for us

Similar Papers

YNBIRDS: A System for Fine-Grained Bird Image Recognition
Yili Zhao ... Hua Zhou
-
Yili Zhao, et. al.Yili Zhao ... Hua Zhou
01 Jan 2019
01 Jan 2019

Image Recognition of Pledges of Capital Stock in Small- and Medium-Sized Enterprises Based on Partial Differential Equations
Dehui Zhou
Advances in Mathematical Physics | VOL. 2021
Dehui ZhouDehui Zhou
01 Nov 2021
Advances in Mathematical Physics | VOL. 2021

Cross-Category Cross-Semantic Regularization for Fine-Grained Image Recognition
Yelin Chen ... Wei Luo
-
Yelin Chen, et. al.Yelin Chen ... Wei Luo
01 Jan 2019
01 Jan 2019

Local Patch AutoAugment With Multi-Agent Collaboration
Shiqi Lin ... Ruoyu Feng
IEEE Transactions on Multimedia | VOL. 26
Shiqi Lin, et. al.Shiqi Lin ... Ruoyu Feng
01 Jan 2024
IEEE Transactions on Multimedia | VOL. 26

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Fine-grained Recognition with Learnable Semantic Data Augmentation.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Image Processing