Abstract
In real-world scenarios, data often follow a long-tailed distribution, and training deep neural networks on such imbalanced datasets has become a great challenge. The main problem caused by a long-tailed data distribution is that common classes dominate the training, resulting in very low accuracy on rare classes. Recent work focuses on improving the network's representation ability to overcome the long-tailed problem, but it often neglects adapting the classifier to the long-tailed case, which causes an "incompatibility" between the network representation and the classifier. In this paper, we use knowledge distillation to solve the long-tailed distribution problem and fully optimize the network representation and the classifier simultaneously. We propose multi-expert knowledge distillation with class-balanced sampling to jointly learn a high-quality network representation and classifier. A channel activation-based knowledge distillation method is also proposed to further improve performance. State-of-the-art results on several large-scale long-tailed classification datasets show the superior generalization of our method.
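As a concrete illustration of the distillation component, a standard temperature-scaled knowledge distillation loss (Hinton-style soft-target matching; a minimal sketch of the general technique, not the authors' exact multi-expert formulation) can be written as:

```python
import numpy as np

def softmax(logits, T=1.0):
    # Temperature-scaled softmax; a higher T produces a softer distribution.
    z = np.asarray(logits, dtype=float) / T
    z = z - z.max()  # numerical stability
    e = np.exp(z)
    return e / e.sum()

def kd_loss(student_logits, teacher_logits, T=2.0):
    # KL divergence between the teacher's and student's soft targets,
    # scaled by T^2 so gradient magnitudes are comparable across temperatures.
    p = softmax(teacher_logits, T)  # teacher soft targets
    q = softmax(student_logits, T)  # student predictions
    return float(T * T * np.sum(p * (np.log(p) - np.log(q))))
```

The loss is zero when the student exactly matches the teacher's logits and positive otherwise; in a multi-expert setting one would aggregate such terms over several teacher networks.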
Highlights
Datasets commonly used in the literature for CNN training, such as CIFAR [1] and ImageNet [2], are usually artificially curated and rarely suffer from data imbalance
In the open real world, the distribution of data categories is often long-tailed: the number of training samples per class varies significantly, from thousands of images down to a few samples. In scenarios such as railway traffic, mesothelioma diagnosis, and industrial fault detection [3, 4], we need to detect unexpected objects whose real samples are usually hard to collect, which leads to a long-tailed data distribution. Many works [5, 6] have been proposed to solve such real-world classification problems
Authors in [7, 8] pointed out that the data distribution heavily influences the performance of deep neural networks
Summary
Datasets commonly used in the literature for CNN training, such as CIFAR [1] and ImageNet [2], are usually artificially curated and rarely suffer from data imbalance. In scenarios such as railway traffic, mesothelioma diagnosis, and industrial fault detection [3, 4], we need to detect unexpected objects whose real samples are usually hard to collect, which leads to a long-tailed data distribution. Many works [5, 6] have been proposed to solve such real-world classification problems, but they do not provide a general solution to the long-tailed distribution problem. When deep models are trained in such imbalanced scenarios, standard approaches usually fail to achieve satisfactory results, leading to a significant drop in performance
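The class-balanced sampling referred to in the abstract can be illustrated with a minimal sketch (this shows the standard re-weighting idea, not the paper's specific implementation): each sample is weighted inversely to its class frequency, so every class is drawn equally often in expectation regardless of how many samples it has.

```python
from collections import Counter

def class_balanced_weights(labels):
    # Assign each sample the weight 1 / (n_classes * count(its class)),
    # so the total sampling probability of every class is 1 / n_classes.
    counts = Counter(labels)
    n_classes = len(counts)
    return [1.0 / (n_classes * counts[y]) for y in labels]

# Example: a long-tailed toy dataset with 90 head-class and 10 tail-class samples.
labels = [0] * 90 + [1] * 10
weights = class_balanced_weights(labels)
head_mass = sum(w for w, y in zip(weights, labels) if y == 0)
tail_mass = sum(w for w, y in zip(weights, labels) if y == 1)
```

Both `head_mass` and `tail_mass` come out to 0.5, i.e. the rare class is sampled as often as the common one; in a deep learning pipeline these weights would typically feed a weighted random sampler.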