Abstract
Image-based identification of plant specimens plays a crucial role in fields such as agriculture, ecology, and biodiversity conservation. The growing interest in deep learning has led to remarkable advances in image classification, particularly with convolutional neural networks (CNNs). Since 2015, in the context of the PlantCLEF challenge (run within CLEF, the Conference and Labs of the Evaluation Forum) (Joly et al. 2015), deep learning models, specifically CNNs, have consistently achieved the strongest results in this field (Carranza-Rojas 2018). However, recent developments have introduced transformer-based models, such as ViT (Vision Transformer) (Dosovitskiy et al. 2020) and CvT (Convolutional vision Transformer) (Wu et al. 2021), as a promising alternative for image classification tasks. Transformers offer distinctive advantages, such as capturing global context and handling long-range dependencies (Vaswani et al. 2017), which make them well suited to complex recognition tasks like plant identification. In this study, we focus on the image classification task using the PlantNet-300k dataset (Garcin et al. 2021a), a large collection of 306,146 plant images representing 1,081 distinct species, selected from the Pl@ntNet citizen observatory database. The dataset has two prominent characteristics that pose challenges for classification. First, there is a significant class imbalance: a small subset of species accounts for the majority of the images, which biases classification models and degrades their accuracy. Second, many species are visually similar, making them difficult to identify accurately, even for experts. The dataset authors refer to these characteristics as long-tailed distribution and high intrinsic ambiguity, respectively (Garcin et al. 2021b). To address these challenges, we adopted a two-fold approach. First, we leveraged transformer-based models to tackle the dataset's intrinsic ambiguity and capture the complex visual patterns present in plant images. Second, we mitigated the class imbalance through data preprocessing, specifically class balancing methods, aiming to ensure fair representation of all plant species and thereby improve the overall performance of image classification models. Our objective is to assess the effect of such preprocessing, specifically class balancing, on classification performance on PlantNet-300k: we addressed the class imbalance with different preprocessing methods and rigorously compared transformer-based models trained with and without class balancing, with the ultimate goal of determining whether these techniques yield more accurate and reliable classification results, particularly for underrepresented species in the dataset. In our experiment, we compared two transformer-based models, ViT and CvT, on two versions of the PlantNet-300k dataset, one with class balancing and one without, yielding four sets of evaluation metrics (two models × two dataset versions).
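For concreteness, below is a minimal sketch of one common class-balancing strategy, inverse-frequency re-sampling with PyTorch's WeightedRandomSampler. The abstract does not specify which balancing method was actually used, so the sampler choice, the toy dataset, and the tensor shapes here are illustrative assumptions only.

```python
# Illustrative sketch only: inverse-frequency re-sampling so that long-tail
# species are drawn as often as dominant ones. Not the authors' exact pipeline.
from collections import Counter

import torch
from torch.utils.data import DataLoader, TensorDataset, WeightedRandomSampler

# Toy stand-in for PlantNet-300k: 10 samples over 3 imbalanced classes.
labels = torch.tensor([0, 0, 0, 0, 0, 0, 1, 1, 2, 2])
images = torch.randn(len(labels), 3, 224, 224)  # dummy image tensors
dataset = TensorDataset(images, labels)

# Weight each sample by the inverse frequency of its class.
class_counts = Counter(labels.tolist())
sample_weights = torch.tensor([1.0 / class_counts[int(y)] for y in labels])

sampler = WeightedRandomSampler(
    weights=sample_weights,
    num_samples=len(dataset),  # one "epoch" of balanced draws
    replacement=True,          # rare classes can be re-drawn as needed
)
loader = DataLoader(dataset, batch_size=4, sampler=sampler)

for _, batch_labels in loader:
    print(batch_labels)  # class proportions are now roughly uniform
```

On PlantNet-300k itself, the same weighting would be computed over the per-species counts of all 1,081 classes. Likewise, here is a hedged sketch of how the two transformer backbones could be instantiated for the 1,081-class task with the Hugging Face transformers library; the checkpoint names are assumptions, as the abstract does not name the pretrained weights used.

```python
# Hypothetical checkpoints; the abstract does not say which pretrained weights
# or fine-tuning configuration the authors used.
from transformers import AutoModelForImageClassification

vit = AutoModelForImageClassification.from_pretrained(
    "google/vit-base-patch16-224-in21k",
    num_labels=1081,               # one logit per PlantNet-300k species
    ignore_mismatched_sizes=True,  # re-initialize the classification head
)
cvt = AutoModelForImageClassification.from_pretrained(
    "microsoft/cvt-13",
    num_labels=1081,
    ignore_mismatched_sizes=True,
)
```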
To assess classification performance, we used a range of standard metrics, including recall, precision, accuracy, and the ROC (Receiver Operating Characteristic) curve with its AUC (Area Under the Curve). These metrics provide insight into each model's ability to correctly classify plant species, quantify false positives and false negatives, measure overall accuracy, and assess discriminatory power. Through this comparative study, we seek to contribute to plant identification research by providing empirical evidence of the effectiveness of class balancing techniques in improving the performance of transformer-based models on the PlantNet-300k dataset and similar long-tailed datasets.
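As a minimal sketch of how these metrics could be computed, the following uses scikit-learn on randomly generated stand-in predictions (the sample size, class count, and scores are hypothetical; no results from the study are reproduced here):

```python
# Minimal sketch of the evaluation metrics named above, using scikit-learn.
# y_true / y_score are random stand-ins, not outputs from the paper's models.
import numpy as np
from sklearn.metrics import (accuracy_score, precision_score, recall_score,
                             roc_auc_score)

rng = np.random.default_rng(0)
n_classes = 5                   # stand-in for the 1,081 PlantNet-300k species
y_true = rng.integers(0, n_classes, size=200)
y_score = rng.random((200, n_classes))
y_score /= y_score.sum(axis=1, keepdims=True)  # fake per-class probabilities
y_pred = y_score.argmax(axis=1)

print("accuracy :", accuracy_score(y_true, y_pred))
print("precision:", precision_score(y_true, y_pred, average="macro",
                                    zero_division=0))
print("recall   :", recall_score(y_true, y_pred, average="macro"))
print("ROC AUC  :", roc_auc_score(y_true, y_score, multi_class="ovr",
                                  average="macro"))
```

Macro-averaged variants are a natural choice here because they weight rare and common species equally, directly exposing any gains from class balancing on the long tail.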