Generalization Capability Research Articles

Named entity recognition (NER) is a fundamental subtask for information extraction that aims to locate and classify named entities in unstructured text into predefined categories. Recently, large-scale language models (LLMs) have achieved SOTA performance on a variety of natural language processing tasks. However, because NER is a sequence labeling task in nature while LLMs is a text-generation model, the performance of LLMs on NER is still significantly below supervised baselines, and NER remains a difficult task. Meanwhile, the word boundary and semantic information of Chinese words are usually quite vague, as words contained in Chinese texts are not separated by spaces. Thus, the NER task still requires supervised learning paradigm and heavily relies on large amounts of labeled data, such as entity type and boundary information. However, the cost of labeling data can be prohibitively large, and the purely supervised approaches usually suffer from poor generalization capability. In this article, we propose a multitask learning-based bidirectional iterated dilated convolution model, BCNN-CWS, for low-resource NER via leveraging word boundary information of Chinese word segmentation (CWS) task. Specifically, to efficiently recognize named entities, an iterated dilated convolutional model with a limited number of layers is implemented. In addition, a bidirectional causal convolution mechanism is presented for contextual information extraction. Results of extensive experiments on public Chinese datasets demonstrate that BCNN-CWS achieves superior performance over state-of-the-art models, and it yields up to about 50% speed improvement over existing methods. It is worth noting that BCNN-CWS can be further improved by combining with a pretrained model. Received: 25 Spetember 2024 | Revised: 4 November 2024 | Accepted: 28 November 2024 Conflicts of Interest The authors declare that they have no conflicts of interest to this work. Data Availability Statement The data that support the findings of this study are openly available in GitLab at https://github.com/jiangfeng13/BCNN-CWS Author Contribution Statement Tao Wu: Conceptualization, Methodology, Writing – original draft, Writing – review & editing, Visualization, Supervision. Xinwen Cao: Resources, Data curation. Feng Jiang: Software, Validation, Formal analysis, Investigation, Writing – original draft. Canyixing Cui: Data curation, Writing -review & editing. Xuehao Li: Resources. Xingping Xian: Supervision, Project administration, Funding acquisition.

Read full abstract

Background: Given the severe economic burden that citrus greening disease imposes on fruit farmers and related industries, rapid and accurate disease detection is particularly crucial. This not only effectively curbs the spread of the disease, but also significantly reduces reliance on manual detection within extensive citrus planting areas. Objective: In response to this challenge, and to address the issues posed by resource-constrained platforms and complex backgrounds, this paper designs and proposes a novel method for the recognition and localization of citrus greening disease, named the HHS-RT-DETR model. The goal of this model is to achieve precise detection and localization of the disease while maintaining efficiency. Methods: Based on the RT-DETR-r18 model, the following improvements are made: the HS-FPN (high-level screening-feature pyramid network) is used to improve the feature fusion and feature selection part of the RT-DETR model, and the filtered feature information is merged with the high-level features by filtering out the low-level features, so as to enhance the feature selection ability and multi-level feature fusion ability of the model. In the feature fusion and feature selection sections, the HWD (hybrid wavelet-directional filter banks) downsampling operator is introduced to prevent the loss of effective information in the channel and reduce the computational complexity of the model. Through using the ShapeIoU loss function to enable the model to focus on the shape and scale of the bounding box itself, the prediction of the bounding box of the model will be more accurate. Conclusions and Results: This study has successfully developed an improved HHS-RT-DETR model which exhibits efficiency and accuracy on resource-constrained platforms and offers significant advantages for the automatic detection of citrus greening disease. Experimental results show that the improved model, when compared to the RT-DETR-r18 baseline model, has achieved significant improvements in several key performance metrics: the precision increased by 7.9%, the frame rate increased by 4 frames per second (f/s), the recall rose by 9.9%, and the average accuracy also increased by 7.5%, while the number of model parameters reduced by 0.137×107. Moreover, the improved model has demonstrated outstanding robustness in detecting occluded leaves within complex backgrounds. This provides strong technical support for the early detection and timely control of citrus greening disease. Additionally, the improved model has showcased advanced detection capabilities on the PASCAL VOC dataset. Discussions: Future research plans include expanding the dataset to encompass a broader range of citrus species and different stages of citrus greening disease. In addition, the plans involve incorporating leaf images under various lighting conditions and different weather scenarios to enhance the model’s generalization capabilities, ensuring the accurate localization and identification of citrus greening disease in diverse complex environments. Lastly, the integration of the improved model into an unmanned aerial vehicle (UAV) system is envisioned to enable the real-time, regional-level precise localization of citrus greening disease.

Read full abstract

Generalization Capability Research Articles

Related Topics

Articles published on Generalization Capability

Data Augmentation for Voiceprint Recognition Using Generative Adversarial Networks

Transformer model-based multi-scale fine-grained identification and classification of regional traffic states

Learning to Train and to Explain a Deep Survival Model with Large-Scale Ovarian Cancer Transcriptomic Data

Presentation Attack Detection using iris periocular visual spectrum images

SSMBERT: A Space Science Mission Requirement Classification Method Based on BERT

Damage Localization and Severity Assessment in Composite Structures Using Deep Learning Based on Lamb Waves

Low-Resource Chinese Named Entity Recognition via CNN-based Multitask Learning

Intelligent aerodynamic modelling method for steady/unsteady flow fields of airfoils driven by flow field images based on modified U-Net neural network

Optimization of Thermal and Pressure Drop Performance in Circular Pin Fin Heat Sinks Using the TOPSIS Method

ENHANCING WELDING QUALITY THROUGH PREDICTIVE MODELLING — INSIGHTS FROM MACHINE LEARNING TECHNIQUES

Prediction of the Dissolved Oxygen Content in Aquaculture Based on the CNN-GRU Hybrid Neural Network

A text classification method combining in-domain pre-training and prompt learning for the steel e-commerce industry

A Machine Learning-Based Approach for Predicting Aerodynamic Coefficients Using Deep Neural Networks and CFD Data

Real-time pavement distress detection based on deep learning and visual sensors

Refinement of a kinetic adsorption model through Artificial Intelligence

DRGAT: Predicting Drug Responses Via Diffusion-Based Graph Attention Network.

DINOV2-FCS: a model for fruit leaf disease classification and severity prediction

HHS-RT-DETR: A Method for the Detection of Citrus Greening Disease

Cable fault diagnosis with generalization capability using incremental learning and deep convolutional neural network

Modelling icing growth on overhead transmission lines: Current advances and future directions

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Generalization Capability Research Articles

Related Topics

Articles published on Generalization Capability

Data Augmentation for Voiceprint Recognition Using Generative Adversarial Networks

Transformer model-based multi-scale fine-grained identification and classification of regional traffic states

Learning to Train and to Explain a Deep Survival Model with Large-Scale Ovarian Cancer Transcriptomic Data

Presentation Attack Detection using iris periocular visual spectrum images

SSMBERT: A Space Science Mission Requirement Classification Method Based on BERT

Damage Localization and Severity Assessment in Composite Structures Using Deep Learning Based on Lamb Waves

Low-Resource Chinese Named Entity Recognition via CNN-based Multitask Learning

Intelligent aerodynamic modelling method for steady/unsteady flow fields of airfoils driven by flow field images based on modified U-Net neural network

Optimization of Thermal and Pressure Drop Performance in Circular Pin Fin Heat Sinks Using the TOPSIS Method

ENHANCING WELDING QUALITY THROUGH PREDICTIVE MODELLING — INSIGHTS FROM MACHINE LEARNING TECHNIQUES

Prediction of the Dissolved Oxygen Content in Aquaculture Based on the CNN-GRU Hybrid Neural Network

A text classification method combining in-domain pre-training and prompt learning for the steel e-commerce industry

A Machine Learning-Based Approach for Predicting Aerodynamic Coefficients Using Deep Neural Networks and CFD Data

Real-time pavement distress detection based on deep learning and visual sensors

Refinement of a kinetic adsorption model through Artificial Intelligence

DRGAT: Predicting Drug Responses Via Diffusion-Based Graph Attention Network.

DINOV2-FCS: a model for fruit leaf disease classification and severity prediction

HHS-RT-DETR: A Method for the Detection of Citrus Greening Disease

Cable fault diagnosis with generalization capability using incremental learning and deep convolutional neural network

Modelling icing growth on overhead transmission lines: Current advances and future directions