AUC Maximization for Low-Resource Named Entity Recognition

Ngoc Dang Nguyen,Changyou Chen,Wray Buntine,Wei Tan,Lan Du,Richard Beare

doi:10.1609/aaai.v37i11.26571

Abstract

Current work in named entity recognition (NER) uses either cross entropy (CE) or conditional random fields (CRF) as the objective/loss functions to optimize the underlying NER model. Both of these traditional objective functions for the NER problem generally produce adequate performance when the data distribution is balanced and there are sufficient annotated training examples. But since NER is inherently an imbalanced tagging problem, the model performance under the low-resource settings could suffer using these standard objective functions. Based on recent advances in area under the ROC curve (AUC) maximization, we propose to optimize the NER model by maximizing the AUC score. We give evidence that by simply combining two binary-classifiers that maximize the AUC score, significant performance improvement over traditional loss functions is achieved under low-resource NER settings. We also conduct extensive experiments to demonstrate the advantages of our method under the low-resource and highly-imbalanced data distribution settings. To the best of our knowledge, this is the first work that brings AUC maximization to the NER setting. Furthermore, we show that our method is agnostic to different types of NER embeddings, models and domains. The code of this work is available at https://github.com/dngu0061/NER-AUC-2T.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

AUC Maximization for Low-Resource Named Entity Recognition

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Jun 26, 2023
Citations: 4

Similar Papers

Enhancing the Performance of Telugu Named Entity Recognition Using Gazetteer Features
Saikiranmai Gorla ... Aruna Malapati
Information | VOL. 11
Saikiranmai Gorla, et. al.Saikiranmai Gorla ... Aruna Malapati
02 Feb 2020
Information | VOL. 11

On the Construction of Web NER Model Training Tool based on Distant Supervision
Chien-Lung Chou ... Kuo-Chun Chien
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 19
Chien-Lung Chou, et. al.Chien-Lung Chou ... Kuo-Chun Chien
15 Nov 2020
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 19

A Disease Identification Algorithm for Medical Crowdfunding Campaigns: Validation Study.
Steven S Doerstling ... Peter A Ubel
Journal of Medical Internet Research | VOL. 24
Steven S Doerstling, et. al.Steven S Doerstling ... Peter A Ubel
21 Jun 2022
Journal of Medical Internet Research | VOL. 24

Evaluating Medical Entity Recognition in Health Care: Entity Model Quantitative Study.
Shengyu Liu ... Xiaolei Xiu
JMIR medical informatics | VOL. 12
Shengyu Liu, et. al.Shengyu Liu ... Xiaolei Xiu
17 Oct 2024
JMIR medical informatics | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

AUC Maximization for Low-Resource Named Entity Recognition

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence