Area Under The ROC Curve Maximization Research Articles

Current work in named entity recognition (NER) uses either cross entropy (CE) or conditional random fields (CRF) as the objective/loss functions to optimize the underlying NER model. Both of these traditional objective functions for the NER problem generally produce adequate performance when the data distribution is balanced and there are sufficient annotated training examples. But since NER is inherently an imbalanced tagging problem, the model performance under the low-resource settings could suffer using these standard objective functions. Based on recent advances in area under the ROC curve (AUC) maximization, we propose to optimize the NER model by maximizing the AUC score. We give evidence that by simply combining two binary-classifiers that maximize the AUC score, significant performance improvement over traditional loss functions is achieved under low-resource NER settings. We also conduct extensive experiments to demonstrate the advantages of our method under the low-resource and highly-imbalanced data distribution settings. To the best of our knowledge, this is the first work that brings AUC maximization to the NER setting. Furthermore, we show that our method is agnostic to different types of NER embeddings, models and domains. The code of this work is available at https://github.com/dngu0061/NER-AUC-2T.

Read full abstract

Extreme learning machines (ELMs) has been theoretically and experimentally proved to achieve promising performance at a fast learning speed for supervised classification tasks. However, it does not perform well on imbalanced binary classification tasks and tends to get biased toward the majority class. Besides, since a large amount of training data with labels are not always available in the real world, there is an urgent demand to develop an efficient semi-supervised version of ELM for imbalanced binary classification tasks. In this article, owing to the distinct insensitivity of area under the ROC curve (AUC) to both class skews and changes of class distributions, we focus the study on integrating AUC maximization into the ELM framework to tackle with imbalanced binary classification tasks well. By demystifying the AUC metric with the ELM framework, we develop a new AUC-based ELM called AUC-ELM for imbalanced binary classification, which essentially is revealed to be equivalent to an ELM on another transformed data space. Accordingly, its semi-supervised version called SAUC-ELM is also developed. Both AUC-ELM and SAUC-ELM have the distinctive merits: 1) they share the advantage of ELM in both generalization capability and training efficiency, and further uniquely tailored for imbalanced binary classification tasks and 2) in contrast to the existing imbalanced variants of ELM, such as class-specific cost regulation ELM and semi-supervised ELM, they have fewer parameters to tune, thereby reducing the computational cost for model selection. Experiments on a heap of datasets show that both AUC-ELM and SAUC-ELM outperform the other comparative methods in terms of both classification performance and training speed.

Read full abstract

Area Under The ROC Curve Maximization Research Articles

Articles published on Area Under The ROC Curve Maximization

AUC Maximization for Low-Resource Named Entity Recognition

Differentially private empirical risk minimization for AUC maximization

Anomaly detection with inexact labels

AUC-Based Extreme Learning Machines for Supervised and Semi-Supervised Imbalanced Classification

Stochastic AUC Optimization Algorithms With Linear Convergence

An Adaptive Moment estimation method for online AUC maximization.

Peer-To-Peer Lending: Classification in the Loan Application Process

Partial AUC maximization in a linear combination of dichotomizers

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Area Under The ROC Curve Maximization Research Articles

Articles published on Area Under The ROC Curve Maximization

AUC Maximization for Low-Resource Named Entity Recognition

Differentially private empirical risk minimization for AUC maximization

Anomaly detection with inexact labels

AUC-Based Extreme Learning Machines for Supervised and Semi-Supervised Imbalanced Classification

Stochastic AUC Optimization Algorithms With Linear Convergence

An Adaptive Moment estimation method for online AUC maximization.

Peer-To-Peer Lending: Classification in the Loan Application Process

Partial AUC maximization in a linear combination of dichotomizers