Improving Pseudo Labels with Global-Local Denoising Framework for Cross-lingual Named Entity Recognition

Zhuojun Ding,Dangyang Chen,Wei Wei,Xiaoye Qu

doi:10.24963/ijcai.2024/691

Abstract

Cross-lingual named entity recognition (NER) aims to train an NER model for the target language leveraging only labeled source language data and unlabeled target language data. Prior approaches either perform label projection on translated source language data or employ a source model to assign pseudo labels for target language data and train a target model on these pseudo-labeled data to generalize to the target language. However, these automatic labeling procedures inevitably introduce noisy labels, thus leading to a performance drop. In this paper, we propose a Global-Local Denoising framework (GLoDe) for cross-lingual NER. Specifically, GLoDe introduces a progressive denoising strategy to rectify incorrect pseudo labels by leveraging both global and local distribution information in the semantic space. The refined pseudo-labeled target language data significantly improves the model's generalization ability. Moreover, previous methods only consider improving the model with language-agnostic features, however, we argue that target language-specific features are also important and should never be ignored. To this end, we employ a simple auxiliary task to achieve this goal. Experimental results on two benchmark datasets with six target languages demonstrate that our proposed GLoDe significantly outperforms current state-of-the-art methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Improving Pseudo Labels with Global-Local Denoising Framework for Cross-lingual Named Entity Recognition

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Single-/Multi-Source Cross-Lingual NER via Teacher-Student Learning on Unlabeled Data in Target Language
Qianhui Wu ... Zijia Lin
-
Qianhui Wu, et. al.Qianhui Wu ... Zijia Lin
01 Jan 2020
01 Jan 2020

Reinforced Iterative Knowledge Distillation for Cross-Lingual Named Entity Recognition
Shining Liang ... Xianglin Zuo
-
Shining Liang, et. al.Shining Liang ... Xianglin Zuo
14 Aug 2021
14 Aug 2021

Cross-Lingual Named Entity Recognition for Heterogenous Languages
Yingwen Fu ... Nankai Lin
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 31
Yingwen Fu, et. al.Yingwen Fu ... Nankai Lin
01 Jan 2023
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 31

UniTrans : Unifying Model Transfer and Data Transfer for Cross-Lingual Named Entity Recognition with Unlabeled Data
Qianhui Wu ... Börje F Karlsson
-
Qianhui Wu, et. al.Qianhui Wu ... Börje F Karlsson
01 Jul 2020
01 Jul 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improving Pseudo Labels with Global-Local Denoising Framework for Cross-lingual Named Entity Recognition

Abstract

Talk to us

Similar Papers