Multi-level multilingual semantic alignment for zero-shot cross-lingual transfer learning

Anchun Gui,Han Xiao

doi:10.1016/j.neunet.2024.106217

Abstract

Recently, cross-lingual transfer learning has attracted extensive attention from both academia and industry. Previous studies usually focus only on the single-level alignment (e.g., word-level, sentence-level), based on pre-trained language models. However, it leads to suboptimal performance in downstream tasks of the low-resource language due to the missing correlation of hierarchical semantic information (e.g., sentence-to-word, word-to-word). Therefore, in this paper, we propose a novel multi-level alignment framework, which hierarchically learns the semantic correlation between multiple levels by leveraging well-designed alignment training tasks. In addition, we devise an attention-based fusion mechanism (AFM) to infuse semantic information from high levels. Extensive experiments on mainstream cross-lingual tasks (e.g., text classification, paraphrase identification, and named entity recognition) demonstrate the effectiveness of our proposed method, and also show that our model achieves state-of-the-art performance across various benchmarks compared to other strong baselines.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multi-level multilingual semantic alignment for zero-shot cross-lingual transfer learning

Abstract

Talk to us

Similar Papers

More From: Neural Networks

Lead the way for us

Similar Papers

Joint Extraction of Clinical Entities and Relations Using Multi-head Selection Method
Xintao Fang ... Yuting Song
-
Xintao Fang, et. al.Xintao Fang ... Yuting Song
11 Dec 2021
11 Dec 2021

Enhancing Cross-Lingual Sarcasm Detection by a Prompt Learning Framework with Data Augmentation and Contrastive Learning
Tianbo An ... Pingping Yan
Electronics | VOL. 13
Tianbo An, et. al.Tianbo An ... Pingping Yan
01 Jun 2024
Electronics | VOL. 13

Transformer-based Named Entity Recognition for Clinical Cancer Drug Toxicity by Positive-unlabeled Learning and KL Regularizers
Weixin Xie ... Chengkui Zhao
Current Bioinformatics | VOL. 19
Weixin Xie, et. al.Weixin Xie ... Chengkui Zhao
01 Sep 2024
Current Bioinformatics | VOL. 19

On the Power of Pre-Trained Text Representations
Yu Meng ... Jiawei Han
-
Yu Meng, et. al.Yu Meng ... Jiawei Han
14 Aug 2021
14 Aug 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-level multilingual semantic alignment for zero-shot cross-lingual transfer learning

Abstract

Talk to us

Similar Papers

More From: Neural Networks