Oversea Cross-Lingual Summarization Service in Multilanguage Pre-Trained Model through Knowledge Distillation

Xiwei Yang,Qi Ban,Limin Liu,Jing Yun,Bofei Zheng

doi:10.3390/electronics12245001

Abstract

Cross-lingual text summarization is a highly desired service for overseas report editing tasks and is formulated in a distributed application to facilitate the cooperation of editors. The multilanguage pre-trained language model (MPLM) can generate high-quality cross-lingual text summaries with simple fine-tuning. However, the MPLM does not adapt to complex variations, like the word order and tense in different languages. When the model performs on these languages with separate syntactic structures and vocabulary morphologies, it will lead to the low-level quality of the cross-lingual summary. The matter worsens when the cross-lingual summarization datasets are low-resource. We use a knowledge distillation framework for the cross-lingual summarization task to address the above issues. By learning the monolingual teacher model, the cross-lingual student model can effectively capture the differences between languages. Since the teacher and student models generate summaries in two languages, their representations lie on different vector spaces. In order to construct representation relationships across languages, we further propose a similarity metric, which is based on bidirectional semantic alignment, to map different language representations to the same space. In order to improve the quality of cross-lingual summaries further, we use contrastive learning to make the student model focus on the differentials among languages. Contrastive learning can enhance the ability of the similarity metric for bidirectional semantic alignment. Our experiments show that our approach is competitive in low-resource scenarios on cross-language summarization datasets in pairs of distant languages.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Oversea Cross-Lingual Summarization Service in Multilanguage Pre-Trained Model through Knowledge Distillation

Abstract

Talk to us

Similar Papers

More From: Electronics

Lead the way for us

Journal: Electronics	Publication Date: Dec 14, 2023
License type: CC BY 4.0

Similar Papers

Improving Neural Cross-Lingual Abstractive Summarization via Employing Optimal Transport Distance for Knowledge Distillation
Thong Thanh Nguyen ... Anh Tuan Luu
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 36
Thong Thanh Nguyen, et. al.Thong Thanh Nguyen ... Anh Tuan Luu
28 Jun 2022
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 36

Augmenting Low-Resource Cross-Lingual Summarization with Progression-Grounded Training and Prompting
Jiushun Ma ... Xiang Huang
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. -
Jiushun Ma, et. al.Jiushun Ma ... Xiang Huang
26 Jun 2024
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. -

Unifying Cross-lingual Summarization and Machine Translation with Compression Rate
Yu Bai ... Kai Fan
-
Yu Bai, et. al.Yu Bai ... Kai Fan
06 Jul 2022
06 Jul 2022

A Novel Wikipedia based Dataset for Monolingual and Cross-Lingual Summarization
Mehwish Fatima ... Michael Strube
-
Mehwish Fatima, et. al.Mehwish Fatima ... Michael Strube
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Oversea Cross-Lingual Summarization Service in Multilanguage Pre-Trained Model through Knowledge Distillation

Abstract

Talk to us

Similar Papers

More From: Electronics