Toward Zero-Shot and Zero-Resource Multilingual Question Answering

Chia-Chih Kuo,Kuan-Yu Chen

doi:10.1109/access.2022.3207569

Chia-Chih Kuo, Kuan-Yu Chen

Open Access

https://doi.org/10.1109/access.2022.3207569

Copy DOI

Abstract

In recent years, multilingual question answering has been an emergent research topic and has attracted much attention. Although systems for English and other rich-resource languages that rely on various advanced deep learning-based techniques have been highly developed, most of them in low-resource languages are impractical due to data insufficiency. Accordingly, many studies have attempted to improve the performance of low-resource languages in a zero-shot or few-shot manner based on multilingual bidirectional encoder representations from transformers (mBERT) by transferring knowledge learned from rich-resource languages to low-resource languages. Most methods require either a large amount of unlabeled data or a small set of labeled data for low-resource languages. In Wikipedia, 169 languages have less than 10,000 articles, and 48 languages have less than 1,000 articles. This reason motivates us to conduct a zero-shot multilingual question answering task under a zero-resource scenario. Thus, this study proposes a framework to fine-tune the original mBERT using data from rich-resource languages, and the resulting model can be used for low-resource languages in a zero-shot and zero-resource manner. Compared to several baseline systems, which require millions of unlabeled data for low-resource languages, the performance of our proposed framework is not only highly comparative but is also better for languages used in training.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Access	Publication Date: Jan 1, 2022
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Toward Zero-Shot and Zero-Resource Multilingual Question Answering

Abstract

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

When Pairs Meet Triplets: Improving Low-Resource Captioning via Multi-Objective Optimization
Yike Wu ... Ying Zhang
ACM Transactions on Multimedia Computing, Communications, and Applications | VOL. 18
Yike Wu, et. al.Yike Wu ... Ying Zhang
04 Mar 2022
ACM Transactions on Multimedia Computing, Communications, and Applications | VOL. 18

Identification of Seven Low-Resource North-Eastern Languages: An Experimental Study
Joyanta Basu ... Swanirbhar Majumder
-
Joyanta Basu, et. al.Joyanta Basu ... Swanirbhar Majumder
01 Jan 2020
01 Jan 2020

LRSpeech
Jin Xu ... Jian Li
-
Jin Xu, et. al.Jin Xu ... Jian Li
20 Aug 2020
20 Aug 2020

Self-Supervised Contrastive Learning on Cross-Augmented Samples for SAR Target Recognition
Xiaoyu Liu ... Jifang Pei
-
Xiaoyu Liu, et. al.Xiaoyu Liu ... Jifang Pei
01 May 2023
01 May 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Toward Zero-Shot and Zero-Resource Multilingual Question Answering

Abstract

Talk to us

Similar Papers

More From: IEEE Access