Abstract

Cross-lingual Open Domain Question Answering (cross-lingual Open-QA) has been studied since it was first proposed in the mid-1990s. It can be divided into two mainstream tasks according to the training corpus used in the answer extraction stage: in one, both the training and testing data are in the target language; in the other, the training data is in the source language while the testing data is in the target language. For a long time, the former was addressed mainly through translation-based approaches. The latter did not emerge until 2019, when non-translation-based approaches became feasible thanks to the multilingual BERT model. Because the two tasks have so far been treated separately, we investigate whether they can be solved simultaneously without any additional transformation. We observe that the multilingual BERT model makes it possible to establish such a unified framework. However, two problems arise when the multilingual BERT model is used directly. First, in the document retrieval stage, applying the multilingual pretrained model directly to similarity calculation yields insufficient retrieval accuracy. Second, in the answer extraction stage, the answers involve different levels of abstraction with respect to the retrieved documents, which requires deeper exploration. This paper puts forward a multi-granularity semantic space learning approach for cross-lingual Open-QA, consisting of a Match-Retrieval module and a Multi-granularity-Extraction module. The matching network in the retrieval module heuristically adjusts and expands the learned features to improve retrieval quality. In the answer extraction module, deep semantic features are reused at the network-structure level through cross-layer concatenation, which enables the model to learn a multi-granularity semantic space. Experimental results on two public cross-lingual Open-QA datasets show the superiority of the proposed approach over state-of-the-art methods.
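To make the cross-layer concatenation idea concrete, the sketch below combines hidden states from several layers of a multilingual BERT encoder into one multi-granularity representation that feeds a span-extraction head. This is a minimal illustration under stated assumptions, not the paper's implementation: the layer indices, the bert-base-multilingual-cased checkpoint, the MultiGranularityExtractor class, and the linear span head are all hypothetical choices made for this example.

# Minimal sketch (not the paper's method): cross-layer concatenation of
# multilingual BERT hidden states for span-based answer extraction.
# Layer indices and the span head are assumptions for illustration.
import torch
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer

class MultiGranularityExtractor(nn.Module):
    def __init__(self, model_name="bert-base-multilingual-cased", layers=(4, 8, 12)):
        super().__init__()
        # output_hidden_states=True exposes every encoder layer, not just the last one
        self.encoder = AutoModel.from_pretrained(model_name, output_hidden_states=True)
        self.layers = layers  # hypothetical shallow / middle / deep layer choice
        hidden = self.encoder.config.hidden_size
        # concatenated multi-layer features feed a start/end span classifier
        self.span_head = nn.Linear(hidden * len(layers), 2)

    def forward(self, input_ids, attention_mask):
        out = self.encoder(input_ids=input_ids, attention_mask=attention_mask)
        # hidden_states[0] is the embedding layer; 1..12 are the transformer layers
        feats = torch.cat([out.hidden_states[i] for i in self.layers], dim=-1)
        start_logits, end_logits = self.span_head(feats).split(1, dim=-1)
        return start_logits.squeeze(-1), end_logits.squeeze(-1)

tokenizer = AutoTokenizer.from_pretrained("bert-base-multilingual-cased")
model = MultiGranularityExtractor()
enc = tokenizer("Who wrote Faust?", "Goethe schrieb Faust.", return_tensors="pt")
start, end = model(enc["input_ids"], enc["attention_mask"])

Concatenating shallow and deep layers exposes both surface-level token features and more abstract contextual features to the extraction head, which is one way to read the "multi-granularity semantic space" intuition described in the abstract.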
