Abstract

Machine Reading Comprehension (MRC) is an AI challenge that requires machines to determine the correct answer to a question based on a given passage. Extractive MRC requires extracting an answer span to a question from a given passage, as in the span extraction task. In contrast, non-extractive MRC infers answers from the content of reference passages, including Yes/No question answering and unanswerable questions. Due to the specificity of the two types of MRC tasks, researchers usually work on each type of task separately, but real-life applications often require models that can handle many different types of tasks in parallel. Therefore, to meet the comprehensive requirements of such applications, we construct a multi-task fusion training reading comprehension model based on the BERT pre-training model. The model uses the BERT pre-training model to obtain contextual representations, which are then shared by three downstream sub-modules for span extraction, Yes/No question answering, and unanswerable questions; we then fuse the outputs of the three sub-modules into a new span extraction output and use a fused cross-entropy loss function for global training. In the training phase, since our model requires a large amount of labeled training data, which is often expensive to obtain or unavailable in many tasks, we additionally use self-training to generate pseudo-labeled training data, improving the model's accuracy and generalization performance. We evaluated our model on the SQuAD2.0 and CAIL2019 datasets. The experiments show that our model can efficiently handle different tasks. We achieved 83.2 EM and 86.7 F1 scores on SQuAD2.0 and 73.0 EM and 85.3 F1 scores on CAIL2019.
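
The abstract does not spell out implementation details, so the following is only a minimal sketch of how a shared-encoder, three-head architecture with fused outputs could look in PyTorch. The fusion scheme (appending the Yes/No and no-answer scores as extra virtual positions in the start/end distributions), the "bert-base-uncased" checkpoint name, and all class and function names are our own assumptions for illustration, not the authors' released code.

```python
# Minimal sketch of a multi-task fusion MRC model (assumptions noted above).
import torch
import torch.nn as nn
from transformers import BertModel

class MultiTaskFusionMRC(nn.Module):
    def __init__(self, model_name="bert-base-uncased"):
        super().__init__()
        self.bert = BertModel.from_pretrained(model_name)   # shared contextual encoder
        hidden = self.bert.config.hidden_size
        self.span_head = nn.Linear(hidden, 2)                # start/end logits per token
        self.yes_no_head = nn.Linear(hidden, 2)              # Yes / No scores
        self.no_answer_head = nn.Linear(hidden, 1)           # unanswerable score

    def forward(self, input_ids, attention_mask, token_type_ids):
        out = self.bert(input_ids=input_ids,
                        attention_mask=attention_mask,
                        token_type_ids=token_type_ids)
        seq = out.last_hidden_state                          # (B, T, H) token representations
        pooled = out.pooler_output                           # (B, H) sequence representation

        start_logits, end_logits = self.span_head(seq).split(1, dim=-1)
        start_logits = start_logits.squeeze(-1)              # (B, T)
        end_logits = end_logits.squeeze(-1)                  # (B, T)

        yes_no = self.yes_no_head(pooled)                    # (B, 2)
        no_ans = self.no_answer_head(pooled)                 # (B, 1)

        # Fuse the three sub-modules into one span-extraction output:
        # positions [0, T) are passage tokens, T and T+1 stand for Yes/No,
        # and T+2 stands for "no answer".
        fused_start = torch.cat([start_logits, yes_no, no_ans], dim=-1)  # (B, T+3)
        fused_end = torch.cat([end_logits, yes_no, no_ans], dim=-1)      # (B, T+3)
        return fused_start, fused_end

def fused_loss(fused_start, fused_end, start_positions, end_positions):
    # Cross-entropy over the extended label space; gold positions for Yes/No
    # and unanswerable examples are mapped to the virtual indices above.
    ce = nn.CrossEntropyLoss()
    return (ce(fused_start, start_positions) + ce(fused_end, end_positions)) / 2
```

Treating Yes/No and "no answer" as extra candidate positions lets a single cross-entropy objective train all three sub-modules jointly, which is one common way to realize the "fused cross-entropy loss" described above.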

Highlights

  • Machine reading comprehension (MRC) aims to teach machines to answer questions after understanding a given passage [1,2], which can be broadly classified into two categories: Extractive MRC and Non-extractive MRC

  • We propose a machine reading comprehension model based on multi-task fusion training, constructed on top of the BERT pre-training model

  • The ALBERT model, an improved variant of BERT, can effectively improve downstream performance on multi-sentence encoding tasks through three improvements: factorized embedding parameterization, cross-layer parameter sharing, and an inter-sentence coherence loss (the first of these is sketched after this list)
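
As an illustration of the first of these improvements, the sketch below shows factorized embedding parameterization in PyTorch. The sizes are ALBERT-base defaults; the class is written for illustration only and is not taken from the paper or the ALBERT codebase.

```python
# Illustrative sketch of ALBERT's factorized embedding parameterization:
# tokens are first mapped to a small embedding size E, then projected to the
# hidden size H, so parameters drop from V*H to V*E + E*H.
import torch.nn as nn

class FactorizedEmbedding(nn.Module):
    def __init__(self, vocab_size=30000, embedding_size=128, hidden_size=768):
        super().__init__()
        self.word_embeddings = nn.Embedding(vocab_size, embedding_size)  # V x E table
        self.projection = nn.Linear(embedding_size, hidden_size)         # E x H projection

    def forward(self, input_ids):
        return self.projection(self.word_embeddings(input_ids))          # (B, T, H)

# With V=30000, E=128, H=768: 30000*128 + 128*768 ≈ 3.9M parameters,
# versus 30000*768 ≈ 23M for an unfactorized BERT-style embedding table.
```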



Introduction

Machine reading comprehension (MRC) aims to teach machines to answer questions after understanding a given passage [1,2], and can be broadly classified into two categories: extractive MRC and non-extractive MRC. Extractive MRC requires models to extract the answer span of a question from a reference text; examples include the cloze test [3] and span extraction [4,5]. Non-extractive MRC infers answers to questions from the content of the referenced passage, including Yes/No question answering [7] and the unanswerable question task [6].
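
To make the distinction concrete, the toy examples below (invented for illustration, not drawn from SQuAD2.0 or CAIL2019) show the three answer types such a model must handle.

```python
# Hypothetical examples of the three answer types: extractive span,
# non-extractive Yes/No, and unanswerable.
examples = [
    {   # extractive: the answer is a span copied from the passage
        "passage": "BERT was released by Google in 2018.",
        "question": "Who released BERT?",
        "answer": {"type": "span", "text": "Google", "start": 21},
    },
    {   # non-extractive: a Yes/No answer inferred from the passage
        "passage": "BERT was released by Google in 2018.",
        "question": "Was BERT released before 2020?",
        "answer": {"type": "yes_no", "text": "yes"},
    },
    {   # non-extractive: the passage does not contain the answer
        "passage": "BERT was released by Google in 2018.",
        "question": "Who released GPT-3?",
        "answer": {"type": "unanswerable"},
    },
]
```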
