PLM-PGHC: A novel de-biasing framework for robust question answering

Shujuan Yu,Yun Zhang,Mengjie Wu,Na Xie,Liya Huang

doi:10.3233/jifs-233029

Abstract

Reading Comprehension models have achieved superhuman performance on mainstream public datasets. However, many studies have shown that the models are likely to take advantage of biases in the datasets, which makes it difficult to efficiently reasoning when generalizing to out-of-distribution datasets with non-directional bias, resulting in serious accuracy loss. Therefore, this paper proposes a pre-trained language model based de-biasing framework with positional generalization and hierarchical combination. In this work, generalized positional embedding is proposed to replace the original word embedding to initially weaken the over-dependence of the model on answer distribution information. Secondly, in order to make up for the influence of regularization randomness on training stability, KL divergence term is introduced into the loss function to constrain the distribution difference between the two sub models. Finally, a hierarchical combination method is used to obtain classification outputs that fuse text features from different encoding layers, so as to comprehensively consider the semantic features at the multidimensional level. Experimental results show that PLM-PGHC helps learn a more robust QA model and effectively restores the F1 value on the biased distribution from 37.51% to 81.78%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

PLM-PGHC: A novel de-biasing framework for robust question answering

Abstract

Talk to us

Similar Papers

More From: Journal of Intelligent & Fuzzy Systems

Lead the way for us

Similar Papers

How to Pre-train Your Model? Comparison of Different Pre-training Models for Biomedical Question Answering
Sanjay Kamath ... Yue Ma
-
Sanjay Kamath, et. al.Sanjay Kamath ... Yue Ma
01 Jan 2020
01 Jan 2020

Spherical Latent Spaces for Stable Variational Autoencoders
Jiacheng Xu ... Greg Durrett
-
Jiacheng Xu, et. al.Jiacheng Xu ... Greg Durrett
01 Jan 2018
01 Jan 2018

Segmentation with mixed supervision: Confidence maximization helps knowledge distillation.
Bingyuan Liu ... Ismail Ben Ayed
Medical Image Analysis | VOL. 83
Bingyuan Liu, et. al.Bingyuan Liu ... Ismail Ben Ayed
01 Jan 2023
Medical Image Analysis | VOL. 83

Machine Reading Comprehension of High-Tech Industry Policies: A New Dataset and Chinese Pre-Trained Language Model
Changchang Zeng ... Shaobo Li
-
Changchang Zeng, et. al.Changchang Zeng ... Shaobo Li
10 Dec 2021
10 Dec 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

PLM-PGHC: A novel de-biasing framework for robust question answering

Abstract

Talk to us

Similar Papers

More From: Journal of Intelligent &amp; Fuzzy Systems

More From: Journal of Intelligent & Fuzzy Systems