Abstract
Compared to traditional machine reading comprehension (MRC), which is limited to the information in a given passage, knowledge-driven MRC tasks require models to answer questions based on both the text and related commonsense knowledge. Although pre-trained Transformer-based language models (TrLMs) such as BERT and RoBERTa have shown strong performance on MRC, external knowledge such as unspoken commonsense and world knowledge still cannot be used and interpreted explicitly. In this work, we present three simple yet effective injection methods, integrated into the structure of TrLMs, for fine-tuning on downstream knowledge-driven MRC tasks with off-the-shelf commonsense representations. Moreover, we introduce a mask mechanism for token-level multi-hop relationship searching to filter external knowledge. We conduct extensive experiments on DREAM and CosmosQA, two prevalent knowledge-driven datasets. Experimental results indicate that the incremental TrLMs outperform the baseline systems by 1%-4.1% with lower computational cost. Further analysis shows the effectiveness of the proposed methods and the robustness of the incremental model when the training set is incomplete.
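The abstract does not specify how the commonsense representations are injected or how the token-level mask is applied, so the following is only a minimal illustrative sketch of one plausible design: a gated fusion layer that adds projected external knowledge vectors to a TrLM's token hidden states, with a binary mask restricting the injection to tokens for which related knowledge was retrieved. All class, variable, and dimension names here (e.g. `KnowledgeInjection`, `knowledge_mask`) are assumptions for illustration, not the paper's actual method.

```python
# Illustrative sketch only: gated injection of off-the-shelf commonsense
# embeddings into Transformer token states, filtered by a token-level mask.
import torch
import torch.nn as nn


class KnowledgeInjection(nn.Module):
    """Fuse external knowledge vectors into token hidden states with a gate."""

    def __init__(self, hidden_size: int, knowledge_size: int):
        super().__init__()
        self.proj = nn.Linear(knowledge_size, hidden_size)   # align dimensions
        self.gate = nn.Linear(2 * hidden_size, hidden_size)  # per-token gate

    def forward(self, hidden, knowledge, knowledge_mask):
        # hidden:         (batch, seq_len, hidden_size)    TrLM token states
        # knowledge:      (batch, seq_len, knowledge_size) retrieved commonsense vectors
        # knowledge_mask: (batch, seq_len) 1 where related knowledge exists, else 0
        k = self.proj(knowledge)
        g = torch.sigmoid(self.gate(torch.cat([hidden, k], dim=-1)))
        fused = hidden + g * k
        # Keep the original hidden state for tokens with no external knowledge.
        mask = knowledge_mask.unsqueeze(-1).to(fused.dtype)
        return mask * fused + (1.0 - mask) * hidden


# Toy usage with random tensors (BERT-base-sized hidden states assumed)
if __name__ == "__main__":
    layer = KnowledgeInjection(hidden_size=768, knowledge_size=300)
    h = torch.randn(2, 16, 768)
    k = torch.randn(2, 16, 300)
    m = torch.randint(0, 2, (2, 16))
    print(layer(h, k, m).shape)  # torch.Size([2, 16, 768])
```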