Semantics Altering Modifications for Evaluating Comprehension in Machine Reading

Viktor Schlegel,Goran Nenadic,Riza Batista-Navarro

doi:10.1609/aaai.v35i15.17622

Abstract

Advances in NLP have yielded impressive results for the task of machine reading comprehension (MRC), with approaches having been reported to achieve performance comparable to that of humans. In this paper, we investigate whether state-of-the-art MRC models are able to correctly process Semantics Altering Modifications (SAM): linguistically-motivated phenomena that alter the semantics of a sentence while preserving most of its lexical surface form. We present a method to automatically generate and align challenge sets featuring original and altered examples. We further propose a novel evaluation methodology to correctly assess the capability of MRC systems to process these examples independent of the data they were optimised on, by discounting for effects introduced by domain shift. In a large-scale empirical study, we apply the methodology in order to evaluate extractive MRC models with regard to their capability to correctly process SAM-enriched data. We comprehensively cover 12 different state-of-the-art neural architecture configurations and four training datasets and find that -- despite their well-known remarkable performance -- optimised models consistently struggle to correctly process semantically altered data.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence	Publication Date: May 18, 2021
Citations: 2	License type: cc-by

R Discovery Prime

R Discovery Prime

Semantics Altering Modifications for Evaluating Comprehension in Machine Reading

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence

Lead the way for us

Similar Papers

Efficient Machine Reading Comprehension for Health Care Applications: Algorithm Development and Validation of a Context Extraction Approach.
Duy-Anh Nguyen ... Ryszard Kowalczyk
JMIR Formative Research | VOL. 8
Duy-Anh Nguyen, et. al.Duy-Anh Nguyen ... Ryszard Kowalczyk
25 Mar 2024
JMIR Formative Research | VOL. 8

An Iterative Multi-Source Mutual Knowledge Transfer Framework for Machine Reading Comprehension
Xin Liu ... Xiang Li
-
Xin Liu, et. al.Xin Liu ... Xiang Li
01 Jul 2020
01 Jul 2020

A Survey on Machine Reading Comprehension—Tasks, Evaluation Metrics and Benchmark Datasets
Changchang Zeng ... Qin Li
Applied sciences | VOL. 10
Changchang Zeng, et. al.Changchang Zeng ... Qin Li
29 Oct 2020
Applied sciences | VOL. 10

Explicit Utilization of General Knowledge in Machine Reading Comprehension
Chao Wang ... Hui Jiang
-
Chao Wang, et. al.Chao Wang ... Hui Jiang
01 Jan 2019
01 Jan 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Semantics Altering Modifications for Evaluating Comprehension in Machine Reading

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence