Towards the benchmarking of question generation: introducing the Monserrate corpus

Hugo Rodrigues, Luisa Coheur, Eric Nyberg

doi:10.1007/s10579-021-09545-5

Abstract

Despite the growing interest in Question Generation, evaluating these systems remains notably difficult. Many authors rely on automatic metrics such as BLEU or ROUGE rather than on manual evaluation, as they are nearly free to compute. However, the corpora generally used as references are very incomplete, containing just a couple of hypotheses per source sentence. In this paper, we propose the Monserrate corpus, a dataset built specifically to evaluate Question Generation systems, with, on average, 26 questions associated with each source sentence, in an attempt to provide an “exhaustive” reference. With Monserrate, we study the impact of the reference size on the evaluation of Question Generation systems. Several evaluation metrics are used, from more traditional lexical ones to metrics based on word embeddings, and we conclude that the metrics themselves are still a limiting factor in evaluation, as they lead to different outcomes. Finally, with Monserrate, we benchmark three different Question Generation systems, representing different approaches to this task.
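
To illustrate why the size of the reference matters for lexical metrics, the following is a minimal sketch, not the authors' evaluation code. It computes only the clipped unigram precision that forms one component of BLEU (no higher-order n-grams, no brevity penalty). The function name `clipped_unigram_precision` and all example sentences are assumptions invented for illustration; none of them come from the Monserrate corpus.

```python
from collections import Counter


def clipped_unigram_precision(hypothesis, references):
    """Clipped unigram precision, one component of BLEU.

    Each hypothesis token is credited at most as many times as it
    occurs in the single reference where it is most frequent.
    Tokenization is plain whitespace splitting (an assumption).
    """
    hyp_counts = Counter(hypothesis.lower().split())
    max_ref_counts = Counter()
    for ref in references:
        for tok, n in Counter(ref.lower().split()).items():
            max_ref_counts[tok] = max(max_ref_counts[tok], n)
    matched = sum(min(n, max_ref_counts[tok]) for tok, n in hyp_counts.items())
    total = sum(hyp_counts.values())
    return matched / total if total else 0.0


# Invented sentences, used only to show the effect of reference size.
hypothesis = "who built the palace ?"
small_reference = ["what is the name of the palace ?"]
large_reference = small_reference + ["who built the palace and when ?"]

score_small = clipped_unigram_precision(hypothesis, small_reference)
score_large = clipped_unigram_precision(hypothesis, large_reference)
print(score_small, score_large)
```

In this toy case the same generated question scores higher once a second, equally valid reference question is added, which is the kind of effect a larger reference per source sentence is meant to expose.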

Journal: Language Resources and Evaluation
Publication Date: Jun 3, 2021
Citations: 2

Similar Papers

Towards a Better Metric for Evaluating Question Generation Systems
Preksha Nema ... Mitesh M Khapra
01 Jan 2018

Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs
27 Jun 2022

Towards Process-Oriented, Modular, and Versatile Question Generation that Meets Educational Needs
Xu Wang ... Jessica Houghton
01 Jan 2021
