CQuAE: A new Contextualized QUestion Answering corpus on Education domain

Thomas Gerald, Louis Tamames, Sofiane Ettayeb, Ha-Quang Le, Patrick Paroubek, Anne Vilnat

doi:10.1016/j.datak.2024.102305

Abstract

Generating education-related questions and answers remains an open problem, although it would be useful for students, teachers, and teaching aids. Given textual course material, we are interested in generating non-factual questions that require an elaborate answer (relying on analysis or reasoning). Despite the availability of annotated corpora of questions and answers, the effort to develop a generator using deep learning faces two main challenges. First, freely accessible, high-quality data are insufficient to train generative approaches. Second, for a stand-alone application, we do not have explicit support to guide the generation toward complex questions. To tackle the first issue, we propose a new corpus based on education documents. For the second, we study several retargetable language algorithms that produce answers by extracting text spans from contextual documents to help the generation of questions. In particular, we study the contribution of deep neural syntactic parsing and transformer-based semantic representation, taking into account the question type (according to our specific question typology) and the contextual support text span. Additionally, recent advances in generation models have shown the effectiveness of the instruction-based approach for natural language generation. Consequently, we propose a first investigation of very large language models to generate questions related to the education domain.

Published in: Data & Knowledge Engineering

Similar Papers

Paragraph-level Neural Question Generation with Maxout Pointer and Gated Self-attention Networks
Yao Zhao ... Yuanyuan Ding
01 Jan 2018

Leveraging Structured Information from a Passage to Generate Questions
Jian Xu ... Mingtao Zhou
Tsinghua Science and Technology | VOL. 28
01 Jun 2023

Tree-of-Reasoning Question Decomposition for Complex Question Answering with Large Language Models
Kun Zhang ... Yuanzhuo Wang
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 38
24 Mar 2024

EduQG: A Multi-Format Multiple-Choice Dataset for the Educational Domain
Amir Hadifar ... Chris Develder
IEEE Access | VOL. 11
01 Jan 2023
