Building a deep learning-based QA system from a CQA dataset

Sol Jin,Xu Lian,Hanearl Jung,Jinsoo Park,Jihae Suh

doi:10.1016/j.dss.2023.114038

Abstract

A man-made machine-reading comprehension (MRC) dataset is necessary to train the answer extraction part of existing Question Answering (QA) systems. However, a high-quality and well-structured dataset with question-paragraph-answer pairs is not usually found in the real world. Furthermore, updating or building an MRC dataset is a challenging and costly affair. To address these shortcomings, we propose a QA system that uses a large-scale English Community Question Answering (CQA) dataset (i.e., Stack Exchange) composed of 3,081,834 question-answer pairs. The QA system adopts a classifier-retriever-summarizer structure design. The question classifier and the answer retriever part are based on a Bidirectional Encoder Representations from Transformers (BERT) Natural Language Processing (NLP) model by Google, and the summarizer part introduces a deep learning-based Text-to-Text Transfer Transformer (T5) model to summarize the long answers. We instantiated the proposed QA system with 140 topics from the CQA dataset (including topics such as biology, law, politics, etc.) and conducted human and automatic evaluations. Our system presented encouraging results, considering that it provides high-quality answers to the questions in the test set and satisfied the requirements to develop a QA system without MRC datasets. Our results show the potential of building automatic and high-performance QA systems without being limited by man-made datasets, a significant step forward in the research of open-domain or specific-domain QA systems.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Building a deep learning-based QA system from a CQA dataset

Abstract

Talk to us

Similar Papers

More From: Decision Support Systems

Lead the way for us

Journal: Decision Support Systems	Publication Date: Jun 15, 2023
Citations: 3

Similar Papers

BERT+vnKG: Using Deep Learning and Knowledge Graph to Improve Vietnamese Question Answering System
Truong H V Phan ... Phuc Do
International Journal of Advanced Computer Science and Applications | VOL. 11
Truong H V Phan, et. al.Truong H V Phan ... Phuc Do
01 Jan 2020
International Journal of Advanced Computer Science and Applications | VOL. 11

Bilingual Question Answering System Using Bidirectional Encoder Representations from Transformers and Best Matching Method
Dini Adni Navastara ... Agus Zainal Arifin
-
Dini Adni Navastara, et. al.Dini Adni Navastara ... Agus Zainal Arifin
20 Oct 2021
20 Oct 2021

Analysis of QA System Behavior against Context and Question Changes
Rachid Karra ... Abdelali Lasfar
The International Arab Journal of Information Technology | VOL. 21
Rachid Karra, et. al.Rachid Karra ... Abdelali Lasfar
01 Jan 2024
The International Arab Journal of Information Technology | VOL. 21

Investigating Query Expansion and Coreference Resolution in Question Answering on BERT
Santanu Bhattacharjee ... Gideon Maillette De Buy Wenniger
-
Santanu Bhattacharjee, et. al.Santanu Bhattacharjee ... Gideon Maillette De Buy Wenniger
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Building a deep learning-based QA system from a CQA dataset

Abstract

Talk to us

Similar Papers

More From: Decision Support Systems