An analysis of a high-performance japanese question answering system

Hideki Isozaki

doi:10.1145/1111667.1111670

Abstract

Twenty-five Japanese Question Answering systems participated in NTCIR QAC2 subtask 1. Of these, our system SAIQA-QAC2 performed the best: MRR = 0.607. SAIQA-QAC2 is an improvement on our previous system SAIQA-Ii that achieved MRR = 0.46 for QAC1. We mainly improved the answer-type determination module and the retrieval module. In general, a fine-grained answer taxonomy improves QA performance but it is difficult to build an accurate answer extraction module for the fine-grained taxonomy because Machine Learning methods require a huge training corpus and hand-crafted rules are hard to maintain. Therefore, we built a fine-grained system by using a coarse-grained named entity recognizer and a Japanese lexicon “Nihongo Goi-taikei.” Our experiments show that named entity/numerical expression recognition and word sense-based answer extraction mainly contributed to the performance. In addition, we developed a new proximity-based document retrieval module that performs better than BM25. We also compared its performance with MultiText, a conventional proximity-based retrieval method developed for QA.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An analysis of a high-performance japanese question answering system

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Asian Language Information Processing

Lead the way for us

Journal: ACM Transactions on Asian Language Information Processing	Publication Date: Sep 1, 2005
Citations: 8

Similar Papers

Improving text retrieval precision and answer accuracy in question answering systems
Matthew W Bilotti ... Eric Nyberg
-
Matthew W Bilotti, et. al.Matthew W Bilotti ... Eric Nyberg
01 Jan 2008
01 Jan 2008

Formalized answer extraction technology based on pattern learning
Li Peng ... Zhang Kai-Hui
-
Li Peng, et. al. Li Peng ... Zhang Kai-Hui
01 Oct 2010
01 Oct 2010

Developing Amharic Question Answering Model Over Unstructured Data Source Using Deep Learning Approach
Abenezer Mengistu Elema
-
Abenezer Mengistu ElemaAbenezer Mengistu Elema
28 Nov 2022
28 Nov 2022

SemBioNLQA: A semantic biomedical question answering system for retrieving exact and ideal answers to natural language questions.
Mourad Sarrouti ... Said Ouatik El Alaoui
Artificial Intelligence in Medicine | VOL. 102
Mourad Sarrouti, et. al.Mourad Sarrouti ... Said Ouatik El Alaoui
28 Nov 2019
Artificial Intelligence in Medicine | VOL. 102

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An analysis of a high-performance japanese question answering system

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Asian Language Information Processing