Abstract

Question generation and question answering have attracted growing attention in recent years. Existing question generation systems produce questions from a given text. However, a wide gap remains between these generated questions and their practical use, which still requires substantial manual revision. To narrow this gap, we reduce the size of the question set (suite), extracting a lightweight subset that preserves as many features of the original set as possible. In this paper, we first propose a three-layer semantic analysis model that combines traditional language analysis tools to perform the reduction. We then design a set of metrics over semantic contribution to balance distinct features. Finally, we introduce Grade Level and Information Entropy to evaluate our model along multiple dimensions. Extensive experiments on question suite reduction show that the reduced subset retains as much of the diversity of the original large set as possible.
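
The abstract does not say how Information Entropy is computed. As a rough illustration only, the sketch below assumes Shannon entropy over the distribution of lowercase, whitespace-delimited tokens in a question set. The function name `token_entropy` and the sample questions are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def token_entropy(questions):
    """Shannon entropy, in bits, of the token distribution of a question set.

    Assumption: tokens are lowercase, whitespace-delimited words. The paper
    may tokenize differently or define entropy over other features.
    """
    tokens = [tok for q in questions for tok in q.lower().split()]
    if not tokens:
        return 0.0
    counts = Counter(tokens)
    total = len(tokens)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


# Hypothetical question suites, for illustration only.
original_suite = [
    "What is question generation?",
    "What is question answering?",
    "How is the question suite reduced?",
    "Which metrics balance distinct features?",
]
reduced_suite = [
    "What is question generation?",
    "How is the question suite reduced?",
]

print(f"original: {token_entropy(original_suite):.3f} bits")
print(f"reduced:  {token_entropy(reduced_suite):.3f} bits")
```

Under this assumption, a reduced suite whose entropy stays close to that of the original has kept most of the original's lexical variety, which is one possible reading of "retaining diversity".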
