Detecting Misflagged Duplicate Questions in Community Question-Answering Archives

Doris Hoogeveen,Yitong Li,Timothy Baldwin,Karin Verspoor,Andrew Bennett

doi:10.1609/icwsm.v12i1.15011

Abstract

In this paper we introduce the task of misflagged duplicate question detection for question pairs in community question-answer (cQA) archives and compare it to the more standard task of detecting valid duplicate questions. A misflagged duplicate is a question that has been erroneously hand-flagged by the community as a duplicate of an archived one, where the two questions are not actually the same. We find that form is flagged duplicate detection, meta data features that capture user authority, question quality, and relational data between questions, outperform pure text-based methods, while for regular duplicate detection a combination of meta data features and semantic features gives the best results. We show that misflagged duplicate questions are even more challenging to model than regular duplicate question detection, but that good results can still be obtained.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Proceedings of the International AAAI Conference on Web and Social Media	Publication Date: Jun 15, 2018
Citations: 18	License type: public-domain

R Discovery Prime

R Discovery Prime

Detecting Misflagged Duplicate Questions in Community Question-Answering Archives

Abstract

Talk to us

Similar Papers

More From: Proceedings of the International AAAI Conference on Web and Social Media

Lead the way for us

Similar Papers

Duplicate Question Detection with Deep Learning Using Word2Vec
P Lavanya Kumari ... P Ram Teja
-
P Lavanya Kumari, et. al.P Lavanya Kumari ... P Ram Teja
02 Sep 2021
02 Sep 2021

Bert-QAnet: BERT-encoded hierarchical question-answer cross-attention network for duplicate question detection
Xuan Zhao ... Jimmy Xiangji Huang
Neurocomputing | VOL. 509
Xuan Zhao, et. al.Xuan Zhao ... Jimmy Xiangji Huang
20 Aug 2022
Neurocomputing | VOL. 509

DeepDup: Duplicate Question Detection in Community Question Answering
Mohomed Shazan Mohomed Jabbar ... Sankalp Prabharkar
-
Mohomed Shazan Mohomed Jabbar, et. al.Mohomed Shazan Mohomed Jabbar ... Sankalp Prabharkar
23 Jul 2021
23 Jul 2021

Forum Duplicate Question Detection by Domain Adaptive Semantic Matching
Zhuojia Xu ... Hua Yuan
IEEE Access | VOL. 8
Zhuojia Xu, et. al.Zhuojia Xu ... Hua Yuan
01 Jan 2020
IEEE Access | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Detecting Misflagged Duplicate Questions in Community Question-Answering Archives

Abstract

Talk to us

Similar Papers

More From: Proceedings of the International AAAI Conference on Web and Social Media