Abstract

Question subjectivity identification in Community Question Answering (CQA) has attracted a lot of attentions in recent years. With the rapid development of CQA, subjective questions posted by users are growing exponentially, which presents two challenges for question subjectivity identification. The first one is the data imbalance between subjective and objective questions. The second one is that the amount of manually labelled training data is hard to catch up with the fast developing speed of CQA. In this paper, we propose an adaptive semi-supervised Extreme Learning Machine (ASELM) to solve those two challenges. To resolve the data imbalance problem, ASELM employs the different impacts on identification performance caused by the imbalanced data. Second, the proposed method introduces the unlabelled data, and builds a model about the ratio between the number of labelled and unlabelled data based on Gaussian Model, which is applied to automatically generate the constraint on the unlabelled data. Experimental results showed ASELM improved identification performance for the imbalanced data, and outperformed the performance of basic ELM, SELM, Weighted ELM and SS-ELM on both F1 measure and accuracy.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.