Abstract
The aim of this paper is to develop a model to classify the stance expressed in social media texts. More specifically, the work presented focuses on tweets. In stance detection (SD) tasks, the objective is to identify the stance of a person towards a target of interest. In this paper, a model for SD is established and its variations are evaluated using different classifiers. The single models differ based on the pre-processing and the combination of features. To reduce the dimensionality of the feature space, analysis of variance (ANOVA) test is used. Then, two classifiers are employed as base learners including Random Forests (RF) and Support Vector Machines (SVM). Experimental analyses are conducted on SemEval dataset that is used as a benchmark for SD. Finally, the base learners that resulted from different design alternatives, are combined into three ensemble models. Experimental results show the significance of the used features and the effectiveness of a manually built dictionary that is used in the pre-processing stage. Moreover, the proposed ensembles outperform the state-of-the-art models in the overall test score, which suggests that ensemble learning is the best tool for effective SD in tweets.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: International Journal of Information Technology & Decision Making
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.