Discriminative Feature Spamming Technique for Roman Urdu Sentiment Analysis

Khawar Mehmood,Muhammad Kamran Malik,Daryl Essam,Kamran Shafi

doi:10.1109/access.2019.2908420

Khawar Mehmood, Muhammad Kamran Malik + Show 2 more

Open Access

https://doi.org/10.1109/access.2019.2908420

Copy DOI

Journal: IEEE Access	Publication Date: Jan 1, 2019
Citations: 29	License type: cc-by-nc-nd

Affiliation: University of Canberra, UNSW Sydney

Abstract

Term weighting is one of the most commonly used approaches, which works by assigning weights to terms, that aims to improve the performance of information retrieval or text categorization tasks. In this paper, we present a novel term weighting technique, called discriminative feature spamming technique (DFST), which identifies distinctive terms, based on a term utility criteria (TUC), and then spams them to increase their discriminative power. The experimental results show that the DFST outperformed a set of time-tested term weighting schemes, from the information retrieval field. All the experiments were performed on the largest ever Roman Urdu (RU) dataset of 11000 reviews, which was collected and annotated for this work. In addition, a custom tokenizer was built, which further improved classification accuracy. A cross-scheme comparison was performed, which showed that the results obtained by using the newly proposed DFST, were statistically significant and better than previous approaches.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Discriminative Feature Spamming Technique for Roman Urdu Sentiment Analysis

Abstract

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Several alternative term weighting methods for text representation and classification
Zhong Tang ... Song Li
Knowledge-Based Systems | VOL. 207
Zhong Tang, et. al.Zhong Tang ... Song Li
14 Aug 2020
Knowledge-Based Systems | VOL. 207

Query expansion based on a semantic graph model
Xue Jiang
-
Xue JiangXue Jiang
24 Jul 2011
24 Jul 2011

A noun-based approach to feature location using time-aware term-weighting
Sima Zamani ... John Anvik
Information and Software Technology | VOL. 56
Sima Zamani, et. al.Sima Zamani ... John Anvik
26 Mar 2014
Information and Software Technology | VOL. 56

Improving Term Weighting for Community Question Answering Search Using Syntactic Analysis
David Carmel ... Avihai Mejer
-
David Carmel, et. al.David Carmel ... Avihai Mejer
03 Nov 2014
03 Nov 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Discriminative Feature Spamming Technique for Roman Urdu Sentiment Analysis

Abstract

Talk to us

Similar Papers

More From: IEEE Access