Spam analysis of big reviews dataset using Fuzzy Ranking Evaluation Algorithm and Hadoop

Komal Dhingra,Sumit Kr Yadav

doi:10.1007/s13042-017-0768-3

Abstract

Online reviews are the most easily available free information sources used by both organizations and customers to make decisions. Establishments are utilizing significance of opinions to earn undue profit by hiring professionals known as spammers, giving positive comments on their products and negative opinions on their competitor’s product. This activity is known as opinion spamming and should be identified to give genuine results containing sentiments towards a product. So far, opinion spam detection has been considered as a discrete classification problem, generally as spam and non-spam. However, it involves uncertainty as suspicious behavior of a user might be due to coincidence. As, fuzzy logic handles real world uncertainty very well, we propose a novel fuzzy modeling based solution to the problem. We have proposed four fuzzy input linguistic variable and considered suspicious level of a spammer group to be one of—Ultra, Mega, Immense, Highly, Moderate, Slightly and Feebly. We have defined novel FSL Deduction Algorithm generating 81 fuzzy rules and Fuzzy Ranking Evaluation Algorithm (FREA) to determine the extent to which a group is suspicious. As reviews dataset satisfy the three V’s of big data (Volume, Velocity and Variety), we have considered this problem as a big data problem and used Hadoop for storage and analyzation. We have further demonstrated our proposed algorithm using a sample reviews dataset and Amazon reviews dataset achieving an accuracy of 80.77% which unlike other approaches remains steady for large number of groups and deals well with uncertainty involved in opinion spam detection.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Spam analysis of big reviews dataset using Fuzzy Ranking Evaluation Algorithm and Hadoop

Abstract

Talk to us

Similar Papers

More From: International Journal of Machine Learning and Cybernetics

Lead the way for us

Journal: International Journal of Machine Learning and Cybernetics	Publication Date: Dec 16, 2017
Citations: 22

Similar Papers

Opinion spam detection framework using hybrid classification scheme
Muhammad Zubair Asghar ... Aurangzeb Khan
Soft Computing | VOL. 24
Muhammad Zubair Asghar, et. al.Muhammad Zubair Asghar ... Aurangzeb Khan
11 Jun 2019
Soft Computing | VOL. 24

Opinion Spam Detection in Online Reviews
Ajay Rastogi ... Monica Mehrotra
Journal of Information & Knowledge Management | VOL. 16
Ajay Rastogi, et. al.Ajay Rastogi ... Monica Mehrotra
23 Nov 2017
Journal of Information & Knowledge Management | VOL. 16

Opinion Spam Detection in Online Reviews Using Neural Networks
K Archchitha ... E.Y.A Charles
-
K Archchitha, et. al.K Archchitha ... E.Y.A Charles
01 Sep 2019
01 Sep 2019

Impact of Behavioral and Textual Features on Opinion Spam Detection
Ajay Rastogi ... Monica Mehrotra
-
Ajay Rastogi, et. al.Ajay Rastogi ... Monica Mehrotra
01 Jun 2018
01 Jun 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Spam analysis of big reviews dataset using Fuzzy Ranking Evaluation Algorithm and Hadoop

Abstract

Talk to us

Similar Papers

More From: International Journal of Machine Learning and Cybernetics