Opinion Spam Detection Research Articles

The rise of e-commerce platforms has shifted many people from traditional to digital marketing, where business is conducted online and competition has reached high levels. These platforms make purchasing easier and offer customers further advantages, such as a wide range of high-quality products, low prices, the ability to buy at any time and, more importantly, information and reviews about the products. Unfortunately, many companies mislead customers into buying their products, or demote their competitors' products, by using deceptive opinion spam, which negatively affects purchasers' decisions and behavior. Deceptive opinion spam is written deliberately to seem legitimate and authentic in order to mislead customers' purchases. Consequently, detecting these opinions is a hard task for both humans and machines. Most studies are based on traditional machine learning and sparse feature engineering. However, these models do not capture the semantic aspect of reviews, which, according to many researchers, is the key to detecting deceptive opinion spam. Besides, only a few studies use contextual information by adopting neural networks, compared with the many that rely on traditional machine learning classifiers. These models have numerous shortcomings because their representations are obtained by mining each review considering only words, sentences, reviews, or a combination of them, and the reviews are then classified based on those representations. In fact, deceptive opinions are written by the same deceivers, belonging to the same companies, with similar aims: to promote or demote a product. In other words, deceptive opinion spam tends to be semantically coherent across reviews. To the best of our knowledge, no model tries to obtain a representation based on the contextual relationships between opinions.
This article proposes using a capsule neural network, bidirectional long short-term memory, an attention mechanism, and the paragraph vector distributed bag of words model to detect deceptive opinion spam. Our model provides a powerful representation of the opinions since it centers on preserving their contexts and the relationships between them. The results show that our model significantly outperforms the existing state-of-the-art models.


Abstract

Purpose: This paper aims to analyze the effectiveness of two major types of features, metadata-based (behavioral) and content-based (textual), in opinion spam detection.

Design/methodology/approach: Based on spam-detection perspectives, our approach works in three settings: review-centric (spam detection), reviewer-centric (spammer detection) and product-centric (spam-targeted product detection). To counter classifier bias, we employ four classifiers so that the results give a better and less biased picture. In addition, we propose a new set of features, which are compared against those of some well-known related works. Experiments performed on two real-world datasets show the effectiveness of different features in opinion spam detection.

Findings: Our findings indicate that behavioral features are both more efficient and more effective than textual features for detecting opinion spam across all three settings. In addition, models trained on hybrid features produce results closer to those of models trained on behavioral features than to those of models trained on textual features, further establishing behavioral features as the dominant indicators of opinion spam. The features used in this work improve on existing features used in other related works. Furthermore, the computation time analysis for the feature extraction phase shows that behavioral features are more cost-efficient than textual features.

Research limitations: The analyses conducted in this paper are limited to two well-known datasets from Yelp.com, YelpZip and YelpNYC.

Practical implications: The results obtained in this paper can be used to improve the detection of opinion spam; researchers may work on improving and developing feature engineering and selection techniques focused more on metadata information.

Originality/value: To the best of our knowledge, this study is the first of its kind to consider three perspectives (review-centric, reviewer-centric and product-centric) and four classifiers in analyzing the effectiveness of opinion spam detection using two major types of features. This study also introduces some novel features, which help to improve the performance of opinion spam detection methods.



Articles published on Opinion Spam Detection

Understanding Large-Scale Network Effects in Detecting Review Spammers

A Contextual Relationship Model for Deceptive Opinion Spam Detection

Deceptive opinion spam detection using feature reduction techniques

Mining Weak Relations Between Reviews for Opinion Spam Detection

An Effective Framework for design of Dataset Using Twitter

A comprehensive survey of various methods in opinion spam detection

A text classification method based on a convolutional and bidirectional long short-term memory model

Opinion Spam Detection: A New Approach Using Machine Learning and Network-Based Algorithms

Deceptive opinion spam detection approaches: a literature survey

A Study on Diverse Methods and Performance Measures in Sentiment Analysis

Temporal Opinion Spam Detection by Multivariate Indicative Signals

Analyzing and Detecting Opinion Spam on a Large-scale Dataset via Temporal and Spatial Patterns

Opinion Mining On Web-Based Communities Using Optimised Clustering Algorithms

SC-Com: Spotting Collusive Community in Opinion Spam Detection

Analyzing the effectiveness of semi-supervised learning approaches for opinion spam classification

An unsupervised approach to detect review spam using duplicates of images, videos and Chinese texts

A non-convex semi-supervised approach to opinion spam detection by ramp-one class SVM

Online Fraud Review Detection Using Data Mining

An anomaly detection framework for time-evolving attributed networks

Effective Opinion Spam Detection: A Study on Review Metadata Versus Content

