Opinion Spam Detection Research Articles

With the advent of social networking sites, opinion-mining applications have attracted the interest of online users who consult review sites before making purchase decisions. However, because of the growing trend of posting spam (fake) reviews to promote target products or to defame competitors' brands, Opinion Spam detection and classification has emerged as a pressing issue in the opinion mining and sentiment analysis community. We investigate Opinion Spam detection using different combinations of entities, features, and their sentiment scores. We enrich the feature set of a baseline Spam detection method with Spam detection features (Opinion Spam, Opinion Spammer, Item Spam). Using a dataset of reviews from the Amazon site and sentences labeled for Spam detection, we evaluate the role of spamicity-related features in detecting and classifying spam (fake) clues and distinguishing them from genuine reviews. For this purpose, we introduce a rule-based feature weighting scheme and propose a method for tagging each review sentence as spam or non-spam. Experimental results show that spam-related features improve Spam detection in review sentences posted on product review sites. Adding a revised feature weighting scheme increased accuracy from 93% to 96%. Furthermore, a hybrid set of features is shown to improve Opinion Spam detection in terms of precision, recall, and F-measure. This work shows that combining spam-related features with a rule-based weighting scheme can improve the performance of even a baseline Spam detection method. This improvement can be of use to Opinion Spam detection systems, given the growing interest of individuals and companies in separating fake (spam) from genuine (non-spam) product reviews.
The outcome of this work provides insight into spam-related features and feature weighting and will assist in developing more advanced applications for Opinion Spam detection. Previous state-of-the-art studies in this field used fewer spamicity-related features and less efficient feature weighting schemes. In contrast, we provide a revised feature selection and a revised feature weighting scheme with a normalized spamicity score computation technique. Our contribution is therefore novel to the field, as it provides a significant improvement over the compared methods.
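The abstract above reports results as accuracy, precision, recall, and F-measure. As a reminder of how the last three are usually computed for a binary spam/non-spam labeling, here is a minimal sketch using the standard definitions. The function name `precision_recall_f1` and the toy labels are hypothetical illustrations, not the authors' evaluation code or data.

```python
from typing import Sequence, Tuple


def precision_recall_f1(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[float, float, float]:
    """Compute precision, recall, and F1 for the spam class (1 = spam, 0 = non-spam)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # spam correctly flagged
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # genuine review flagged as spam
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # spam that was missed

    # Guard against division by zero when a class is never predicted or never present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0
    return precision, recall, f1


# Toy labels, invented purely for illustration.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 1, 0, 0, 1, 0, 1, 0]
precision, recall, f1 = precision_recall_f1(y_true, y_pred)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

Reporting all three alongside accuracy matters for spam detection because spam is often the minority class, so accuracy alone can look high even when many fake reviews are missed.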

Recently, the tremendous amount of unstructured text data distributed through news, blogs, and social media has gained much attention from researchers and practitioners, because it reflects abundant information and opinions in near real time. However, as the usefulness of text data increases, so do attempts to distort it, maliciously or not, in order to gain profit or achieve some other specific purpose. This increase in spam text data not only makes it harder for users to find the information they need among a large amount of inappropriate information, but also damages the reliability of the information itself and of the media that provide it. Therefore, efforts must be made to improve the reliability of information and the quality of analysis results by detecting and removing spam data from the original data in advance. For this purpose, many studies on spam detection have been actively conducted in areas such as opinion spam detection, spam e-mail detection, and web spam detection. In this study, we introduce core concepts and current research trends in spam detection and propose a methodology for detecting the spam tags of a blog as one way to improve the reliability of blog information.

Related Topics

Sentiment Analysis

Articles published on Opinion Spam Detection

Opinion spam detection: Using multi-iterative graph-based model

Opinion spam detection by incorporating multimodal embedded representation into a probabilistic review graph

Opinion spam detection framework using hybrid classification scheme

Opinion Spam Detection based on Annotation Extension and Neural Networks

Opinion Spam Detection and Analysis by Identifying Domain Features in Product Reviews

Learning to Detect Deceptive Opinion Spam: A Survey

A Novel Model for Opinion Spam Detection Based on Multi-Iteration Network Structure

Online Social Networking services and Spam Detection Approaches in Opinion Mining- A review

Spam analysis of big reviews dataset using Fuzzy Ranking Evaluation Algorithm and Hadoop

Opinion Spam Detection in Online Reviews

Evaluation of Data Mining Features, Features Taxonomies and their Applications

Credibility in social media: opinions, news, and health information—a survey

텍스트 분석의 신뢰성 확보를 위한 스팸 데이터 식별 방안 (A Method for Identifying Spam Data to Ensure the Reliability of Text Analysis)

Neural networks for deceptive opinion spam detection: An empirical study

Detection of fake opinions using time series

Detection of opinion spam based on anomalous rating deviation
