Abstract

SummaryNowadays, online product reviews strongly influence the purchase decision of consumers in e‐commerce platforms. Driven by the immense financial profits, review spammers deliberately post fake reviews to promote or demote their target products. Some spammers are even organized as groups to work together and try to take total control of the sentiment on their target products. To detect such spammer groups, most previous works exploit frequent itemset mining (FIM) to find spammer group candidates and then use unsupervised spamicity ranking methods to identify real spammer groups. However, these methods usually suffer from the problem of threshold setting, ie, high support value finding fewer groups while low support value leading to more coincidentally generated groups and computational inefficiency. Moreover, the unsupervised methods are not able to make good use of labeled instances which are actually obtainable in practice. In this paper, we propose CONSGD, a cosine pattern and heterogeneous information network–based spammer group detecting method. Specifically, the CONSGD uses cosine pattern mining (CPM) to discover tight spammer group candidates with a respective low support value, where the cosine threshold is utilized to avoid coincidentally generated groups. Moreover, CONSGD employs heterogeneous information network classification to identify the real spammer groups, which could utilize the labeled instances and do not rely to the assumption of independent instances. Experiments on real‐life dataset show that our proposed CONSGD is effective and outperforms the state‐of‐the‐art spammer group detection methods.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.