Abstract

The sheer volume of products sold online makes business reviews a valuable source for consumers who want to make sound decisions before purchasing. Reviews help readers learn more about a product and gauge its quality. However, fake reviews and reviewers form the bulk of the review corpus, which makes review spamming an open research challenge. Spam reviews must be detected so that their contribution to product recommendations can be nullified. Researchers and communities have treated spam detection as a serious concern, yet there is still room for exploring performance on large-scale, complex datasets. This work contributes robust feature selection with derived features that provide more detail on malicious reviews and spammers. Ensemble and other standard machine learning techniques are trained and evaluated over the optimal feature sets. In addition, the Metapath-based Graph Convolution Network (M-GCN) framework is proposed, an implicit knowledge extraction method that automatically captures the complex semantic meaning of reviews from the heterogeneous network. It analyzes the relationships among the triplet of users, reviews, and products on e-commerce sites by examining Top-n feature sets in a mutually reinforcing manner. The proposed model is evaluated on Yelp and Amazon benchmark datasets and is shown to outperform state-of-the-art techniques, both with and without graph utilization, achieving an accuracy of 96% in the prediction task.
