Online reviews are increasingly recognized as a key source of information influencing consumer behavior. This in turn implies that competitive advantage can be achieved by manipulating users’ perceptions about restaurants. The hospitality industry is particularly susceptible to this issue because products and services in this industry can only be rated upon consumption. Therefore, many efforts have recently been dedicated to developing automatic methods for detecting fake reviews based on data intelligence in this sector. Recent studies suggest that both the semantic meaning of consumer reviews and the sentiment conveyed may be useful indicators of fake reviews. However, the semantic meaning may be context-sensitive and may also disregard sentiment information. Moreover, the content analysis approach should be integrated with the reviewer's behavior to reveal their true intentions. To address these problems, we propose a review representation model based on behavioural and sentiment-dependent linguistic features that effectively exploit the domain context. Using a large dataset of Yelp restaurant reviews, we demonstrate that the proposed review representation model is more effective than existing approaches in terms of detection accuracy. It furthermore accurately estimates the average rating assigned by legitimate reviewers, which has significant managerial implications for the hospitality industry.
Read full abstract