Abstract

In this paper, we analyze data collected from Yelp to understand the importance of the social networks created between Yelp reviewers, and the impact it has on the businesses they patronize. We then look at the shape of the social network generated by reviews and identify differences between the behavior of “elite” users vs. non-elite users. We then view the trends in network formation for particular businesses over time, and use features of the network of the business early in its development in order to predict its future success. We construct linear regression and random forest models that solely use features derived from review data, as well as models that are built using a combination of review and social network features. We see that the additional network features are statistically signifant, and help reduce the root mean squared error of our models by a significant percentage. Ultimately, using our network features of reviewers from the first three months of business, we can predict the number of reviewers for a business within its first year with an error of less than 2.5 reviews, an error of 8.5 over two year, and 13 over three.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.