Abstract

With the massive growth of the Internet, text data has become one of the main formats of tourism big data. As an effective expression means of tourists’ opinions, text mining of such data has big potential to inspire innovations for tourism practitioners. In the past decade, a variety of text mining techniques have been proposed and applied to tourism analysis to develop tourism value analysis models, build tourism recommendation systems, create tourist profiles, and make policies for supervising tourism markets. The successes of these techniques have been further boosted by the progress of natural language processing (NLP), machine learning, and deep learning. With the understanding of the complexity due to this diverse set of techniques and tourism text data sources, this work attempts to provide a detailed and up-to-date review of text mining techniques that have been, or have the potential to be, applied to modern tourism big data analysis. We summarize and discuss different text representation strategies, text-based NLP techniques for topic extraction, text classification, sentiment analysis, and text clustering in the context of tourism text mining, and their applications in tourist profiling, destination image analysis, market demand, etc. Our work also provides guidelines for constructing new tourism big data applications and outlines promising research areas in this field for incoming years.

Highlights

  • A Review of Text Corpus-Based Tourism BigChengdu Institute of Computer Application, Chinese Academy of Sciences, Chengdu 610041, China

  • Text is an effective and widely existing form of opinion expression and evaluation by users, as shown by the large number of online review comments over tourism sites, hotels, and services.As a direct expression of users’ needs and emotions, text-based tourism data mining has the potential to transform the tourism industry

  • Tourism text big data mining techniques have made it possible to analyze the behaviors of tourists and realize real-time monitoring of the market

Read more

Summary

A Review of Text Corpus-Based Tourism Big

Chengdu Institute of Computer Application, Chinese Academy of Sciences, Chengdu 610041, China. Guizhou Provincial Key Laboratory of Public Big Data (Guizhou University), Guiyang, Guizhou 550025, China. College of Big Data Statistics, GuiZhou University of Finance and Economics, Guiyang, Department of Computer Science and Engineering, University of South Carolina, Columbia, SC 29208, USA

Introduction
Review Protocol Used in This Review
Text Representations
24 July 2019
Text Corpus-Based NLP Techniques in Tourism Data Mining
Topic Extraction
Text Classification
Sentiment Analysis
Text Clustering
Applications of Text Corpus-Based Tourism Big Data Mining
Tourist Profile
Market
Tourist
Methods
Outlook
Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.