Attractions in the tourism industry are one of the components that motivate tourists to visit destinations, such as entertainment, natural, cultural, and historical richness. For such reasons, people decide to visit unique destinations and spend time there. Almaty, the largest city of Kazakhstan, is one of the significant attraction centers of the Central Asia region, offering tourists unique and pleasant features with several tourist attractions. This study aims to analyze online user reviews of tourist attractions in Almaty, Kazakhstan, using machine learning and text mining methods. The primary focus is on identifying the main thematic clusters of reviews and their sentiment and comparing these themes with the types of attractions: historical, natural, and man-made. A total of 7,515 reviews were collected from the TripAdvisor website. The data was processed using sentiment analysis, topic modeling, and hierarchical clustering methods. The analysis revealed that 38% of the reviews were related to natural attractions, 34% to man-made, and 28% to historical ones. The most positive reviews were associated with natural attractions, while historical and man-made attractions received 79.38% and 81.40% positive reviews, respectively. In addition, the items that make up these attractions are identified, and their sentiment levels are pointed out. In addition to this situation, visitors have the most positive expressions for natural attractions, especially landscapes and lakes. The findings emphasize the importance of considering review themes to improve the quality of tourist services and to enhance the positive image of Almaty as a tourist destination.
Read full abstract