Abstract
The K-Means algorithm can be used to group tourists based on their reviews of tourist destinations. Its weakness is sensitivity to the choice of initial centroids: centroids chosen at random can reduce accuracy, often leave the algorithm stuck in a local optimum, and yield solutions that vary from run to run. Optimization algorithms such as Particle Swarm Optimization (PSO) can overcome this by determining optimal initial centroids. The optimal number of clusters (K) is determined with the Elbow method, by calculating the sum of squared errors (SSE) of the resulting clusters. The average Silhouette Coefficient (SC) is used to measure the quality of the clusters produced by K-Means with and without PSO. This study uses secondary data from the UCI Machine Learning Repository, the Travel Reviews Data Set, which consists of 980 records and 10 attributes. The test results show that K=2 is the optimal number of clusters. K-Means with PSO gives an average SC of 0.300358, which is higher than the 0.300076 obtained without PSO. The best PSO hyperparameters found are 30 particles, $\varphi_1 = 2.2$, and $\varphi_2 = 3$, with a maximum of 100 iterations.
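To make the sensitivity to initial centroids concrete, the following is a minimal pure-Python sketch, not the study's implementation. It runs plain K-Means from two different starting points on a hypothetical four-point toy data set (not the Travel Reviews data) and compares the resulting SSE and average Silhouette Coefficient. The function names, the Euclidean distance, and the convention of scoring singleton clusters as 0 are assumptions made for illustration; PSO itself is not shown.

```python
import math

def dist(a, b):
    """Euclidean distance between two points (assumed metric)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def assign(points, centroids):
    """Label each point with the index of its nearest centroid."""
    return [min(range(len(centroids)), key=lambda j: dist(p, centroids[j]))
            for p in points]

def kmeans(points, init_centroids, max_iter=100):
    """Plain Lloyd iterations starting from the given initial centroids."""
    centroids = [tuple(c) for c in init_centroids]
    for _ in range(max_iter):
        labels = assign(points, centroids)
        new_centroids = []
        for j, old in enumerate(centroids):
            members = [p for p, l in zip(points, labels) if l == j]
            if members:
                new_centroids.append(
                    tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                new_centroids.append(old)  # keep the centroid of an empty cluster
        if new_centroids == centroids:
            break  # converged: centroids no longer move
        centroids = new_centroids
    return centroids, assign(points, centroids)

def sse(points, centroids, labels):
    """Sum of squared distances from each point to its assigned centroid."""
    return sum(dist(p, centroids[l]) ** 2 for p, l in zip(points, labels))

def mean_silhouette(points, labels):
    """Average Silhouette Coefficient; needs at least two clusters."""
    clusters = sorted(set(labels))
    scores = []
    for i, p in enumerate(points):
        own = [dist(p, q) for k, q in enumerate(points)
               if labels[k] == labels[i] and k != i]
        if not own:
            scores.append(0.0)  # assumed convention for singleton clusters
            continue
        a = sum(own) / len(own)  # mean distance within the point's own cluster
        b = min(                 # mean distance to the nearest other cluster
            sum(dist(p, q) for k, q in enumerate(points) if labels[k] == c)
            / labels.count(c)
            for c in clusters if c != labels[i])
        scores.append(0.0 if max(a, b) == 0 else (b - a) / max(a, b))
    return sum(scores) / len(scores)

# Hypothetical toy data: corners of a 4-by-1 rectangle.
points = [(0, 0), (0, 1), (4, 0), (4, 1)]

# A good start separates left from right; a poor start separates top from bottom.
good_centroids, good_labels = kmeans(points, [(0, 0.5), (4, 0.5)])
bad_centroids, bad_labels = kmeans(points, [(2, 0), (2, 1)])

sse_good = sse(points, good_centroids, good_labels)
sse_bad = sse(points, bad_centroids, bad_labels)
sc_good = mean_silhouette(points, good_labels)
sc_bad = mean_silhouette(points, bad_labels)
```

In this toy case both runs converge immediately, but the left/right split reaches an SSE of 1.0 while the top/bottom split stays at an SSE of 16.0 with a negative average SC. That is the kind of local optimum a search over initial centroids, such as PSO, is meant to avoid.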