Abstract

This study aims to develop integrated tourism information portal prototypes in Bali by using web scrapping and Clustering methods.This study will use the waterfall development model. This study focused on the stages of needs analysis, system design for Tourism information portals in Bali and prototype development at the implementation stage. The initial stage of the web scrapping process is to do web searching in the form of url requests based on the keywords entered. After the data from the web scrapping process is collected, the next step is text processing which consists of several processes, namely parsing or tokenization, stemming in the form of root word search process, and stop word removal or removal of non-essential words. Before the clustering process begins, each word will be given weighting with the TF-IDF method which will form the vector space model as a representation of a text document. Similarity between vector space models will be calculated using the cosine similarity method. After the weighting process, the last step is to cluster the words which then continue to group the websites based on the results of clustering.The results of this study are a website portal that can display tourism information in Bali.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call