The impact of tree-based machine learning models, length of training data, and quarantine search query on tourist arrival prediction’s accuracy under COVID-19 in Indonesia

Mochammad Agus Afrianto,Meditya Wasesa

doi:10.1080/13683500.2022.2085079

Abstract

ABSTRACT This study presents the extreme gradient boosting (XGBoost) and random forest (RF) models to predict tourism demand by incorporating international COVID-19 cases, international tourist arrivals, and the destination's quarantine policy predictors. Unlike other ‘black box’ machine learning models, those two tree-based models offer better interpretability with explicit feature importance and tree structure representations. This paper evaluates the accuracy of these models in predicting international tourist arrivals in Indonesia during the COVID-19 pandemic using long-range (January 2008–June 2021) and short-range (January 2018–June 2021) training datasets. The performance of these two models is compared with benchmark models, such as the artificial neural network, autoregressive integrated moving average, and seasonal ARIMA models. In general, the tree-based machine learning models outperformed all benchmark models. International COVID-19 cases and tourist arrivals predictors have dominating feature importance scores in XGBoost models. Meanwhile, Google trends keywords on quarantine policies show significant importance in RF models but not in the XGBoost models. Moreover, RF models are better than the XGBoost models in terms of accuracy and overcoming overfitting cases.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

The impact of tree-based machine learning models, length of training data, and quarantine search query on tourist arrival prediction’s accuracy under COVID-19 in Indonesia

Abstract

Talk to us

Similar Papers

More From: Current Issues in Tourism

Lead the way for us

Journal: Current Issues in Tourism	Publication Date: Jun 15, 2022
Citations: 2

Similar Papers

Development of interpretable machine learning models to predict in-hospital prognosis of acute heart failure patients.
Munekazu Tanaka ... Takeshi Kimura
ESC heart failure | VOL. 11
Munekazu Tanaka, et. al.Munekazu Tanaka ... Takeshi Kimura
15 May 2024
ESC heart failure | VOL. 11

Predictability of Belgian residential real estate rents using tree-based ML models and IML techniques
Ian Lenaers ... Lieven De Moor
International Journal of Housing Markets and Analysis | VOL. 17
Ian Lenaers, et. al.Ian Lenaers ... Lieven De Moor
13 Apr 2023
International Journal of Housing Markets and Analysis | VOL. 17

Ensemble learning models with a Bayesian optimization algorithm for mineral prospectivity mapping
Jiangning Yin ... Nan Li
Ore Geology Reviews | VOL. 145
Jiangning Yin, et. al.Jiangning Yin ... Nan Li
28 Apr 2022
Ore Geology Reviews | VOL. 145

Exploring Machine Learning in Deep Foundation and Soil Classification Application
Mohammad Moontakim Shoaib
-
Mohammad Moontakim ShoaibMohammad Moontakim Shoaib
05 Jun 2023
05 Jun 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The impact of tree-based machine learning models, length of training data, and quarantine search query on tourist arrival prediction’s accuracy under COVID-19 in Indonesia

Abstract

Talk to us

Similar Papers

More From: Current Issues in Tourism