Abstract

Lembang is a notable regional tourism destination that bears considerable significance within the urban area of Bandung. Lembang is widely recognized for its flourishing agricultural sector, which supports a significant community of farmers engaged in the cultivation of fruits, vegetables, and ornamental plants, in addition to its intrinsic scenic beauty. Therefore, the acquisition of precipitation data is of considerable significance for individuals live in the area to maintain their economic endeavors. This study employs daily historical data from the period of 2018 to 2021, wherein approximately 70% of the data is categorized as sparse. This discourse aims to examine the utilization of the Extreme Gradient Boosting (XGboost) technique for predicting rainfall in the Lembang region, specifically emphasizing its effectiveness in handling limited data. The findings indicate that the model, when trained and tested using a 7:3 data split ratio, achieved a mean absolute error (MAE) of 1.834 for training and 4.473 for testing. Additionally, the root mean square error (RMSE) was calculated to be 3.319 for training and 7.637 for testing. The optimal hyperparameters consist of a learning rate of 0.005, a max_depth value of 10, and the utilization of 300 decision trees as n_estimators. The model effectively captures the pattern of sparse time series data and non-rainy days data, as evidenced by its low error metrics. However, it slightly underestimates the rainfall rate on the days with intense precipitation

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call