Short-Term Visibility Prediction Using Tree-Based Machine Learning Algorithms and Numerical Weather Prediction Data

Bu-Yo Kim,Miloslav Belorid,Joo Wan Cha

doi:10.1175/waf-d-22-0053.1

Abstract

Abstract Accurate visibility prediction is imperative in the interests of human and environmental health. However, the existing numerical models for visibility prediction are characterized by low prediction accuracy and high computational cost. Thus, in this study, we predicted visibility using tree-based machine learning algorithms and numerical weather prediction data determined by the local data assimilation and prediction system (LDAPS) of the Korea Meteorological Administration. We then evaluated the accuracy of visibility prediction for Seoul, South Korea, through a comparative analysis using observed visibility from the automated synoptic observing system. The visibility predicted by machine learning algorithm was compared with the visibility predicted by LDAPS. The LDAPS data employed to construct the visibility prediction model were divided into learning, validation, and test sets. The optimal machine learning algorithm for visibility prediction was determined using the learning and validation sets. In this study, the extreme gradient boosting (XGB) algorithm showed the highest accuracy for visibility prediction. Comparative results using the test sets revealed lower prediction error and higher correlation coefficient for visibility predicted by the XGB algorithm (bias: −0.62 km, MAE: 2.04 km, RMSE: 2.94 km, and R: 0.88) than for that predicted by LDAPS (bias: −0.32 km, MAE: 4.66 km, RMSE: 6.48 km, and R: 0.40). Moreover, the mean equitable threat score (ETS) also indicated higher prediction accuracy for visibility predicted by the XGB algorithm (ETS: 0.5–0.6 for visibility ranges) than for that predicted by LDAPS (ETS: 0.1–0.2).

Full Text