Precipitable water vapor (PWV) facilitates the exchange of water and energy between the Earth's surface and the surrounding atmosphere. PWV can be retrieved by employing the water vapor absorption channels in Landsat 8. Unfortunately, the conventional split-window covariance-variance ratio (SWCVR) method is susceptible to various factors, resulting in low retrieval accuracy and availability. Therefore, we improve the traditional SWCVR model and propose a new PWV retrieval method using ensemble machine learning to address these limitations. The new method uses Gradient Boosting Decision Tree (GBDT) to establish the model between brightness temperature, GNSS-derived PWV, and related surface parameters. The results of the test set show that the improved SWCVR model has an RMSE, Bias, and availability of 0.4947 g/cm2, 0.0276 g/cm2, and 29.6%, respectively. By contrast, the GBDT model's corresponding values are 0.2870 g/cm2, −0.0094 g/cm2, and 67.9%, respectively. Compared with SWCVR, the GBDT improves the RMSE and availability by 41.99% and 38.3%, respectively. The GBDT algorithm is significantly better than the SWCVR model in low altitude and complex surface coverage areas. From a temporal perspective, the advantages of the GBDT method are more apparent in the summer. Finally, the SWCVR and GBDT models are externally validated using measured PWV data from the sun photometer and radiosonde, and the RMSE is 0.5148 g/cm2 and 0.3775 g/cm2, respectively. Based on the findings, we know that the accuracy of GBDT has significantly improved against the SWCVR.
Read full abstract