Seamless hourly surface fine particulate matters (PM2.5) datasets are urgently needed for environmental and epidemiologic studies. Most existing models rely on satellite AOD with spatial gaps and the time-geolocation information not directly physically related to ground-level PM2.5 as input, leading to several problems, e.g., for daytime only, spatial discontinuity. By exploring spatiotemporal connection, here we propose a novel framework for improving reanalysis AOD with satellite AOD and retrieving seamless PM2.5 concentration. With the wavelet decomposition on the meteorological / AOD time-series and the machine learning algorithm ‘CatBoost’, the grid cells are well connected based on the meteorological/AOD similarity (quasi-Lagrangian description), rather than the spatial distance (Eulerian description). Our new AOD dataset (compared with AERONET AOD, R2 = 0.60, RMSE = 0.19) combines the advantages of reanalysis AOD (all-day hourly; seamless) and satellite AOD (high accuracy and spatial resolution). The new PM2.5 estimation strategy performed excellent in cross-validation (spatial CV R2/RMSE = 0.84/14.57 μg/m3) and mapping (no spatial discontinuity). With the help of new AOD dataset, the PM2.5 estimation exhibits significant improvement (R2: increased from 0.60 to 0.65, RMSE: reduced from 21.52 to 20.10 μg/m3) on retrieving PM2.5 at ground monitoring site sparse area (the spatial distance to the closest site is over 20 km). Such high temporal-resolved PM2.5 prediction improved daily estimates of PM2.5 as it leveraged the additional hourly information to well capture the transport pattern, with the increased R2 (from 0.72 to 0.77) and decreased RMSE (from 15.54 to 14.30 μg/m3) at ground monitoring site sparse area. The annual averaged PM2.5 estimated by hourly model also exhibit noticeable differences (∼6%) from that estimated with daily model, implying the latter might suffer overestimation (∼3%) in population-weighted PM2.5 exposure across China. In addition, our new PM2.5 dataset shows the Chinese average PM2.5 is highest around 24:00, lowest around 17:00.Synopsis: By performing the wavelet decomposition on meteorological factors/AOD (independent variables), the hourly seamless PM2.5 concentration across China is well estimated with improved spatiotemporal connection.
Read full abstract