Air pollution forecasting is a significant step for air quality pollution management to mitigate pollution’s negative impact on the environment and people’s health. The data-driven forecasting model can help a better understanding of environmental air quality. The existing data-driven forecasting models usually ignore missing values, the correlations between the pollutant and meteorological factors and fail to perform temporal modeling effectively, affecting prediction accuracy. In response to these issues, we present a deep learning-based Convolutional LSTM–SDAE (CLS) model to forecast the particulate matter level, revealing the correlation between particulate matter and meteorological factors. In the proposed architecture, the k nearest neighbor (KNN) imputation technique is employed to recover the air quality dataset’s missing values. The Convolutional Long Short Term Memory (CNN–LSTM) unit identifies the vast dataset’s hidden features and performs pollutants’ temporal modeling. In addition, Bidirectional Gatted Recurrent Unit (BIGRU) is implemented as both encoder and decoder in Sparse Denoising Autoencoder, which reconstructs the CNN–LSTM model’s output in the dynamic fine-tuning layer to get robust prediction results. The experimental results in Talcher, India, and Beijing, China indicate that the model can improve forecasting accuracy and outperforms the other state of art and baseline models.
Read full abstract