Abstract

At present, the numerical prediction models fail to predict effectively due to the lack of basic data of pollutant concentration in a short term in China. Therefore, it is necessary to study the statistical prediction methods based on historical data. The traditional Back Propagation Neural Network (BPNN) has been used to predict the pollutant concentration. The missing data also has an impact on modeling, and how to use historical data effectively of multiple monitoring stations in a city should be concerned. In this study, the Improved Newton Interpolation (INI) algorithm has been adopted to solve the problem of missing data, and assigning weight (AW) method has been proposed to enrich data of per station. The Neighbor-Principal Component Analysis (Neighbor-PCA) algorithm has been employed to reduce the dimension of data in order to avoid overfitting caused by high dimension and linear correlation of multiple factors. The strategy of early stopping and gradient descent algorithm have been utilized to avoid the slow convergence speed and overfitting by the traditional BPNN. The methods (INI, AW, Neighbor-PCA) have been integrated as a prediction model named NNP-BPNN. Forecasting experiments of PM_{2.5} have shown that the NNP-BPNN model can improve the accuracy and generalization ability of the traditional BPNN model. Specifically, the average root mean square error (RMSE) has been reduced by 24% and the average correlation relevancy has been increased by 9.4%. It took 20 s to implement BPNN model, it took 170 s to implement NN-BPNN model and it took 47 s to implement NNP-BPNN model. The time used by NNP-BPNN model is reduced by 72% than that of NN-BPNN model.

Highlights

  • Environmental pollution poses a serious threat to human health (Zheng et al 2016)

  • We discuss the effectiveness of NN-Back Propagation Neural Network (BPNN) model with INI algorithm and the assigning weight (AW) method

  • We discuss the validity of NNP-BPNN model with INI algorithm, AW method and Neighbor-PCA algorithm

Read more

Summary

Introduction

Environmental pollution poses a serious threat to human health (Zheng et al 2016). The NAP uses mathematical models of the atmosphere and oceans to predict the air quality based on current atmosphere conditions and the pollutant sources (Zhang et al 2014). Shi et al (2012) have proved that Artificial Neural Network (ANN) can provide better results than the traditional multiple linear regression models about pollutant concentration prediction based on historical monitoring data. The combined model of ARMA and BPNN based on historical monitoring data studied by Zhu and Lu (2016) has a smaller prediction error than that of the traditional BPNN. The time series data is decomposed into wavelet coefficients, and the prediction experiments based on three types of neural network models (Multi-layer Perceptron, Elman and Support Vector Machine) based on historical data have shown that the improved models have provided

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.