Abstract

This work develops predictive models for estimating radon (222Rn) activity concentration in the atmosphere using novel grid search based random forest regression (GS-RFR) and stepwise regression (SWR). The developed models employ meteorological parameters which include the temperature, pressure, relative and absolute humidity, wind speed and wind direction as descriptors. Experimental data of radon concentration and meteorological parameters from two observatories of the Korea Polar Research Institute in Antarctica (King Sejong and Jang Bogo) have been employed in this work. The performance of the developed models was assessed using three different performance measuring parameters. On the basis of root mean square error (RMSE), the GS-RFR shows better performance over the SWR. An improvement of 64.09 % and 15.19 % was obtained on the training and test datasets, respectively at King Sejong station. At the Jang Bogo station, an improvement of 75.04 % and 28.04 % was obtained on the training and test datasets, respectively. The precision and robustness of the developed models would be of significant interest in determining the concentration of radon (222Rn) activity concentration in the atmosphere for various physical applications especially in regions where field measuring equipment for radon is not available or measurements have been interrupted.

Highlights

  • Description of Random Forest RegressionA random forest is described, according to [35], to consist of N regression trees that are randomized referred to as a family

  • This work develops predictive models for estimating radon (222Rn) activity concentration in the regression (SWR)

  • The data used in this work was published by [38], being data measured in two Korea Antarctic Research Program stations namely King Sejong (KSG) and Jang Bogo (JBS)

Read more

Summary

Description of Random Forest Regression

A random forest is described, according to [35], to consist of N regression trees that are randomized referred to as a family. The finite forest estimate for regression as a result of the combination of the trees is and ease of implementation of random forests, their predictions are remarkably accurate, with the ability to process a very large input data whilst dealing with overfitting. They perform well with small to medium data [29]. Their good predictive abilities have made them highly applicable to regression and classification problems in the atmospheric sciences.

Description of Grid Search optimization
Stepwise Regression
Performance measuring parameters
Description of sites
Radon and Meteorological Data
Comparison of Performance between the SWR and GSRFR
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call