Abstract

The ecological effects of accidental or malicious radioactive contamination are insufficiently understood because of the hazards and difficulties associated with conducting studies in radioactively-polluted areas. Data sets from severely contaminated locations can therefore be small. Moreover, many potentially important factors, such as soil concentrations of toxic chemicals, pH, and temperature, can be correlated with radiation levels and with each other. In such situations, commonly-used statistical techniques like generalized linear models (GLMs) may not be able to provide useful information about how radiation and/or these other variables affect the outcome (e.g. abundance of the studied organisms). Ensemble machine learning methods such as random forests offer powerful alternatives. We propose that analysis of small radioecological data sets by GLMs and/or machine learning can be made more informative by using the following techniques: (1) adding synthetic noise variables to provide benchmarks for distinguishing the performances of valuable predictors from irrelevant ones; (2) adding noise directly to the predictors and/or to the outcome to test the robustness of analysis results against random data fluctuations; (3) adding artificial effects to selected predictors to test the sensitivity of the analysis methods in detecting predictor effects; (4) running a selected machine learning method multiple times (with different random-number seeds) to test the robustness of the detected “signal”; (5) using several machine learning methods to test the “signal’s” sensitivity to differences in analysis techniques. Here, we applied these approaches to simulated data, and to two published examples of small radioecological data sets: (I) counts of fungal taxa in samples of soil contaminated by the Chernobyl nuclear power plan accident (Ukraine), and (II) bacterial abundance in soil samples under a ruptured nuclear waste storage tank (USA). We show that the proposed techniques were advantageous compared with the methodology used in the original publications where the data sets were presented. Specifically, our approach identified a negative effect of radioactive contamination in data set I, and suggested that in data set II stable chromium could have been a stronger limiting factor for bacterial abundance than the radionuclides 137Cs and 99Tc. This new information, which was extracted from these data sets using the proposed techniques, can potentially enhance the design of radioactive waste bioremediation.

Highlights

  • The ecological consequences of radioactive contamination after an accident at a nuclear power plant or nuclear waste storage site remain poorly understood because it can be difficult and hazardous to conduct studies in contaminated areas

  • Noise variables can be produced by an analogous procedure for other types of predictors, for example by using other distributions (e.g. Bernoulli instead of normal for binary variables) or by bootstrapping with replacement. These synthetic noise variables can be added as extra predictors to the data set and analyzed by generalized linear models (GLMs) and multi-model inference (MMI) along with real predictors

  • The number of fungal taxa isolated from soil at various locations near the Chernobyl nuclear power plant during the first five years after the accident had a moderate negative correlation with the severity of radioactive contamination (AvLogRad) (Fig 1)

Read more

Summary

Introduction

The ecological consequences of radioactive contamination after an accident at a nuclear power plant or nuclear waste storage site remain poorly understood because it can be difficult and hazardous to conduct studies in contaminated areas. Due to these limitations, data sets collected from locations where the contamination is most severe are often small and study design can be sub-optimal. Even small data sets collected under field conditions can in principle provide useful information which would be difficult to obtain in the laboratory Such complexity of ecological data can make their analysis challenging. Identifying the effects of these factors can provide insight into predicting and understanding the ecological impact of radioactive contamination, and potentially into mitigating its impact by bioremediation

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call