This study builds a model to predict distribution coefficients (Kd) using the random forest (RF) method and a machine learning model based on the Japan Atomic Energy Agency Sorption Database (JAEA-SDB). A database of ten input variables, including the distribution coefficient, pH, initial radionuclide concentrations, solid–liquid ratio, ionic strength, oxidation number, cation exchange capacity, surface area, electronegativity, and ionic radius, was constructed and used for the RF model calculation. The calculation parameters employed in this work included two different hyperparameters, the number of decision trees and the maximum number of variables to divide each node, together with the random seeds inside the RF model. The coefficients of determination were derived with various combinations of hyperparameters and random seeds, and were employed to assess the RF model calculation result. Based on the results of the RF model, the distribution coefficients of 22 target nuclides (Am, Ac, Co, Cm, Cd, Cs, Cu, Na, Np, Ni, Nb, U, Sr, Sn, Pb, Pa, Pu, Po, I, Tc, Th, and Zr) were predicted successfully. Among the various input variables, pH was found to make the highest contribution to determining the distribution coefficient. The novelty of this study lies in the first application of the machine learning method for predicting the Kd value of bentonites, using JAEA-SDB. This study has established a model for reliably predicting the distribution coefficient for various radionuclides that is intended for use in evaluating the Kd value in arbitrary aqueous conditions.
Read full abstract