PurposeAccurate electronic stopping power data is crucial for calculations of radiation-induced effects in a wide range of applications, from dosimetry and radiotherapy to particle physics. The data is dependent on the parameters of both the incident charged particle and the stopping medium. The existent Bethe theory can be used to calculate the stopping power of high-energy ions, but fails at lower energies, leaving incomplete and even contradictory experimental data, often expanded through extrapolations with fitting formula, as the only accessible resource. Moreover, the majority of the experimental data is available for elements only, further limiting the validity of fitting approaches for complex material compositions. A relatively novel machine learning methodology has been proven to be effective for exactly these types of problems. In this study, Stacking Ensemble Machine Learning (EML) algorithm was developedto predict electronic stopping power for any incident ion and target combination over a wide range of ion energies. For this purpose, five ML models, namely Bagging Regressor (BR), eXtreme Gradient Boosting (XGB), Adaptive Boosting (AdB), Gradient Boosting (GB), and Random Forest (RF), were selected as base and meta learners to construct the final Stacking EML. Methods40,044 experimental measurements, from 1928 to the present, available on the International Atomic Energy Agency (IAEA) website were used to train machine learning (ML) algorithms. This database consists of 593 various ion-target combinations across the energy range of 0.037 to 985 MeV. For model training, the eleven most important features were selected. The model evaluation was performed using several error metrics, including R-squared (R2), root-mean-squared-error (RMSE), mean-absolute-error (MAE), and mean-absolute-percentage-error (MAPE), on both the training and test datasets. ResultsBased on model performance evaluation tests, a stack of XGB and RF via BR meta-learner had the lowest error margin. The value of R2 = 0.9985 indicated a near-ideal fit to all samples in the training data across the entire range of stopping powers. R2 = 0.9955 for predictions made by the model on the unseen test data suggested that the model accurately predicted the test data. ConclusionsThe developed model can serve as a universal tool to generate the eSP data in a wide range of cases, regardless of the availability of experimental data or reliable theoretical equations. Overall, the results of the developed tool testified to the power of machine learning approaches, and the suitability of the chosen models for solutions to practically important physics problems.
Read full abstract