The masses of $$\sim$$ 2500 nuclei have been measured experimentally; however, >7000 isotopes are predicted to exist in the nuclear landscape from H ( $$Z=1$$ ) to Og ( $$Z=118$$ ) based on various theoretical calculations. Exploring the mass of the remaining isotopes is a popular topic in nuclear physics. Machine learning has served as a powerful tool for learning complex representations of big data in many fields. We use Light Gradient Boosting Machine (LightGBM), which is a highly efficient machine learning algorithm, to predict the masses of unknown nuclei and to explore the nuclear landscape on the neutron-rich side from learning the measured nuclear masses. Several characteristic quantities (e.g., mass number and proton number) are fed into the LightGBM algorithm to mimic the patterns of the residual $$\delta (Z,A)$$ between the experimental binding energy and the theoretical one given by the liquid-drop model (LDM), Duflo–Zucker (DZ, also dubbed DZ28) mass model, finite-range droplet model (FRDM, also dubbed FRDM2012), as well as the Weizsäcker–Skyrme (WS4) model to refine these mass models. By using the experimental data of 80 $$\%$$ of known nuclei as the training dataset, the root mean square deviations (RMSDs) between the predicted and the experimental binding energy of the remaining 20% are approximately $$0.234\pm 0.022$$ , $$0.213\pm 0.018$$ , $$0.170\pm 0.011$$ , and $$0.222\pm 0.016$$ MeV for the LightGBM-refined LDM, DZ model, WS4 model, and FRDM, respectively. These values are approximately 90%, 65%, 40%, and 60% smaller than those of the corresponding origin mass models. The RMSD for 66 newly measured nuclei that appeared in AME2020 was also significantly improved. The one-neutron and two-neutron separation energies predicted by these refined models are consistent with several theoretical predictions based on various physical models. In addition, the two-neutron separation energies of several newly measured nuclei (e.g., some isotopes of Ca, Ti, Pm, and Sm) predicted with LightGBM-refined mass models are also in good agreement with the latest experimental data. LightGBM can be used to refine theoretical nuclear mass models and predict the binding energy of unknown nuclei. Moreover, the correlation between the input characteristic quantities and the output can be interpreted by SHapley additive exPlanations (a popular explainable artificial intelligence tool), which may provide new insights for developing theoretical nuclear mass models.
Read full abstract