The numerical precision of density-functional-theory (DFT) calculations depends on a variety of computational parameters, one of the most critical being the basis-set size. The ultimate precision is reached in the limit of a complete basis set (CBS). Our aim in this work is to find a machine-learning model that extrapolates finite basis-size calculations to the CBS limit for periodic crystal structures. We start with a data set of 63 binary solids investigated with two all-electron DFT codes, and FHI-aims, which employ very different types of basis sets. A quantile-random-forest model and a symbolic regression approach using the SISSO model are used to estimate the total-energy correction with respect to a fully converged calculation as a function of the basis-set size. The random-forest model achieves a symmetric mean absolute percentage error of lower than 25% for both codes and outperforms previous approaches in the literature. SISSO outperforms the random forest model for the code. Our approach also provides prediction intervals, which quantify the uncertainty of the models' predictions. Published by the American Physical Society 2025
Read full abstract