Abstract

Most materials science datasets are not so large that the accuracy of machine learning (ML) models is relatively limited if only simple features are used. Here, we constructed an interpretable ∆-machine learning (∆-ML) model to connect the hybrid functional HSE bandgap ( E g HSE ) with the PBE functional bandgap ( E g PBE ). The former can reproduce the band gap comparable with experiments, but the computational cost is much more challenging. The training is based on our high-throughput calculations on a set of two-dimensional semiconductors. Four complex descriptors, all based on the E g PBE are constructed using the sure independence screening and sparsifying operator (SISSO) algorithm. Using these descriptors, the ∆-ML can accurately predict the E g HSE of test set with a determination coefficient (R 2 ) of 0.96. The error satisfies a normal distribution with a mean of zero. We provide a direct functional relationship between input descriptors and target properties. We find that E g HSE and the 5/6 th power of E g PBE show a significant linear correlation, which may guide rapid prediction of E g HSE from E g PBE for materials with a E g HSE greater than 0.22 eV. We also discussed the correlation between the atomic radius and the E g HSE . Our work will provide an effective and interpretable model to construct the optimal physical descriptors for ML prediction on bandgaps in screening massive new 2D materials research. • Constructing an interpretable ∆-machine learning (∆-ML) model to connect the hybrid functional E g HSE with the E g PBE . • SISSO descriptor D 3 = E g PBE 5 / 6 can predict the E g HSE of 2D-semiconductors using equation E g HSE = D 3 ×1.55+0.22. • SISSO descriptor D 1 shows the atomic volume negatively correlated to E g HSE .

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call