Abstract

Machine learning has emerged as a novel tool for the efficient prediction of material properties, and claims have been made that machine-learned models for the formation energy of compounds can approach the accuracy of Density Functional Theory (DFT). The models tested in this work include five recently published compositional models, a baseline model using stoichiometry alone, and a structural model. By testing seven machine learning models for formation energy on stability predictions using the Materials Project database of DFT calculations for 85,014 unique chemical compositions, we show that while formation energies can indeed be predicted well, all compositional models perform poorly on predicting the stability of compounds, making them considerably less useful than DFT for the discovery and design of new solids. Most critically, in sparse chemical spaces where few stoichiometries have stable compounds, only the structural model is capable of efficiently detecting which materials are stable. The nonincremental improvement of structural models compared with compositional models is noteworthy and encourages the use of structural models for materials discovery, with the constraint that for any new composition, the ground-state structure is not known a priori. This work demonstrates that accurate predictions of formation energy do not imply accurate predictions of stability, emphasizing the importance of assessing model performance on stability predictions, for which we provide a set of publicly available tests.

Highlights

  • Machine learning (ML) is emerging as a novel tool for rapid prediction of material properties[1,2,3,4,5,6]

  • We show that while existing ML models can predict ΔHf with relatively high accuracy from the chemical formula, they are insufficient to accurately distinguish stable from unstable compounds within an arbitrary chemical space

  • The error in predicting Density Functional Theory (DFT)-calculated ΔHf by ML models is often compared favorably with the error DFT makes in predicting ΔHf relative to experimentally obtained values

Read more

Summary

Introduction

Machine learning (ML) is emerging as a novel tool for rapid prediction of material properties[1,2,3,4,5,6]. The Inorganic Crystal Structure Database (ICSD) of known solid-state materials contains ~105 entries[13], several orders of magnitude less than the 1010 quaternary compositions identified as plausible using electronegativity- and charge-based rules[14]. This suggests that (1) there is ample opportunity for new materials discovery and (2) the problem of finding stable materials may resemble the needle-in-a-haystack problem, with many unstable compositions for each stable one. The immensity of this problem is a natural fit for high-throughput ML techniques

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call