Abstract
Accurate theoretical predictions of desired properties of materials play an important role in materials research and development. Machine learning (ML) can accelerate the materials design by building a model from input data. For complex datasets, such as those of crystalline compounds, a vital issue is how to construct low-dimensional representations for input crystal structures with chemical insights. In this work, we introduce an algebraic topology-based method, called atom-specific persistent homology (ASPH), as a unique representation of crystal structures. The ASPH can capture both pairwise and many-body interactions and reveal the topology-property relationship of a group of atoms at various scales. Combined with composition-based attributes, ASPH-based ML model provides a highly accurate prediction of the formation energy calculated by density functional theory (DFT). After training with more than 30,000 different structure types and compositions, our model achieves a mean absolute error of 61 meV/atom in cross-validation, which outperforms previous work such as Voronoi tessellations and Coulomb matrix method using the same ML algorithm and datasets. Our results indicate that the proposed topology-based method provides a powerful computational tool for predicting materials properties compared to previous works.
Highlights
Advances in materials science are typically slow and arduous[1], and is challenging to meet the increased demand for material characterization[2]
The coefficients of determination (R2), the root mean squared error (RMSE), and the mean absolute error (MAE) for tenfold crossvalidations repeated 20 times are given for various methods
An MAE of 61 meV/atom is achieved by atom-specific persistent homology (ASPH) combined with composition-based features, whose performance is better than those of Voronoi tessellations and Coulomb matrix (CM) modified by sine matrix approximation
Summary
Advances in materials science are typically slow and arduous[1], and is challenging to meet the increased demand for material characterization[2]. To accelerate the development of new materials, high-throughput computing methods have been proposed in recent years, especially the density functional theory (DFT) which can predict the properties of both experimental and hypothetical inorganic compounds[4,5] The combination of both experiments and computer simulations has proven to be a powerful approach to reduce the time and cost of materials design and has been widely used in Li-ion batteries, electrocatalysis, thermoelectrics, and structural alloys. Ward et al.[39] applied the standard random forest (RF) to predict the formation energy based on features derived from Voronoi tessellations to represent structural properties and atomic properties This model achieves an MAE of 80 meV/atom in crossvalidation for a dataset of 435,000 formation energies. Combined with composition-based attributes, our method achieved excellent results with the mean absolute error as low
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.