Abstract

Machine learning techniques can greatly accelerate material discovery while high-dimensional representation often causes overfitting problems and leads to poor model performance. Building a structure-property relationship with low-dimensional representation is always an open challenge, especially for diverse structures within small datasets. To address this issue, a low-dimensional representation named the transformed atom vector (TAV) is proposed, which is a crystal-graph-based descriptor. As an example, we apply it in two-dimensional materials and predict the band gap at the Heyd-Scuseria-Ernzerhof level with only 500 samples at acceptable accuracy. Moreover, TAV representation retains interpretability, based on which a property-oriented search method through element substitution is developed. This work provides a universal low-dimensional representation containing rich material information, as well as an intuitive interpretation approach for material design, which improves the feasibility and interpretability of machine learning models for small datasets and helps realize accurate yet meaningful property prediction at a lower cost.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call