Measuring the similarity between materials is essential for estimating their properties and revealing the associated physical mechanisms. However, current methods for measuring the similarity between materials rely on theoretically derived descriptors and parameters fitted from experimental or computational data, which are often insufficient and biased. Furthermore, outliers and data generated by multiple mechanisms are usually included in the dataset, making the data-driven approach challenging and mathematically complicated. To overcome such issues, we apply the Dempster–Shafer theory to develop an evidential regression-based similarity measurement (eRSM) method, which can rationally transform data into evidence. It then combines such evidence to conclude the similarities between materials, considering their physical properties. To evaluate the eRSM, we used two material datasets, including 3d transition metal–4f rare-earth binary and quaternary high-entropy alloys with target properties, Curie temperature, and magnetization. Based on the information obtained on the similarities between the materials, a clustering technique is applied to learn the cluster structures of the materials that facilitate the interpretation of the mechanism. The unsupervised learning experiments demonstrate that the obtained similarities are applicable to detect anomalies and appropriately identify groups of materials whose properties correlate differently with their compositions. Furthermore, significant improvements in the accuracies of the predictions for the Curie temperature and magnetization of the quaternary alloys are obtained by introducing the similarities, with the reduction in mean absolute errors of 36% and 18%, respectively. The results show that the eRSM can adequately measure the similarities and dissimilarities between materials in these datasets with respect to mechanisms of the target properties.
Read full abstract