In previous laboratory preparation processes, the selection and proportioning of specific experimental parameters often stemmed from the empirical experience of predecessors, necessitating the derivation of the optimal scheme for the target experiment through extensive trial and error. This process typically required the consumption of substantial resources and time. Thanks to the advancement of machine learning technologies, these have gradually become a powerful tool for addressing complex functional problems in material optimization. This study is based on a collected database of thermoelectric materials, aiming to clarify the correspondence between physical characteristics and the thermoelectric figure of merit, zT. The key features that can directly affect the experimental results are identified, and the features that indirectly affect the experimental results are also analyzed. By employing interpretable machine learning methods, this research analyzes critical molecular features in the characterized data, utilizing feature engineering techniques to construct and optimize machine learning models for the selected key features. Furthermore, in the subsequent model fitting and analysis phase, by comparing the efficiency of different feature combinations, the optimal feature descriptors for the current experimental system were determined. The adoption of the aforementioned improved methods endowed the related models with the potential for high-throughput screening, thereby further enhancing the efficiency of experimental optimization.
Read full abstract