Abstract

Machine learning (ML) is widely used in the field of material informatics. However, limitations on the size of available datasets are a key bottleneck in the use of machine learning methods to predict material properties or reverse-design high-performance materials. To solve this problem, we propose a virtual sample generation algorithm based on a Gaussian mixture model (GMM-VSG) to address the lack of training samples in machine learning. The core idea of the algorithm is to generate virtual samples by fitting the distribution of the original samples. We used an open rubber composite dataset (24 samples) to establish a machine learning model to predict the wear resistance of rubber materials through mechanical properties to verify the performance of the GMM-VSG algorithm. The results show that after using our algorithm, the R2 of the prediction model reached 0.95, and the prediction accuracy increased by 41%. This shows that the proposed algorithm can effectively promote the prediction accuracy of data model with small sample size.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.