Balancing data for generalizable machine learning to predict glass-forming ability of ternary alloys

Yi Yao, Timothy Sullivan, Feng Yan, Jiaqi Gong, Lin Li

doi:10.1016/j.scriptamat.2021.114366

Open Access

https://doi.org/10.1016/j.scriptamat.2021.114366

Journal: Scripta Materialia
Publication Date: Oct 30, 2021
Citations: 22
License type: publisher-specific-oa

Affiliation: University of Alabama

Abstract

Machine learning has thrived on the emergence of data-driven materials science. However, the materials datasets acquired through existing research efforts have significant imbalance issues. This paper investigated the data imbalance for the glass-forming ability of ternary alloy systems, where the data consist of abundant, low-fidelity high-throughput data and sparse, high-fidelity traditional experimental data. We demonstrated a new method to handle the data imbalance and trained artificial neural network (ANN) models on the original and the balanced datasets. The ANN model trained on the balanced dataset resolved the overfitting suffered by the model trained on the original dataset. More importantly, the data-balanced model generalized better when predicting a new alloy system, as evidenced by leave-one-alloy-system-out validation. Our work highlights the importance of handling data imbalance in materials datasets to resolve overfitting in machine learning models and to further enhance generalizability in predicting the characteristics of new material systems.
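
The abstract does not spell out the balancing method, so the following is only a minimal sketch of the evaluation idea it names, leave-one-alloy-system-out validation, combined with plain random oversampling as a generic stand-in for the balancing step. The oversampling is not the authors' method. The function names, the system labels (system_A, ...), the source labels, and the toy records are all hypothetical and are there only to make the example runnable.

```python
import random
from collections import defaultdict


def leave_one_system_out(systems):
    """Yield (held_out, train_indices, test_indices), one fold per alloy system."""
    by_system = defaultdict(list)
    for i, name in enumerate(systems):
        by_system[name].append(i)
    for held_out in sorted(by_system):
        test_idx = by_system[held_out]
        train_idx = sorted(
            i for name, idx in by_system.items() if name != held_out for i in idx
        )
        yield held_out, train_idx, test_idx


def oversample_minority(indices, sources, seed=0):
    """Generic random oversampling (an assumption, not the paper's method):
    duplicate samples from smaller data sources until every source has as
    many entries as the largest one."""
    rng = random.Random(seed)
    by_source = defaultdict(list)
    for i in indices:
        by_source[sources[i]].append(i)
    target = max(len(idx) for idx in by_source.values())
    balanced = []
    for src in sorted(by_source):
        idx = by_source[src]
        balanced.extend(idx)
        balanced.extend(rng.choice(idx) for _ in range(target - len(idx)))
    return balanced


# Hypothetical toy records: one alloy-system label and one data-source label each.
systems = ["system_A"] * 3 + ["system_B"] * 2 + ["system_C"] * 3
sources = [
    "high_throughput", "high_throughput", "traditional",  # system_A
    "high_throughput", "high_throughput",                 # system_B
    "high_throughput", "traditional", "high_throughput",  # system_C
]

folds = list(leave_one_system_out(systems))
for held_out, train_idx, test_idx in folds:
    # Balance only the training fold; the held-out system stays untouched.
    balanced_train = oversample_minority(train_idx, sources)
    print(held_out, len(train_idx), len(balanced_train), len(test_idx))
```

Balancing only inside each training fold keeps duplicated samples out of the held-out system, so the test score reflects prediction on an alloy system the model has not seen.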

Similar Papers

Improving accuracy of code smells detection using machine learning with data balancing techniques
Nasraldeen Alnor Adam Khleel ... Károly Nehéz
The Journal of Supercomputing | VOL. 80
05 Jun 2024

Comparison of Predictive Models for Transferring Stroke In-Patients to Intensive Care Unit
Nawal N Alotaibi ... Sreela Sasi
Transactions on Machine Learning and Artificial Intelligence | VOL. 4
30 Jun 2016

Hybrid SVM Model for Classification of Imbalanced Data Sets
Jae Sik Lee ... Jong Gu Kwon
Journal of Intelligence and Information Systems | VOL. 19
30 Jun 2013

Machine learning algorithms, bull genetic information, and imbalanced datasets used in abortion incidence prediction models for Iranian Holstein dairy cattle
Hamideh Keshavarzi ... Rabeh Ravanifard
Preventive Veterinary Medicine | VOL. 175
17 Dec 2019
