A Strategic Approach to Machine Learning for Material Science: How to Tackle Real-World Challenges and Avoid Pitfalls

Piyush Karande,Thomas Yong-Jin Han,Brian Gallagher

doi:10.1021/acs.chemmater.2c01333

Piyush Karande, Thomas Yong-Jin Han + Show 1 more

Open Access

https://doi.org/10.1021/acs.chemmater.2c01333

Copy DOI

Journal: Chemistry of Materials	Publication Date: Sep 1, 2022
Citations: 16	License type: CC BY-NC-ND 4.0

Affiliation: Lawrence Livermore National Laboratory

Abstract

The exponential growth and success of machine learning (ML) has resulted in its application in all scientific domains including material science. Advancement in experimental techniques has led to an increase in the volume of material science data encouraging material scientists to investigate data-driven solutions to scientific problems. While the resources available to get started with ML are ever increasing, there is little literature on traversing through the space of decisions that need to be made to implement a robust and trustworthy ML solution. A lack of such resources leads to researchers wading through articles and papers trying to determine the best approach for their problem and sometimes also falling prey to pitfalls in a real-world scenario. This paper aims to act as a guide for researchers who want to strategically approach a ML solution to their problem through the use of domain knowledge and systematic evaluation of the major aspects of a ML pipeline. We focus on four aspects of the ML pipeline: (1) problem formulation, (2) data curation, (3) feature representation and model selection, and (4) model generalizability and real-world performance. In each case, we discuss the space of decisions, provide examples from scientific literature, and illustrate how different choices can affect the outcome through a case study of predicting compressive strength of uniaxially pressed molecular solid, 2,4,6-triamino-1,3,5-trinitrobenzene (TATB) samples. Using a similar approach of critical thinking along with rigorous evaluation and diagnostics, researchers can be assured of the reliability of predictions from their ML models.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Strategic Approach to Machine Learning for Material Science: How to Tackle Real-World Challenges and Avoid Pitfalls

Abstract

Talk to us

Similar Papers

More From: Chemistry of Materials

Lead the way for us

Similar Papers

FireRisk: A Web Platform for next day fire forecasting
Stella Girtsou ... Alex Apostolakis
-
Stella Girtsou, et. al.Stella Girtsou ... Alex Apostolakis
15 May 2023
15 May 2023

Grand rounds in methodology: key considerations for implementing machine learning solutions in quality improvement initiatives
Amol A Verma ... Kaveh G Shojania
BMJ Quality & Safety | VOL. 33
Amol A Verma, et. al.Amol A Verma ... Kaveh G Shojania
23 Nov 2023
BMJ Quality & Safety | VOL. 33

Toward Rapid Development and Deployment of Machine Learning Pipelines across Cloud-Edge
Anirban Bhattacharjee ... Thomas Damiano
-
Anirban Bhattacharjee, et. al.Anirban Bhattacharjee ... Thomas Damiano
12 Aug 2021
12 Aug 2021

Hyperparameter Tuning and Pipeline Optimization via Grid Search Method and Tree-Based AutoML in Breast Cancer Prediction.
Siti Fairuz Mat Radzi ... Mohd Amiruddin Abd Rahman
Journal of Personalized Medicine | VOL. 11
Siti Fairuz Mat Radzi, et. al.Siti Fairuz Mat Radzi ... Mohd Amiruddin Abd Rahman
29 Sep 2021
Journal of Personalized Medicine | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Strategic Approach to Machine Learning for Material Science: How to Tackle Real-World Challenges and Avoid Pitfalls

Abstract

Talk to us

Similar Papers

More From: Chemistry of Materials