BenchML: an extensible pipelining framework for benchmarking representations of materials and molecules at scale

Carl Poelking,Bingqing Cheng,Felix A Faber

doi:10.1088/2632-2153/ac4d11

Abstract

We introduce a machine-learning (ML) framework for high-throughput benchmarking of diverse representations of chemical systems against datasets of materials and molecules. The guiding principle underlying the benchmarking approach is to evaluate raw descriptor performance by limiting model complexity to simple regression schemes while enforcing best ML practices, allowing for unbiased hyperparameter optimization, and assessing learning progress through learning curves along series of synchronized train-test splits. The resulting models are intended as baselines that can inform future method development, in addition to indicating how easily a given dataset can be learnt. Through a comparative analysis of the training outcome across a diverse set of physicochemical, topological and geometric representations, we glean insight into the relative merits of these representations as well as their interrelatedness.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Machine Learning: Science and Technology	Publication Date: Nov 17, 2022
Citations: 2	License type: cc-by

R Discovery Prime

R Discovery Prime

BenchML: an extensible pipelining framework for benchmarking representations of materials and molecules at scale

Abstract

Talk to us

Similar Papers

More From: Machine Learning: Science and Technology

Lead the way for us

Similar Papers

GEIN: An interpretable benchmarking framework towards all building types based on machine learning
Xiaoyu Jin ... Ao Li
Energy and Buildings | VOL. 260
Xiaoyu Jin, et. al.Xiaoyu Jin ... Ao Li
03 Feb 2022
Energy and Buildings | VOL. 260

Improving Deepwater Facility Uptime Using Machine Learning Approach
Ajay Singh ... Rohit Srikonda
-
Ajay Singh, et. al.Ajay Singh ... Rohit Srikonda
23 Sep 2019
23 Sep 2019

Lowering Carbon Foot-Print by Increasing Operational Efficiency Using Adaptive Machine Learning
Ajay Singh ... Michael Brinkley
-
Ajay Singh, et. al.Ajay Singh ... Michael Brinkley
26 Sep 2022
26 Sep 2022

A Deep Learning Framework for Sickle Cell Disease Microfluidic Biomarker Assays
Niksa Praljak ... Michael Hinczewski
Blood | VOL. 136
Niksa Praljak, et. al.Niksa Praljak ... Michael Hinczewski
05 Nov 2020
Blood | VOL. 136

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

BenchML: an extensible pipelining framework for benchmarking representations of materials and molecules at scale

Abstract

Talk to us

Similar Papers

More From: Machine Learning: Science and Technology