Small Data Can Play a Big Role in Chemical Discovery.

Hadas Shalit Peleg,Anat Milo

doi:10.1002/anie.202219070

Small Data Can Play a Big Role in Chemical Discovery.

Hadas Shalit Peleg, Anat Milo

Open Access

https://doi.org/10.1002/anie.202219070

Copy DOI

Journal: Angewandte Chemie (International ed. in English)	Publication Date: Apr 26, 2023
Citations: 5	License type: CC BY-NC-ND 4.0

Affiliation: Ben-Gurion University of the Negev

#Small Data #Approach In Chemistry + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

The chemistry community is currently witnessing a surge of scientific discoveries in organic chemistry supported by machine learning (ML) techniques. Whereas many of these techniques were developed for big data applications, the nature of experimental organic chemistry often confines practitioners to small datasets. Herein, we touch upon the limitations associated with small data in ML and emphasize the impact of bias and variance on constructing reliable predictive models. We aim to raise awareness to these possible pitfalls, and thus, provide an introductory guideline for good practice. Ultimately, we stress the great value associated with statistical analysis of small data, which can be further boosted by adopting a holistic data-centric approach in chemistry.

Full Text