Abstract
PHOTONAI is a high-level Python API designed to simplify and accelerate machine learning model development. It functions as a unifying framework that allows the user to easily access and combine algorithms from different toolboxes into custom algorithm sequences. It is specifically designed to support the iterative model development process and automates the repetitive training, hyperparameter optimization, and evaluation tasks. Importantly, the workflow ensures unbiased performance estimates while still allowing the user to fully customize the machine learning analysis. PHOTONAI extends existing solutions with a novel pipeline implementation supporting more complex data streams, feature combinations, and algorithm selection. Metrics and results can be conveniently visualized using the PHOTONAI Explorer, and predictive models are shareable in a standardized format for further external validation or application. A growing add-on ecosystem allows researchers to offer data-modality-specific algorithms to the community and enhance machine learning in the life sciences. Its practical utility is demonstrated on an exemplary medical machine learning problem, achieving a state-of-the-art solution in a few lines of code. The source code is publicly available on GitHub, while examples and documentation can be found at www.photon-ai.com.
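To make the "few lines of code" claim concrete, the following minimal sketch shows how such an analysis can be set up. It follows the public PHOTONAI documentation at www.photon-ai.com; the dataset, optimizer, metrics, and hyperparameter range are illustrative assumptions rather than the paper's own example, and exact names may vary between PHOTONAI versions.

# Minimal PHOTONAI analysis sketch (assumption: current public API).
# The Hyperpipe handles nested cross-validation, hyperparameter
# optimization, and metric computation.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import KFold

from photonai.base import Hyperpipe, PipelineElement
from photonai.optimization import FloatRange

# Outer folds yield the unbiased performance estimate,
# inner folds drive the hyperparameter search.
pipe = Hyperpipe('basic_svm_pipe',
                 optimizer='random_grid_search',
                 optimizer_params={'n_configurations': 25},
                 metrics=['accuracy', 'balanced_accuracy', 'f1_score'],
                 best_config_metric='f1_score',
                 outer_cv=KFold(n_splits=5),
                 inner_cv=KFold(n_splits=5))

# Build the pipeline as a sequence of named elements.
pipe += PipelineElement('StandardScaler')
pipe += PipelineElement('SVC',
                        hyperparameters={'C': FloatRange(0.1, 100)},
                        kernel='rbf')

X, y = load_breast_cancer(return_X_y=True)  # illustrative dataset
pipe.fit(X, y)

Because elements are referenced by name, swapping a preprocessing step or estimator changes a single line while the surrounding training and evaluation logic stays untouched.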
Highlights
Interest in machine learning for medical, biological, and life science research has increased significantly in recent years
We propose PHOTONAI as a high-level Python Application Programming Interface (API) that acts as a mediator between different toolboxes
Examining the results of the three learning algorithms in the class-balancing pipeline, we further see that the Random Forest (f1 = 0.76) still outperforms gradient boosting and the Support Vector Machine (a setup sketched below)
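The corresponding setup can be sketched as follows, again based on the public PHOTONAI documentation: a Switch element lets the hyperparameter optimizer choose among the three competing learners, and the ImbalancedDataTransformer (PHOTONAI's wrapper around imbalanced-learn) performs the class balancing. The resampling method, hyperparameter range, and cross-validation scheme are assumptions for illustration, not the paper's exact configuration.

# Sketch of a class-balancing pipeline with algorithm selection
# (assumption: element names as registered in current PHOTONAI).
from sklearn.model_selection import StratifiedKFold

from photonai.base import Hyperpipe, PipelineElement, Switch
from photonai.optimization import IntegerRange

pipe = Hyperpipe('class_balancing_pipe',
                 optimizer='grid_search',
                 metrics=['f1_score', 'balanced_accuracy'],
                 best_config_metric='f1_score',
                 outer_cv=StratifiedKFold(n_splits=5),
                 inner_cv=StratifiedKFold(n_splits=5))

pipe += PipelineElement('StandardScaler')

# Oversample the minority class (illustrative choice of method).
pipe += PipelineElement('ImbalancedDataTransformer', method_name='SMOTE')

# The Switch lets the optimizer pick the best of three competing learners.
pipe += Switch('estimator_switch', [
    PipelineElement('RandomForestClassifier',
                    hyperparameters={'n_estimators': IntegerRange(50, 300)}),
    PipelineElement('GradientBoostingClassifier'),
    PipelineElement('SVC'),
])

# pipe.fit(X, y)  # X, y: your (imbalanced) classification data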
Summary
The interest in machine learning for medical, biological, and life science research has increased significantly. Yet the basic workflow used to construct, optimize, and evaluate a machine learning model has remained virtually unchanged. In essence, it can be framed as the (systematic) search for the best combination of data processing steps, learning algorithms, and hyperparameter values under the premise of unbiased performance estimation. The object of this iterative optimization is the machine learning pipeline, which in this context is defined as the sequence of algorithms applied successively to the data.
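The premise of unbiased performance estimation is commonly implemented through nested cross-validation, which PHOTONAI automates. As a conceptual illustration, a plain scikit-learn version of the scheme might look like this; the dataset, estimator, and hyperparameter grid are arbitrary choices for the sketch.

# Nested cross-validation: the inner loop selects hyperparameters,
# the outer loop scores each selected model on data that never
# influenced the search, giving an unbiased performance estimate.
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import GridSearchCV, KFold, cross_val_score
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

X, y = load_breast_cancer(return_X_y=True)

# The pipeline: a sequence of processing steps ending in a learner.
pipeline = Pipeline([('scaler', StandardScaler()), ('svm', SVC())])

# Inner loop: hyperparameter search over the pipeline.
inner_search = GridSearchCV(pipeline,
                            param_grid={'svm__C': [0.1, 1, 10]},
                            cv=KFold(n_splits=5, shuffle=True, random_state=0))

# Outer loop: average the outer-fold test scores.
scores = cross_val_score(inner_search, X, y,
                         cv=KFold(n_splits=5, shuffle=True, random_state=1))
print(f'Nested CV accuracy: {scores.mean():.3f} +/- {scores.std():.3f}')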