Comparison of machine learning methods for genomic prediction of selected Arabidopsis thaliana traits.

Ciaran Michael Kelly,Russell Lewis Mclaughlin

doi:10.1371/journal.pone.0308962

Abstract

We present a comparison of machine learning methods for the prediction of four quantitative traits in Arabidopsis thaliana. High prediction accuracies were achieved on individuals grown under standardized laboratory conditions from the 1001 Arabidopsis Genomes Project. An existing body of evidence suggests that linear models may be impeded by their inability to make use of non-additive effects to explain phenotypic variation at the population level. The results presented here use a nested cross-validation approach to confirm that some machine learning methods have the ability to statistically outperform linear prediction models, with the optimal model dependent on availability of training data and genetic architecture of the trait in question. Linear models were competitive in their performance as per previous work, though the neural network class of predictors was observed to be the most accurate and robust for traits with high heritability. The extent to which non-linear models exploit interaction effects will require further investigation of the causal pathways that lay behind their predictions. Future work utilizing more traits and larger sample sizes, combined with an improved understanding of their respective genetic architectures, may lead to improvements in prediction accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Comparison of machine learning methods for genomic prediction of selected Arabidopsis thaliana traits.

Abstract

Talk to us

Similar Papers

More From: PloS one

Lead the way for us

Journal: PloS one	Publication Date: Aug 28, 2024
License type: CC BY 4.0

Similar Papers

Comparative analysis of statistical and machine learning methods for predicting faulty modules
Ruchika Malhotra
Applied Soft Computing | VOL. 21
Ruchika MalhotraRuchika Malhotra
31 Mar 2014
Applied Soft Computing | VOL. 21

A Brief Survey of Machine Learning Methods in Protein Sub-Golgi Localization
Wuritu Yang ... Xiao-Juan Zhu
Current Bioinformatics | VOL. 14
Wuritu Yang, et. al.Wuritu Yang ... Xiao-Juan Zhu
07 Mar 2019
Current Bioinformatics | VOL. 14

Investigation of Machine Learning Methods for Prediction of Measured Values of Atmospheric Channel for Hybrid FSO/RF System
Maroš Lapčák ... Norbert Zdravecký
Photonics | VOL. 9
Maroš Lapčák, et. al.Maroš Lapčák ... Norbert Zdravecký
28 Jul 2022
Photonics | VOL. 9

Feasibility of machine learning methods for predicting hospital emergency room visits for respiratory diseases.
Jiaying Lu ... Pengju Bu
Environmental Science and Pollution Research | VOL. 28
Jiaying Lu, et. al.Jiaying Lu ... Pengju Bu
10 Feb 2021
Environmental Science and Pollution Research | VOL. 28

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Comparison of machine learning methods for genomic prediction of selected Arabidopsis thaliana traits.

Abstract

Talk to us

Similar Papers

More From: PloS one