Novel applications of multitask learning and multiple output regression to multiple genetic trait prediction.

Dan He,Laxmi Parida,David Kuhn

doi:10.1093/bioinformatics/btw249

Dan He, Laxmi Parida + Show 1 more

Open Access

https://doi.org/10.1093/bioinformatics/btw249

Copy DOI

Journal: Bioinformatics	Publication Date: Jun 11, 2016
Citations: 62	License type: CC BY-NC 4.0

Affiliation: U.S. Horticultural Research Laboratory

Abstract

Given a set of biallelic molecular markers, such as SNPs, with genotype values encoded numerically on a collection of plant, animal or human samples, the goal of genetic trait prediction is to predict the quantitative trait values by simultaneously modeling all marker effects. Genetic trait prediction is usually represented as linear regression models. In many cases, for the same set of samples and markers, multiple traits are observed. Some of these traits might be correlated with each other. Therefore, modeling all the multiple traits together may improve the prediction accuracy. In this work, we view the multitrait prediction problem from a machine learning angle: as either a multitask learning problem or a multiple output regression problem, depending on whether different traits share the same genotype matrix or not. We then adapted multitask learning algorithms and multiple output regression algorithms to solve the multitrait prediction problem. We proposed a few strategies to improve the least square error of the prediction from these algorithms. Our experiments show that modeling multiple traits together could improve the prediction accuracy for correlated traits. Availability and implementation: The programs we used are either public or directly from the referred authors, such as MALSAR (http://www.public.asu.edu/~jye02/Software/MALSAR/) package. The Avocado data set has not been published yet and is available upon request.Contact: dhe@us.ibm.com

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Novel applications of multitask learning and multiple output regression to multiple genetic trait prediction.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics

Lead the way for us

Similar Papers

Performance evaluation of different encoding strategies for quantitative genetic trait prediction
Oyetunji E Ogundijo ... Laxmi Parida
-
Oyetunji E Ogundijo, et. al.Oyetunji E Ogundijo ... Laxmi Parida
01 Oct 2015
01 Oct 2015

Does encoding matter? A novel view on the quantitative genetic trait prediction problem.
Dan He ... Laxmi Parida
BMC Bioinformatics | VOL. Suppl 17 9
Dan He, et. al.Dan He ... Laxmi Parida
01 Jul 2016
BMC Bioinformatics | VOL. Suppl 17 9

Does encoding matter? A novel view on the quantitative genetic trait prediction problem
Dan He ... Laxmi Parida
-
Dan He, et. al. Dan He ... Laxmi Parida
01 Nov 2015
01 Nov 2015

Data-driven encoding for quantitative genetic trait prediction.
Dan He ... Laxmi Parida
BMC Bioinformatics | VOL. Suppl 16 1
Dan He, et. al.Dan He ... Laxmi Parida
18 Feb 2015
BMC Bioinformatics | VOL. Suppl 16 1

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Novel applications of multitask learning and multiple output regression to multiple genetic trait prediction.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics