Abstract

Understanding of phenotypes and their genetic basis is a major focus in current plant biology. Large amounts of phenotype data are being generated, both for macroscopic phenotypes such as size or yield, and for molecular phenotypes such as expression levels and metabolite levels. More insight in the underlying genetic and molecular mechanisms that influence phenotypes will enable a better understanding of how various phenotypes are related to each other. This will be a major step forward in understanding plant biology, with immediate value for plant breeding and academic plant research. Currently the genetic basis of most phenotypes remains however to be discovered, and the relatedness of different traits is unclear. We here present a novel approach to connect phenotypes to underlying biological processes and molecular functions. These connections define similarities between different types of phenotypes. The approach starts by using Quantitative Trait Locus (QTL) data, which are abundantly available for many phenotypes of interest. Overrepresentation analysis of gene functions based on Gene Ontology term enrichment across multiple QTL regions for a given phenotype, be it macroscopic or molecular, results in a small set of biological processes and molecular functions for each phenotype. Subsequently, similarity between different phenotypes can be defined in terms of these gene functions. Using publicly available rice data as example, a close relationship with defined molecular phenotypes is demonstrated for many macroscopic phenotypes. This includes for example a link between ‘leaf senescence’ and ‘aspartic acid’, as well as between ‘days to maturity’ and ‘choline’. Relationships between macroscopic and molecular phenotypes may result in more efficient marker-assisted breeding and are likely to direct future research aimed at a better understanding of plant phenotypes.

Highlights

  • A major issue in current biological research is to translate differences in phenotype to variation in genotype

  • We previously demonstrated how macroscopic traits can be linked to underlying biological process (BP) ontology terms with overrepresentation analysis using Quantitative Trait Locus (QTL) data [15]

  • In other words, related gene functions are assumed to play a role in multiple different QTL regions for any given trait

Read more

Summary

Introduction

A major issue in current biological research is to translate differences in phenotype to variation in genotype. High-throughput ‘omics’ approaches generate large amounts of molecular phenotypes (transcriptome, proteome, metabolome) and new developments in high-throughput phenotyping [2, 3] are generating increasingly large datasets of macroscopic phenotypes Both molecular and macroscopic phenotypes are combined with genetic data using the approaches of quantitative genetics [4], resulting in a variety of quantitative trait loci (QTLs) [5,6,7,8,9]: eQTLs (gene expression data), mQTLs (metabolite data), and phQTLs (macroscopic data). Related approaches have been applied to study genetic correlations between human gene expression and traits [12] Such results hold the promise of defining a molecular trait, or a combination of molecular traits, as a biomarker for a given macroscopic trait, equivalent to the use of bio(chemical)markers for disease traits in human health research [13]. A better connection between macroscopic traits and molecular traits is warranted

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call