Abstract

Phenomics is an emerging branch of modern biology that uses high throughput phenotyping tools to capture multiple environmental and phenotypic traits, often at massive spatial and temporal scales. The resulting high dimensional data represent a treasure trove of information for providing an in-depth understanding of how multiple factors interact and contribute to the overall growth and behavior of different genotypes. However, computational tools that can parse through such complex data and aid in extracting plausible hypotheses are currently lacking. In this article, we present Hyppo-X, a new algorithmic approach to visually explore complex phenomics data and in the process characterize the role of environment on phenotypic traits. We model the problem as one of unsupervised structure discovery, and use emerging principles from algebraic topology and graph theory for discovering higher-order structures of complex phenomics data. We present an open source software which has interactive visualization capabilities to facilitate data navigation and hypothesis formulation. We test and evaluate Hyppo-X on two real-world plant (maize) data sets. Our results demonstrate the ability of our approach to delineate divergent subpopulation-level behavior. Notably, our approach shows how environmental factors could influence phenotypic behavior, and how that effect varies across different genotypes and different time scales. To the best of our knowledge, this effort provides one of the first approaches to systematically formalize the problem of hypothesis extraction for phenomics data. Considering the infancy of the phenomics field, tools that help users explore complex data and extract plausible hypotheses in a data-guided manner will be critical to future advancements in the use of such data.

Highlights

  • HIGH-THROUGHPUT technologies are beginning to change the way we observe and measure the natural world

  • We present some related work, both in topology and in plant phenomics, in order to put our contributions in context

  • A topologybased approach was rated as the best overall entry at an expression QTL visualization competition organized by the BioVis community [16]

Read more

Summary

INTRODUCTION

HIGH-THROUGHPUT technologies are beginning to change the way we observe and measure the natural world. P ) [3], [4] To address this fundamental albeit broad goal, plant biologists and farmers have started to widely deploy an array of high-throughput sensing technologies that measure tens of crop phenotypic traits in the field (e.g., crop height, growth characteristics, photosynthetic activity). These technologies, comprising mostly of camera and other recording devices, generate a wealth of images (visual, infrared, thermal) and time-lapse videos that represent a detailed set of observations of a crop population as it develops over the course of the growing season. We formulate the problem of hypothesis extraction as one of: (a) identifying the key connected structural features of the given data, and (b) exploring the structural features in a way to facilitate extraction of plausible hypotheses

Structure Identification
Topological Object Exploration
Software
RELATED WORK
Tools for Plant Phenomics
HYPPO-X
Filtering
Generation of Partial Clusters
Construction of Simplicial Complexes
Graph Formulation
Persistent Homology
Computational Complexity Analysis
EXTRACTING INTERESTING FEATURES
Interesting Paths
The MAX-IP Problem
An Efficient Heuristic for IP
EXPERIMENTAL EVALUATION
Single Filter Function
Two Filter Functions
Irrigation Data Set
Findings
CONCLUSION
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.