Abstract

Enabling discoveries and foundational understanding in modern day life sciences have largely become centered on our ability to effectively analyze large swathes of complex data from a diverse range of sources, capturing complex information encapsulated across the different layers of the nature-built system. While this data-centric approach has been the primary driver in computational life sciences and discovery pipelines for several decades now, the field has decisively diverged in the last few years on how and why these data are collected. More specifically, in contrast to yesteryear genomic and other -omic projects, modern day data collection by and large happens in an analysis-agnostic fashion---i.e., complex data are collected without any specific hypotheses to drive them; instead data are being collected because of easy availability of affordable high-throughput technologies. This has led to a fundamental shift in how we process these data and what we could glean from these data. In this work, we present a novel algorithmic and software framework called Hyppo-X, which is based on algebraic topology to discover hidden structure within complex biological data sets [1, 3]. Topology is the field of computational mathematics that deals with structure at large. Computational topology and its applications constitute an emerging area of research with ample scope for development and data-driven discovery. We present results of our extensive collaborative studies in developing and applying our methods to analyze two types of data---plant phenomics data obtained from agricultural fields [2], and patient trajectories obtained from a network of hospitals toward antimicrobial stewardship [4]. Topological data analysis holds tremendous promise to model and analyze high-dimensional data sets in numerous scientific domains, and are likely to become part of future machine learning pipelines. These early studies demonstrate its potential while also highlighting a number of challenges and opportunities for future research. The software is available for download at https://mhmethun.com/HYPPO-X/.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call