Network-Based Segmentation of Biological Multivariate Time Series

Nooshin Omranian,Sebastian Klie,Bernd Mueller-Roeber,Zoran Nikoloski

doi:10.1371/journal.pone.0062974

Abstract

Molecular phenotyping technologies (e.g., transcriptomics, proteomics, and metabolomics) offer the possibility to simultaneously obtain multivariate time series (MTS) data from different levels of information processing and metabolic conversions in biological systems. As a result, MTS data capture the dynamics of biochemical processes and components whose couplings may involve different scales and exhibit temporal changes. Therefore, it is important to develop methods for determining the time segments in MTS data, which may correspond to critical biochemical events reflected in the coupling of the system’s components. Here we provide a novel network-based formalization of the MTS segmentation problem based on temporal dependencies and the covariance structure of the data. We demonstrate that the problem of partitioning MTS data into segments to maximize a distance function, operating on polynomially computable network properties, often used in analysis of biological network, can be efficiently solved. To enable biological interpretation, we also propose a breakpoint-penalty (BP-penalty) formulation for determining MTS segmentation which combines a distance function with the number/length of segments. Our empirical analyses of synthetic benchmark data as well as time-resolved transcriptomics data from the metabolic and cell cycles of Saccharomyces cerevisiae demonstrate that the proposed method accurately infers the phases in the temporal compartmentalization of biological processes. In addition, through comparison on the same data sets, we show that the results from the proposed formalization of the MTS segmentation problem match biological knowledge and provide more rigorous statistical support in comparison to the contending state-of-the-art methods.

Highlights

Time-resolved data from different cellular processes hold the promise of identifying the dynamics and relations of key system descriptors mapped into putative metabolic reactions, allosteric regulations, and entire signaling pathways
Yeast’s Metabolic and Cell Cycles Motivated by the accurate predictions from applying the framework on the synthetic data set, we investigated the multivariate time series (MTS) segmentation of transcriptomics data sets from the Saccharomyces cerevisiae metabolic cycle [36] (YMC), cell cycle [37] (YCC), and the experiment capturing the effect of oxidative stress, induced by hydrogen peroxide (HP), on the yeast’s cell cycle [38]
Analysis of MTS data can be used to identify the key biological processes involved in the adjustment of the cellular states

Summary

Introduction

Time-resolved data from different cellular processes hold the promise of identifying the dynamics and relations of key system descriptors mapped into putative metabolic reactions, allosteric regulations, and entire signaling pathways. These data are usually referred to as multivariate time series (MTS) since high-throughput technologies allow for simultaneous monitoring of multiple biological entities (i.e., genes, proteins, metabolites) over time. Each segment is represented by either a single quantity, e.g., the mean/median of the time series elements in the segment or the slope of the line yielding the best fit [3]. The difference between a given segment and its representative is measured by using some distance measure d (e.g., Euclidean distance)

Objectives

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLoS ONE	Publication Date: May 7, 2013
Citations: 38	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Network-Based Segmentation of Biological Multivariate Time Series

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS ONE

Lead the way for us

Similar Papers

On the Stationarity of Multivariate Time Series for Correlation-Based Data Analysis
Kiyoung Yang ... C Shahabi
-
Kiyoung Yang, et. al. Kiyoung Yang ... C Shahabi
27 Nov 2005
27 Nov 2005

Feature Selection for Multivariate Time Series via Network Pruning
Kang Gu ... Soroush Vosoughi
-
Kang Gu, et. al.Kang Gu ... Soroush Vosoughi
01 Dec 2021
01 Dec 2021

Matrix-based vs. vector-based linear discriminant analysis: A comparison of regularized variants on multivariate time series data
Jianhua Zhao ... Zhen Wang
Information Sciences | VOL. 654
Jianhua Zhao, et. al.Jianhua Zhao ... Zhen Wang
08 Nov 2023
Information Sciences | VOL. 654

Information complexity criteria for detecting influential observations in dynamic multivariate linear models using the genetic algorithm
Hamparsum Bozdogan ... Peter Bearse
Journal of Statistical Planning and Inference | VOL. 114
Hamparsum Bozdogan, et. al.Hamparsum Bozdogan ... Peter Bearse
15 Nov 2002
Journal of Statistical Planning and Inference | VOL. 114

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Network-Based Segmentation of Biological Multivariate Time Series

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS ONE