Abstract
When studying time courses of biological measurements and comparing these to other measurements eg. gene expression and phenotypic endpoints, the analysis is complicated by the fact that although the associated elements may show the same patterns of behaviour, the changes do not occur simultaneously. In these cases standard correlation-based measures of similarity will fail to find significant associations. Dynamic time warping (DTW) is a technique which can be used in these situations to find the optimal match between two time courses, which may then be assessed for its significance. We implement DTW4Omics, a tool for performing DTW in R. This tool extends existing R scripts for DTW making them applicable for “omics” datasets where thousands entities may need to be compared with a range of markers and endpoints. It includes facilities to estimate the significance of the matches between the supplied data, and provides a set of plots to enable the user to easily visualise the output. We illustrate the utility of this approach using a dataset linking the exposure of the colon carcinoma Caco-2 cell line to oxidative stress by hydrogen peroxide (H2O2) and menadione across 9 timepoints and show that on average 85% of the genes found are not obtained from a standard correlation analysis between the genes and the measured phenotypic endpoints. We then show that when we analyse the genes identified by DTW4Omics as significantly associated with a marker for oxidative DNA damage (8-oxodG), through over-representation, an Oxidative Stress pathway is identified as the most over-represented pathway demonstrating that the genes found by DTW4Omics are biologically relevant. In contrast, when the positively correlated genes were similarly analysed, no pathways were found. The tool is implemented as an R Package and is available, along with a user guide from http://web.tgx.unimaas.nl/svn/public/dtw/.
Highlights
Time courses provide insight into patterns and sequential biological events, and temporal studies are an important tool in biological research
We introduced a new tool for analysing time-course data, DTW4Omics which uses Dynamic time warping (DTW) to generate a list of genes whose time course may be minimally adjusted to obtain an optimal match to that of an endpoint
Many of the genes found in our test were not significant using correlation analysis, and this tool is complementary to such an analysis and we recommend applying both approaches in combination
Summary
Time courses provide insight into patterns and sequential biological events, and temporal studies are an important tool in biological research. The systems we study are not static, but change dynamically over time. A large amount of ’omics research is currently performed by taking samples at a single time point and seeking the significantly changed genes, proteins and/or metabolites. Given the dynamic features of biological systems we know the chosen time point will strongly influence the obtainable results. Even when aspects of the system are related and display similar patterns of change over time, we expect to see delays and differences in the speed of this change. Under these circumstances standard correlation analysis can often fail to find any association between the elements, and here DTW may be used
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have