Abstract

The process of collecting and organizing sets of observations represents a common theme throughout the history of science. However, despite the ubiquity of scientists measuring, recording and analysing the dynamics of different processes, an extensive organization of scientific time-series data and analysis methods has never been performed. Addressing this, annotated collections of over 35 000 real-world and model-generated time series, and over 9000 time-series analysis algorithms are analysed in this work. We introduce reduced representations of both time series, in terms of their properties measured by diverse scientific methods, and of time-series analysis methods, in terms of their behaviour on empirical time series, and use them to organize these interdisciplinary resources. This new approach to comparing across diverse scientific data and methods allows us to organize time-series datasets automatically according to their properties, retrieve alternatives to particular analysis methods developed in other scientific disciplines and automate the selection of useful methods for time-series classification and regression tasks. The broad scientific utility of these tools is demonstrated on datasets of electroencephalograms, self-affine time series, heartbeat intervals, speech signals and others, in each case contributing novel analysis techniques to the existing literature. Highly comparative techniques that compare across an interdisciplinary literature can thus be used to guide more focused research in time-series analysis for applications across the scientific disciplines.

Highlights

  • Time series, measurements of a quantity taken over time, are fundamental data objects studied across the scientific disciplines, including measurements of stock prices in finance, ion fluxes in astrophysics, atmospheric air temperatures in meteorology and human heartbeats in medicine

  • In order to understand the structure in these signals and the mechanisms that underlie them, scientists have developed a large variety of techniques: methods based on fluctuation analysis are frequently used in physics [1], generalized autoregressive conditional heteroskedasticity (GARCH) models are common in economics [2] and entropy measures such as sample entropy are popular in medical timeseries analysis [3], for example

  • Relationships are determined by comparing empirical behaviour: the outputs of the methods applied to data, and the properties of the data as measured by the methods

Read more

Summary

Introduction

Measurements of a quantity taken over time, are fundamental data objects studied across the scientific disciplines, including measurements of stock prices in finance, ion fluxes in astrophysics, atmospheric air temperatures in meteorology and human heartbeats in medicine. In order to understand the structure in these signals and the mechanisms that underlie them, scientists have developed a large variety of techniques: methods based on fluctuation analysis are frequently used in physics [1], generalized autoregressive conditional heteroskedasticity (GARCH) models are common in economics [2] and entropy measures such as sample entropy are popular in medical timeseries analysis [3], for example Are these methods that have been developed in different disciplinary contexts summarizing time series in unique and useful ways, or is there something to be learned by synthesizing and comparing them?

Framework
Empirical structure
Empirical structure of time-series analysis methods
Empirical structure of time series
Applications
Electroencephalogram recordings
Heart rate variability
Self-affine time series
Parkinsonian speech
Findings
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call