ABBA: adaptive Brownian bridge-based symbolic aggregation of time series

Steven Elsworth, Stefan Güttel

doi:10.1007/s10618-020-00689-6

Journal: Data Mining and Knowledge Discovery
Publication Date: Jun 3, 2020
Citations: 20
License type: open-access

Affiliation: University of Manchester

Abstract

A new symbolic representation of time series, called ABBA, is introduced. It is based on an adaptive polygonal chain approximation of the time series into a sequence of tuples, followed by a mean-based clustering to obtain the symbolic representation. We show that the reconstruction error of this representation can be modelled as a random walk with pinned start and end points, a so-called Brownian bridge. This insight allows us to make ABBA essentially parameter-free, except for the approximation tolerance which must be chosen. Extensive comparisons with the SAX and 1d-SAX representations are included in the form of performance profiles, showing that ABBA is often able to better preserve the essential shape information of time series compared to other approaches, in particular when time warping measures are used. Advantages and applications of ABBA are discussed, including its in-built differencing property and use for anomaly detection, and Python implementations provided.
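
The two stages named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the greedy segment-growing rule, the error bound `(length - 1) * tol**2`, the column scaling, and the small k-means loop are all assumptions chosen for illustration. `compress` builds a polygonal chain as (length, increment) tuples, `digitize` clusters those tuples by their means and assigns one letter per cluster, and `reconstruct` rebuilds the chain from the tuples.

```python
import numpy as np


def compress(ts, tol=0.1):
    """Greedy polygonal chain approximation (illustrative, assumed rule).

    Returns a list of (length, increment) tuples. A segment is extended
    while the squared error between the data and the straight line joining
    the segment end points stays within (length - 1) * tol**2.
    """
    ts = np.asarray(ts, dtype=float)
    n = len(ts)
    pieces = []
    start = 0
    while start < n - 1:
        end = start + 1  # a one-step segment always fits exactly
        while end + 1 < n:
            cand = end + 1
            length = cand - start
            line = ts[start] + (ts[cand] - ts[start]) * np.arange(length + 1) / length
            err = np.sum((line - ts[start:cand + 1]) ** 2)
            if err > (length - 1) * tol ** 2:
                break
            end = cand
        pieces.append((end - start, float(ts[end] - ts[start])))
        start = end
    return pieces


def digitize(pieces, k, n_iter=20):
    """Mean-based clustering of the tuples into k symbols (plain k-means).

    Assumes k <= len(pieces). Columns are scaled by their standard
    deviation, which is an assumption made for this sketch.
    """
    pts = np.array(pieces, dtype=float)
    std = pts.std(axis=0)
    std[std == 0] = 1.0
    scaled = pts / std
    # Deterministic start: pick k tuples spread along the sorted increments.
    order = np.argsort(scaled[:, 1])
    centers = scaled[order[np.linspace(0, len(pts) - 1, k).astype(int)]]
    for _ in range(n_iter):
        dist = ((scaled[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dist.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = scaled[labels == j].mean(axis=0)
    return "".join(chr(ord("a") + j) for j in labels)


def reconstruct(pieces, start_value):
    """Rebuild the polygonal chain from (length, increment) tuples."""
    values = [float(start_value)]
    for length, inc in pieces:
        base = values[-1]
        values.extend(base + inc * np.arange(1, length + 1) / length)
    return np.array(values)
```

In this sketch the reconstruction from the unclustered tuples agrees with the data at every segment end point, because the increments telescope. Reconstructing from cluster centres instead would introduce the kind of accumulated error that the abstract models as a Brownian bridge; that step is not shown here.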

Highlights

Symbolic representations of time series are an active area of research, being useful for many data mining tasks including dimension reduction, motif and rule discovery, prediction, and clustering of time series.
Aside from verifying that adaptive Brownian bridge-based aggregation (ABBA) can represent time series to higher accuracy than Symbolic Aggregate approXimation (SAX) and 1d-SAX using a comparable number of symbols k and string length n, we find that SAX outperforms 1d-SAX when the same number of symbols k is used for both.
We introduced ABBA, an adaptive symbolic time series representation which aims to preserve the essential shape of a time series.

Summary

Introduction

Symbolic representations of time series are an active area of research, being useful for many data mining tasks including dimension reduction, motif and rule discovery, prediction, and clustering of time series. Symbolic time series representations allow for the use of algorithms from text processing and bioinformatics. The time series considered here is sampled at equidistant time points, with values t0, t1, and so on. Despite the large number of dimension-reducing time series representations in the literature, very few are symbolic. Most techniques are numeric in the sense that they reduce a time series to a lower-dimensional vector with its components taken from a continuous range; see Bettaiah and Ranganath (2014), Fu (2011), and Lin et al. (2007) for reviews. The construction of symbolic time series representations typically consists of two parts: a segmentation of the time series, followed by a discretization process that assigns a symbol to each segment.
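
As an illustration of the two-part construction, here is a minimal SAX-style sketch. It is not taken from the source and is not the reference SAX implementation: the function name `sax_like` and its parameters are hypothetical, and the equal-length segments and standard normal breakpoints are assumed design choices. The first part averages equal-length segments of the normalized series, and the second part maps each segment mean to a letter.

```python
from statistics import NormalDist

import numpy as np


def sax_like(ts, n_segments, n_symbols):
    """Two-part symbolic representation in the spirit of SAX (illustrative).

    Assumes the series is not constant, since it is divided by its
    standard deviation.
    """
    ts = np.asarray(ts, dtype=float)
    z = (ts - ts.mean()) / ts.std()

    # Part 1, segmentation: mean of each of n_segments near-equal chunks.
    means = np.array([chunk.mean() for chunk in np.array_split(z, n_segments)])

    # Part 2, discretization: breakpoints that split the standard normal
    # distribution into n_symbols equally probable intervals.
    breakpoints = [NormalDist().inv_cdf(i / n_symbols) for i in range(1, n_symbols)]
    indices = np.searchsorted(breakpoints, means)
    return "".join(chr(ord("a") + i) for i in indices)
```

For example, `sax_like([0, 0, 0, 0, 10, 10, 10, 10], n_segments=2, n_symbols=2)` yields one letter per segment, with the low half mapped to `a` and the high half to `b`. Unlike an adaptive scheme, the segment boundaries here are fixed in advance and do not depend on the shape of the data.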


Similar Papers

Adaptive Segmentation-Based Symbolic Representations of Time Series for Better Modeling and Lower Bounding Distance Measures
Bernard Hugueney
01 Jan 2006

Experiencing SAX: a novel symbolic representation of time series
Jessica Lin ... Li Wei
Data Mining and Knowledge Discovery | VOL. 15
03 Apr 2007

A symbolic representation of time series
Qiang Wang ... V Megalooikonomou
28 Aug 2005

Fuzzy Long Term Forecasting through Machine Learning and Symbolic Representations of Time Series
Bernard Hugueney ... Georges Hébrail
01 Jan 2004
