An ultra-fast time series distance measure to allow data mining in more complex real-world deployments

Shaghayegh Gharghabi,Amirali Darvishzadeh,Eamonn Keogh,Shima Imani,Anthony Bagnall

doi:10.1007/s10618-020-00695-8

Abstract

At their core, many time series data mining algorithms reduce to reasoning about the shapes of time series subsequences. This requires an effective distance measure, and for last two decades most algorithms use Euclidean distance or DTW as their core subroutine. We argue that these distance measures are not as robust as the community seems to believe. The undue faith in these measures perhaps derives from an overreliance on the benchmark datasets and self-selection bias. The community is simply reluctant to address more difficult domains, for which current distance measures are ill-suited. In this work, we introduce a novel distance measure MPdist. We show that our proposed distance measure is much more robust than current distance measures. For example, it can handle data with missing values or spurious regions. Furthermore, it allows us to successfully mine datasets that would defeat any Euclidean or DTW distance-based algorithm. Additionally, we show that our distance measure can be computed so efficiently as to allow analytics on very fast arriving streams.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An ultra-fast time series distance measure to allow data mining in more complex real-world deployments

Abstract

Talk to us

Similar Papers

More From: Data Mining and Knowledge Discovery

Lead the way for us

Journal: Data Mining and Knowledge Discovery	Publication Date: May 30, 2020
Citations: 20

Similar Papers

Matrix Profile XII: MPdist: A Novel Time Series Distance Measure to Allow Data Mining in More Challenging Scenarios
Shaghayegh Gharghabi ... Eamonn Keogh
-
Shaghayegh Gharghabi, et. al.Shaghayegh Gharghabi ... Eamonn Keogh
01 Nov 2018
01 Nov 2018

Searching and Mining Trillions of Time Series Subsequences under Dynamic Time Warping.
Thanawin Rakthanmanon ... Eamonn Keogh
KDD : proceedings. International Conference on Knowledge Discovery & Data Mining | VOL. 2012
Thanawin Rakthanmanon, et. al.Thanawin Rakthanmanon ... Eamonn Keogh
12 Aug 2012
KDD : proceedings. International Conference on Knowledge Discovery & Data Mining | VOL. 2012

Addressing Big Data Time Series
Thanawin Rakthanmanon ... Brandon Westover
ACM Transactions on Knowledge Discovery from Data | VOL. 7
Thanawin Rakthanmanon, et. al.Thanawin Rakthanmanon ... Brandon Westover
01 Sep 2013
ACM Transactions on Knowledge Discovery from Data | VOL. 7

CID: an efficient complexity-invariant distance for time series
Gustavo E A P A Batista ... Vinícius M A De Souza
Data Mining and Knowledge Discovery | VOL. 28
Gustavo E A P A Batista, et. al.Gustavo E A P A Batista ... Vinícius M A De Souza
12 Apr 2013
Data Mining and Knowledge Discovery | VOL. 28

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An ultra-fast time series distance measure to allow data mining in more complex real-world deployments

Abstract

Talk to us

Similar Papers

More From: Data Mining and Knowledge Discovery