Abstract
Time Series Classification (TSC) involves building predictive models for a discrete target variable from ordered, real valued, attributes. Over recent years, a new set of TSC algorithms have been developed which have made significant improvement over the previous state of the art. The main focus has been on univariate TSC, i.e. the problem where each case has a single series and a class label. In reality, it is more common to encounter multivariate TSC (MTSC) problems where the time series for a single case has multiple dimensions. Despite this, much less consideration has been given to MTSC than the univariate case. The UCR archive has provided a valuable resource for univariate TSC, and the lack of a standard set of test problems may explain why there has been less focus on MTSC. The UEA archive of 30 MTSC problems released in 2018 has made comparison of algorithms easier. We review recently proposed bespoke MTSC algorithms based on deep learning, shapelets and bag of words approaches. If an algorithm cannot naturally handle multivariate data, the simplest approach to adapt a univariate classifier to MTSC is to ensemble it over the multivariate dimensions. We compare the bespoke algorithms to these dimension independent approaches on the 26 of the 30 MTSC archive problems where the data are all of equal length. We demonstrate that four classifiers are significantly more accurate than the benchmark dynamic time warping algorithm and that one of these recently proposed classifiers, ROCKET, achieves significant improvement on the archive datasets in at least an order of magnitude less time than the other three.
Highlights
Time series classification (TSC) is a form of machine learning where the features of the input vector are real valued and ordered
We find that dynamic time warping (DTW) is still hard to beat in multivariate TSC (MTSC), but that four algorithms are significantly more accurate than this benchmark on this archive
We provide an overview of MTSC and the classifiers we evaluate in Sect. 2, and the datasets used in Sect
Summary
Time series classification (TSC) is a form of machine learning where the features of the input vector are real valued and ordered. This scenario adds a layer of complexity to the problem, as important characteristics of the data can be missed by traditional algorithms. The main focus has been on univariate TSC, i.e. the problem where each case has a single series and a class label. It is more common to encounter multivariate TSC (MTSC) problems where the time series for a single case has multiple dimensions. In MTSC, the time series is a list of vectors over d dimensions and m observations, X =< x1, . Algorithms for MTSC can be categorised in similar ways as algorithms for univariate TSC on whether they are based on: distance measures; shapelets; histograms over a dictionary; interval summarising; or deep learning/neural networks
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have