A symbolic representation of time series

Qiang Wang Qiang Wang,V Megalooikonomou,Guo Li Guo Li

doi:10.1109/isspa.2005.1581023

Qiang Wang Qiang Wang, V Megalooikonomou + Show 1 more

https://doi.org/10.1109/isspa.2005.1581023

Copy DOI

Export

Save

Cite

Publication Date: Aug 28, 2005

Citations: 13

Affiliation: Temple University

Abstract
Full-Text
Similar Papers

Abstract

Listen

Various representations have been proposed for time series to facilitate similarity searches and discovery of interesting patterns. Although the Euclidean distance and its variants have been most frequently used as similarity measures, they are relatively sensitive to noise and fail to provide meaningful information in many cases. Moreover, for time series with high dimensionality, the similarity calculation may be extremely inefficient. To solve this problem, we introduce a new method which gives a symbolic representation of the time series and can dramatically reduce its dimensionality. The method employs Vector Quantization to encode time series using symbols prior to performing similarity analysis. Due to the symbolic representation, we can apply string matching algorithms to calculate the similarities more efficiently and accurately. We propose a similarity measure that is based on the Longest Common Subsequence (LCSS) model. The experimental results on real and simulated data demonstrate the utility and efficiency of the proposed technique.

Full Text