Clustering Noisy Time Series

Anastasiia Yevhenivna Tkachenko,Liudmyla Olehivna Kyrychenko,Tamara Anatoliivna Radyvylova

doi:10.34185/1562-9945-3-122-2019-15

Abstract

One of the urgent tasks of machine learning is the problem of clustering objects. Clustering time series is used as an independent research technique, as well as part of more complex data mining methods, such as rule detection, classification, anomaly detection, etc.A comparative analysis of clustering noisy time series is carried out. The clustering sample contained time series of various types, among which there were atypical objects. Clustering was performed by k-means and DBSCAN methods using various distance functions for time series.A numerical experiment was conducted to investigate the application of the k-means and DBSCAN methods to model time series with additive white noise. The sample on which clustering was carried out consisted of m time series of various types: harmonic realizations, parabolic realizations, and “bursts”.The work was carried out clustering noisy time series of various types.DBSCAN and k-means methods with different distance functions were used. The best results were shown by the DBSCAN method with the Euclidean metric and the CID function.Analysis of the results of the clustering of time series allows determining the key differences between the methods: if you can determine the number of clusters and you do not need to separate atypical time series, the k-means method shows fairly good results; if there is no information on the number of clusters and there is a problem of isolating non-typical rows, it is advisable to use the DBSCAN method.

Highlights

Целью данной работы является проведение сравнительного анализа кластеризации зашумленных временных рядов с нетипичными объектами с использованием нескольких методов кластеризации и различных функций расстояния
Вначале мы задаем количество кластеров и соответствующие центроиды для каждого из них
Что мы можем получить в кластере такие объекты, которые на самом деле не являются близкими к их центроиду

Summary

КЛАСТЕРИЗАЦИЯ ЗАШУМЛЕННЫХ ВРЕМЕННЫХ РЯДОВ

Анализ результатов кластеризации временных рядов позволяет определить ключевые различия между методами: если можно определить количество кластеров и не требуется отделять нетипичные временные ряды, метод k-средних показывает довольно хорошие результаты; если нет информации о количестве кластеров и существует задача выделения нетипичных рядов, целесобразно использовать метод DBSCAN. Вынести такие объекты в отдельный кластер успешно получилось только с помощью метода DBSCAN, не смотря на то, что для в методе k-means одним из начальных центров задавался нетипичный объект. Среди выбранных метрик для сравнения временных рядов наилучшие результаты были получены с помощью метрики Эвклида с функцией CID.

Кластеризація зашумленних часових рядів

Clustering Noisy Time Series

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Clustering Noisy Time Series

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: System technologies

Lead the way for us

Journal: System technologies	Publication Date: Oct 10, 2019
License type: cc-by

Similar Papers

Automatic Classification of Time Series (ACTS): A new clustering method for remote sensing time series
N Viovy
International Journal of Remote Sensing | VOL. 21
N ViovyN Viovy
01 Jan 1999
International Journal of Remote Sensing | VOL. 21

Small-Scale Demographic Sequences Projection Based on Time Series Clustering and LSTM-RNN
Donglin Zhan ... Denglin Jiang
-
Donglin Zhan, et. al.Donglin Zhan ... Denglin Jiang
01 Nov 2018
01 Nov 2018

Editor's evaluation: Disentangling the rhythms of human activity in the built environment for airborne transmission risk: An analysis of large-scale mobility data
Niel Hens
-
Niel HensNiel Hens
10 Nov 2022
10 Nov 2022

Author response: Disentangling the rhythms of human activity in the built environment for airborne transmission risk: An analysis of large-scale mobility data
Zachary Susswein ... Shweta Bansal
-
Zachary Susswein, et. al.Zachary Susswein ... Shweta Bansal
30 Jan 2023
30 Jan 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Clustering Noisy Time Series

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: System technologies