Abstract

Unprecedented amounts of spatio-temporal data instigates an urgent need for patterns exploration in it. Clustering analysis is useful in extracting patterns from big data by grouping similar data elements into clusters. Compared with one-way clustering and co-clustering methods, tri-clustering methods are more capable of exploring complex patterns. However, the explored patterns or clusters could be different due to varying temporal resolutions of input data. This study presents a tri-clustering based method to explore the impacts of different temporal resolutions on spatio-temporal clusters identified in geo-referenced time series (GTS), one type of spatio-temporal data. Dutch daily temperature data at 28 stations over 20 years was used to illustrate this study. The temperature data at daily, monthly, and yearly resolutions were subjected to the Bregman cube average tri-clustering algorithm with I-divergence (BCAT_I) to detect spatio-temporal clusters, which were then compared in terms of patterns exhibited, compositions, and changed elements. Results confirm the temporal resolution impacts on the spatio-temporal clusters identified in the Dutch temperature data: most compositions of clusters are varying when changing the temporal resolutions of input data in the GTS. Nevertheless, there is almost no change of elements in certain clusters (12 stations in the northeast of the country; years 1996, 2010) at all temporal resolutions, suggesting them as the “true” clusters in the case study dataset.

Highlights

  • IntroductionAlong with data sharing services has significantly promoted the accumulation of spatio-temporal data [1,2]

  • The advancement in data acquisition techniquesalong with data sharing services has significantly promoted the accumulation of spatio-temporal data [1,2]

  • Dutch temperature data: most compositions of clusters are varying when changing the temporal resolutions of input data in the geo-referenced time series (GTS)

Read more

Summary

Introduction

Along with data sharing services has significantly promoted the accumulation of spatio-temporal data [1,2] Such unprecedented amounts of data at multiple spatial and temporal resolutions instigates an urgent need for patterns exploration to obtain useful information in it [3,4]. As an important data mining task, clustering is useful for exploring patterns in GTS by assigning similar data elements into the same cluster and dissimilar elements into different ones [7,8]. As a result, it provides both an overview of data at cluster levels and investigation of details on single clusters [9,10]. According to the involved dimensions in the clustering analysis, clustering methods for GTS are categorized as one-way clustering, co-clustering, and tri-clustering methods [11,12]

Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.