Abstract
Data about the movements of diverse objects, including human beings, animals, and commodities, are collected in growing amounts as location-aware technologies become pervasive. Clustering has become an increasingly important analytical tool for revealing travel patterns from large-scale movement datasets. Most existing methods for origin-destination (OD) flow clustering focus on the geographic properties of an OD flow but ignore the temporal information preserved in the OD flow, which reflects the dynamic changes in the travel patterns over time. In addition, most methods require some predetermined parameters as inputs and are difficult to adjust considering the changes in the users’ demands. To overcome such limitations, we present a novel OD flow clustering method, namely, TOCOFC (Tree-based and Optimum Cut-based Origin-Destination Flow Clustering). A similarity measurement method is proposed to quantify the spatial similarity relationship between OD flows, and it can be extended to measure the spatiotemporal similarity between OD flows. By constructing a maximum spanning tree and splitting it into several unrelated parts, we effectively remove the noise in the flow data. Furthermore, a recursive two-way optimum cut-based method is utilized to partition the graph composed of OD flows into OD flow clusters. Moreover, a criterion called CSSC (Child tree/Child graph Self-Similarity Criterion) is formulated to determine if the clusters meet the output requirements. By modifying the parameters, TOCOFC can obtain clustering results for different time scales and spatial scales, which makes it possible to study movement patterns from a multiscale perspective. However, TOCOFC has the disadvantages of low efficiency and large memory consumption, and it is not conducive to quickly handling large-scale data. Compared with previous works, TOCOFC has a better clustering performance, which is reflected in the fact that TOCOFC can guarantee a balance between clusters and help to fully understand the corresponding patterns. Being able to perform the spatiotemporal clustering of OD flows is also a highlight of TOCOFC, which will help to capture the differences in the patterns at different times for a deeper analysis. Extensive experiments on both artificial spatial datasets and real-world spatiotemporal datasets have demonstrated the effectiveness and flexibility of TOCOFC.
Highlights
Origin-destination data, namely, OD flow data, that contain paired location information and temporal information, hold great potential to discover the links between two areas
To reveal the effects of the Child graph Self-Similarity Tolerance (CSST) and timeLimit parameters involved in Tree-based and Optimum Cut-based Origin-Destination Flow Clustering (TOCOFC) on the clustering results, we conduct a series of related comparison experiments on these datasets
The OD flows in the artificial spatial datasets DS1 and DS2 only include spatial information, so we use the spatial version of TOCOFC to experiment on DS1 and DS2
Summary
Origin-destination data, namely, OD flow data, that contain paired location information and temporal information, hold great potential to discover the links between two areas. Visual approaches such as flow maps [1,2] offer an easy way to analyze the mode of movements because of their intuitive nature. Boyandin [6] presented a new interactive visualization approach called “flowstrates”, in which the origins and destinations of the flows are displayed in two separate views, and the changes in the flow magnitudes over time are represented in a separate, central heat map view to support exploration in the temporal dimension. The OD matrix [7,8] and OD map [9] are useful tools for understanding the detailed patterns of OD flows
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.