An Efficient Algorithm for Mining Maximal Co-location Pattern Using Instance-trees

Dai Phong Le, Dang Hai Nguyen, Van Tuan Luu, Vanha Tran, Cao Dai Pham

doi:10.1109/nics54270.2021.9701511

Abstract

Mining prevalent co-location patterns, groups of spatial features whose instances frequently appear close together in geographic space, is one of the main branches of spatial data mining. As data volumes grow, discovering all prevalent patterns produces highly redundant results. Maximal co-location patterns (MCPs) are a compressed representation of all prevalent patterns and offer new insight into the interactions among spatial features, helping to extract more valuable knowledge from data sets. However, the growing volume of spatial data keeps MCP discovery very challenging. This study is dedicated to designing an efficient MCP mining algorithm. First, the features in size-2 patterns are modeled as a sparse graph, and MCP candidates are generated by enumerating the maximal cliques of this graph. Second, we design two instance-tree structures, a star neighbor-based instance-tree and a sibling node-based instance-tree, to store the neighbor relationships of instances. All maximal co-location instances of the candidates are obtained efficiently from these structures. Finally, an MCP candidate is marked as prevalent if its participation index, computed from its maximal co-location instances, is not smaller than a user-specified minimum prevalence threshold. Comparisons with previous algorithms on both synthetic and real data sets demonstrate the efficiency of the proposed algorithm.
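The candidate-generation and prevalence-test steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that two features are joined by an edge whenever they form a size-2 pattern, uses the standard Bron-Kerbosch procedure with pivoting as a stand-in for whatever clique enumeration the paper uses, and computes the participation index by its usual definition (the minimum, over the features of a pattern, of the fraction of that feature's instances that take part in the pattern). The instance-trees are not reproduced: the co-location instances are passed in directly as `row_instances`. All function and parameter names are hypothetical.

```python
from collections import defaultdict


def feature_graph(size2_patterns):
    """Build an undirected feature graph from size-2 patterns (assumed edge rule)."""
    adj = defaultdict(set)
    for a, b in size2_patterns:
        adj[a].add(b)
        adj[b].add(a)
    return dict(adj)


def maximal_cliques(adj):
    """Enumerate maximal cliques with Bron-Kerbosch (pivoting variant)."""
    cliques = []

    def expand(r, p, x):
        if not p and not x:
            cliques.append(frozenset(r))  # r cannot be extended: it is maximal
            return
        # Pivot on the vertex with the most neighbors in p to prune branches.
        pivot = max(p | x, key=lambda v: len(adj[v] & p))
        for v in list(p - adj[pivot]):
            expand(r | {v}, p & adj[v], x & adj[v])
            p = p - {v}
            x = x | {v}

    expand(set(), set(adj), set())
    return cliques


def participation_index(candidate, row_instances, feature_counts):
    """Minimum participation ratio over the features of the candidate.

    row_instances: list of dicts mapping feature -> instance id, one dict per
    co-location instance of the candidate (hypothetical input format).
    feature_counts: total number of instances of each feature in the data set.
    """
    ratios = []
    for f in candidate:
        participating = {row[f] for row in row_instances}
        ratios.append(len(participating) / feature_counts[f])
    return min(ratios)


# Toy data: size-2 patterns over features A, B, C, D.
size2 = [("A", "B"), ("A", "C"), ("B", "C"), ("C", "D")]
candidates = maximal_cliques(feature_graph(size2))

# Two co-location instances of the candidate {A, B, C}.
rows = [{"A": 1, "B": 1, "C": 1}, {"A": 1, "B": 2, "C": 1}]
counts = {"A": 2, "B": 2, "C": 4}
pi = participation_index(("A", "B", "C"), rows, counts)
min_prev = 0.2  # user-specified minimum prevalence threshold (toy value)
is_prevalent = pi >= min_prev
```

On this toy input the candidates are {A, B, C} and {C, D}, and the participation index of {A, B, C} is 0.25, so the candidate passes the toy threshold of 0.2.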

Full Text