Abstract

A co-location pattern indicates a subset of spatial features whose instances are frequently located together in proximate geographical space. Most previous studies of spatial co-location pattern mining concern what percentage of instances per feature are involved in the table instance of a pattern, but neglect the heterogeneity in the number of feature instances and the distribution of instances. As a result, the deviation may be occurred in the interest measure of co-locations. In this article, we propose a novel mixed prevalence index (MPI) incorporating the effect of feature-level and instance-level heterogeneity on the prevalence measure, which can address some dilemmas in existing interest measures. Luckily, MPI possesses the partial antimonotone property. In virtue of this property, a branch-based search algorithm equipped with some optimizing strategies of MPI calculation is proposed, namely, Branch-Opt-MPI. Comprehensive experiments are conducted on both real and synthetic spatial datasets. Experimental results reveal the superiority of MPI compared to other interest measures and also validate the efficiency and scalability of the Branch-Opt-MPI. Particularly, the Branch-Opt-MPI performs more efficiently than baselines for several times or even orders of magnitude in dense data.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call