The continuous production of heterogeneous internet of things (IoT) data created a new challenge, which is their storage and retrieve efficiently. In this work, a new method of indexing, called threshold distance (TD), is proposed in the fog‐cloud architecture. In this method, the fog layer is divided into two levels: a clustering level and an indexing level. In the clustering fog level, the first data flow is grouped into clusters of homogenous objects using the density‐based spatial clustering applications with noise (DBSCAN) algorithm. In the indexing fog level, objects in each cluster are indexed in separated generalized hyperplane‐trees (GHT). After the clustering of the next data flow, objects are inserted in existing GHT or new GHT are created according the comparison of the distances between centers of the next clusters and those of the first clusters (took as representatives of their corresponding indexes) with a threshold distance value. To test the efficiency of the TD method, the experimental results were compared with those of our second proposed method called creation of a new tree (CNT) in which, data of each cluster is indexed in a new GHT. The experimental results show that the efficiency of the TD method in GHT construction depend on the size of the data flow. For the parallel kNN search, the TD method proved its efficiency whatever the size of the data flow.
Read full abstract