Fast Exact Algorithm to Solve Continuous Similarity Search for Evolving Queries

Tomohiro Yamazaki,Hisashi Koga,Takahisa Toda

doi:10.1007/978-3-319-70145-5_7

Tomohiro Yamazaki, Hisashi Koga + Show 1 more

https://doi.org/10.1007/978-3-319-70145-5_7

Copy DOI

Export

Save

Cite

Publication Date: Jan 1, 2017

Citations: 2

Affiliation: University of Electro-Communications

Abstract
Full-Text
Similar Papers

Abstract

Listen

We study the continuous similarity search problem for evolving queries which has recently been formulated. Given a data stream and a database composed of n sets of items, the purpose of this problem is to maintain the top-k most similar sets to the query which evolves over time and consists of the latest W items in the data stream. For this problem, the previous exact algorithm adopts a pruning strategy which, at the present time T, decides the candidates of the top-k most similar sets from past similarity values and computes the similarity values only for them. This paper proposes a new exact algorithm which shortens the execution time by computing the similarity values only for sets whose similarity values at T can change from time \(T-1\). We identify such sets very fast with frequency-based inverted lists (FIL). Moreover, we derive the similarity values at T in O(1) time by updating the previous values computed at time \(T-1\). Experimentally, our exact algorithm runs faster than the previous exact algorithm by one order of magnitude.

Full Text