Abstract

In modern information retrieval (IR) systems, scoring functions have been extensively adopted for sorting results. For a given document, the rank in sorted result lists with respect to hot searches can be considered as its influence. When a new document comes, can we use such IR systems to evaluate its influence before we insert it into the corpus? Such issue may not be solved very well by current IR systems with inverted indexes. In this paper, an influence measure based on documents’ global rank is proposed, and the inverted index structure has been extended by adding the position milestones for speeding up the ranking calculation. Moreover, a performance study using both real data and synthetic data verifies the effectiveness and the efficiency of our method.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call