Abstract

The problem of replacement in web caches has received much attention in recent research, and it has been shown that the eviction rule replace the least recently used document performs poorly in web caches. Instead, it has been shown that using a combination of several criteria, such as the recentness and frequency of use, the size, and the cost of fetching a document, leads to a sizable improvement in hit rate and latency reduction. However, in order to implement these novel schemes, one needs to maintain complicated data structures. We propose randomized algorithms for approximating any existing web-cache replacement scheme and thereby avoid the need for any data structures.At document-replacement times, the randomized algorithm samples N documents from the cache and replaces the least useful from the sample, where usefulness is determined according to the criteria mentioned above. The next M N least useful documents are retained for the succeeding iteration. When the next replacement is to be performed, the algorithm obtains N --- M new samples from the cache and replaces the least useful from the N --- M new samples and the M previously retained. Using theory and simulations, we analyze the algorithm and find that it matches the performance of existing replacement schemes for values of N and M as low as 8 and 2 respectively. Interestingly, we find that retaining a small number of samples from one iteration to the next leads to an exponential improvement in performance as compared to retaining no samples at all.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.