Page Cache Research Articles

The following problem arose in connection with studies of Internet web page caching. The general setting is as follows: In some fixed metric space M , k “servers” S1, . . . , Sk are given with some arbitrary initial locations in M . Requests for service at certain points σ1, σ2, σ3, . . . , σN , in M arrive over time. Immediately after request σt is received, exactly one of several mutually exclusive actions must be taken: (i) Some server is moved to σt, with a resulting cost of c(σt), the “cost” of the point σt. (ii) No server moves. In this case, the cost for “no service” is defined to be mink d(Sk, σt), where d(x, y) denotes the distance between x and y in M . A further feature of our model is that two parameters u,w ≥ 0 are specified, which are used as follows. Before having to decide how to service request σt, the servers have at their disposal the knowledge of the u+ w requests σi with t− u ≤ i ≤ t+ w − 1. Thus, the servers can only ”remember” or store the past u requests σt−u, σt−u+1, . . . , σt−1 but are allowed to know the w future requests σt, σt+1, . . . , σt+w−1 before having to service σt. The rules which govern the choices made for servicing all the σt define some algorithm A. In this model, A is deterministic and can only depend on the values of the σi which it currently knows, and nothing else. In particular, A is not allowed to make probabilistic choices based on some source of randomness. We denote by A(σ), the cost of servicing the request sequence σ = (σ1, . . . , σN ). Of course, if we are allowed to know all the σt before having to act, it is very likely the cost of servicing σ can be decreased. Let us denote by OFF(σ) the minimum possible cost of ∗University of California, San Diego †Research supported in part by NSF Grant No. DMS 98-01446 ‡Research supported in part by Bell Communications Research, Morristown, New Jersey §University of California, San Diego ¶AT&T Labs, Florham Park, New Jersey

Shared Web Caches allow multiple clients to quickly access a pool of popular web pages. An organization that provides shared caching to its web clients will typically have a collection of shared caches rather than a single cache. If a collection of shared caches is used, it is required to coordinate the caches so that all cached pages in the collection are shared among the clients of the organization. In this paper, two protocol schemes for coordinating the collection of shared caches are investigated. The first scheme is based on Internet Caching Protocol (ICP) (D. Wessels, K. Claffy, ICP and the Squid WebCache, National Laboratory for Applied Network Research, http://www.nlanr.net/(wessels/papers/icp-squid.ps.gz). In the ICP scheme, the web caches query other caches for the web pages and fetch the web pages from the neighbors if they have cached the requested page. The second scheme is the hash routing scheme in which the client (browser) has to find the hash value for the URL of the requested page and send the request to the corresponding cache server (D. Wessels, K. Claffy, IEEE Network magazine, November–December (1997); G. Thalur, C.V. Ravishankar, IEEE/ACM Trans. Networking (1997); V. Valloppillil, J. Cohen, Hierarchical HTTP Routing Protocol, Internet Draft, http://ww.nlanr.net/Cache/ICP/draft-vinod-icp-traffic-dist00.txt; Super proxy script, How to make Distributed Proxy Servers by URL Hashing, White Paper, http://naragw.sharp.co.jp/sps, August (1996)). These two schemes have been implemented, and compared with respect to the page retrieval latency and the adaptability of the cache servers when a peer cache server fails. Our analysis shows that the hash routing schemes have significant performance advantages over ICP with respect to the average latency under normal conditions but when failure rate of the cache server is significant the ICP provides good adaptability. Also, we observe that the hashing function used in the hash routing scheme must have certain features such as quick calculation of the hash value and uniform distribution of the web pages (among cache servers).

Page Cache Research Articles

Related Topics

Articles published on Page Cache

A web page usage prediction scheme using sequence indexing and clustering techniques

Improving energy efficiency for flash memory based embedded applications

Controlling Control Flow in Web Applications

SWL

Cooperative Client-Side File Caching for MPI Applications

User data persistence in physical memory

Protect your users against the latest web-based threat: malicious code on caching servers [Your Internet Connection

Caching and prefetching algorithms for programs with looping reference patterns

Bridging the generation gap

Multitiered Cache Management and Acceleration for Database-Driven Websites

Mining interesting knowledge from Web-log

The influence of caching on web usage mining

On Staleness and the Delivery of Web Pages

Optimizing Traffic in DSM Clusters: Fine-Grain Memory Caching versus Page Migration/ Replication

The bigwig Project

Dynamic location problems with limited look-ahead

Dynamic data prefetching in home-based software DSMs

Implementation and comparison of distributed caching schemes

Improving performance of large physically indexed caches by decoupling memory addresses from cache addresses

Lookahead scheduling requests for multisize page caching

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Page Cache Research Articles

Related Topics

Articles published on Page Cache

A web page usage prediction scheme using sequence indexing and clustering techniques

Improving energy efficiency for flash memory based embedded applications

Controlling Control Flow in Web Applications

SWL

Cooperative Client-Side File Caching for MPI Applications

User data persistence in physical memory

Protect your users against the latest web-based threat: malicious code on caching servers [Your Internet Connection

Caching and prefetching algorithms for programs with looping reference patterns

Bridging the generation gap

Multitiered Cache Management and Acceleration for Database-Driven Websites

Mining interesting knowledge from Web-log

The influence of caching on web usage mining

On Staleness and the Delivery of Web Pages

Optimizing Traffic in DSM Clusters: Fine-Grain Memory Caching versus Page Migration/ Replication

The bigwig Project

Dynamic location problems with limited look-ahead

Dynamic data prefetching in home-based software DSMs

Implementation and comparison of distributed caching schemes

Improving performance of large physically indexed caches by decoupling memory addresses from cache addresses

Lookahead scheduling requests for multisize page caching