W-tree

Bruno T Ávila,Rafael D Lins

doi:10.1145/2835181

Abstract

World Wide Web applications need to use, constantly update, and maintain large webgraphs for executing several tasks, such as calculating the web impact factor, finding hubs and authorities, performing link analysis by webometrics tools, and ranking webpages by web search engines. Such webgraphs need to use a large amount of main memory, and, frequently, they do not completely fit in, even if compressed. Therefore, applications require the use of external memory. This article presents a new compact representation for webgraphs, called w-tree , which is designed specifically for external memory. It supports the execution of basic queries (e.g., full read, random read, and batch random read), set-oriented queries (e.g., superset, subset, equality, overlap, range, inlink, and co-inlink), and some advanced queries, such as edge reciprocal and hub and authority. Furthermore, a new layout tree designed specifically for webgraphs is also proposed, reducing the overall storage cost and allowing the random read query to be performed with an asymptotically faster runtime in the worst case. To validate the advantages of the w-tree, a series of experiments are performed to assess an implementation of the w-tree comparing it to a compact main memory representation. The results obtained show that w-tree is competitive in compression time and rate and in query time, which may execute several orders of magnitude faster for set-oriented queries than its competitors. The results provide empirical evidence that it is feasible to use a compact external memory representation for webgraphs in real applications, contradicting the previous assumptions made by several researchers.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

W-tree

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on the Web

Lead the way for us

Similar Papers

An efficient variable selection method based on the use of external memory in ant colony optimization. Application to QSAR/QSPR studies
Mojtaba Shamsipur ... Morteza Akhond
Analytica Chimica Acta | VOL. 646
Mojtaba Shamsipur, et. al.Mojtaba Shamsipur ... Morteza Akhond
13 May 2009
Analytica Chimica Acta | VOL. 646

Optimization of address-based data sorting unit with external memory support
Dmitri Mihhailov ... Valery Sklyarov
-
Dmitri Mihhailov, et. al.Dmitri Mihhailov ... Valery Sklyarov
28 Jun 2013
28 Jun 2013

Trends in Remote User Authentication Based on Smart Card and External Memory
Bello Alhaji Buhari ... Afolayan Ayodele Obiniyi
International Journal of Security and Privacy in Pervasive Computing | VOL. 14
Bello Alhaji Buhari, et. al.Bello Alhaji Buhari ... Afolayan Ayodele Obiniyi
19 Aug 2022
International Journal of Security and Privacy in Pervasive Computing | VOL. 14

RUBIK
Eleni Tzirita Zacharatou ... Anastasia Ailamaki
-
Eleni Tzirita Zacharatou, et. al.Eleni Tzirita Zacharatou ... Anastasia Ailamaki
29 Jun 2015
29 Jun 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

W-tree

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on the Web