An Effective and Efficient MapReduce Algorithm for Computing BFS-Based Traversals of Large-Scale RDF Graphs

Alfredo Cuzzocrea,Mirel Cosulschi,Roberto De Virgilio

doi:10.3390/a9010007

Alfredo Cuzzocrea, Mirel Cosulschi + Show 1 more

Open Access

https://doi.org/10.3390/a9010007

Copy DOI

Abstract

Nowadays, a leading instance of big data is represented by Web data that lead to the definition of so-called big Web data. Indeed, extending beyond to a large number of critical applications (e.g., Web advertisement), these data expose several characteristics that clearly adhere to the well-known 3V properties (i.e., volume, velocity, variety). Resource Description Framework (RDF) is a significant formalism and language for the so-called Semantic Web, due to the fact that a very wide family of Web entities can be naturally modeled in a graph-shaped manner. In this context, RDF graphs play a first-class role, because they are widely used in the context of modern Web applications and systems, including the emerging context of social networks. When RDF graphs are defined on top of big (Web) data, they lead to the so-called large-scale RDF graphs, which reasonably populate the next-generation Semantic Web. In order to process such kind of big data, MapReduce, an open source computational framework specifically tailored to big data processing, has emerged during the last years as the reference implementation for this critical setting. In line with this trend, in this paper, we present an approach for efficiently implementing traversals of large-scale RDF graphs over MapReduce that is based on the Breadth First Search (BFS) strategy for visiting (RDF) graphs to be decomposed and processed according to the MapReduce framework. We demonstrate how such implementation speeds-up the analysis of RDF graphs with respect to competitor approaches. Experimental results clearly support our contributions.

Highlights

Big data describe data sets that grow so large that they become unpractical to be processed by traditional tools like database management systems, content management systems, advanced statistical analysis software, and so forth
In line with this trend, in this paper, we present an approach for efficiently implementing traversals of large-scale Resource Description Framework (RDF) graphs over MapReduce that is based on the Breadth
In this paper, we focus on the problem of computing traversals of RDF graphs, being computing traversals of such graphs very relevant in this area, e.g., for RDF graph analysis and mining over large-scale data sets

Summary

Introduction

Big data describe data sets that grow so large that they become unpractical to be processed by traditional tools like database management systems, content management systems, advanced statistical analysis software, and so forth. Our approach adopts the Breadth First Search (BFS) strategy for visiting (RDF) graphs to be decomposed and processed according to the MapReduce framework This with the goal of taking advantages from the powerful run-time support offered by MapReduce, reducing the computational overheads due to manage large RDF graphs efficiently. This approach is conceptually-sound, as similar initiatives have been adopted in the vest of viable solutions to the problem of managing large-scale sensor-network data over distributed environments (e.g., [16,17]), like for the case of Cloud infrastructures and their integration with sensor networks.

Hadoop

MapReduce

Different

The two RDF graphs corresponding to two different

Parallel BFS-Based and DFS-Based Traversal Strategies

MapReduce Algorithms for Big Data Processing

MapReduce Algorithms for RDF Databases

MapReduce Algorithms for RDF Graphs

BFS-Based Implementation of RDF Graph Traversals over MapReduce

Results

Hardware

Conclusions and Future Work

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Algorithms	Publication Date: Jan 11, 2016
Citations: 60	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

An Effective and Efficient MapReduce Algorithm for Computing BFS-Based Traversals of Large-Scale RDF Graphs

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Algorithms

Lead the way for us

Similar Papers

Canonical Forms for Isomorphic and Equivalent RDF Graphs
Aidan Hogan
ACM Transactions on the Web | VOL. 11
Aidan HoganAidan Hogan
25 Jul 2017
ACM Transactions on the Web | VOL. 11

RDF graph mining for cluster-based theme identification
Siham Eddamiri ... Elmoukhtar Zemmouri
International Journal of Web Information Systems | VOL. 16
Siham Eddamiri, et. al.Siham Eddamiri ... Elmoukhtar Zemmouri
17 Apr 2020
International Journal of Web Information Systems | VOL. 16

Aggregation Path Search using Multiple Large RDF Datasets with Equivalence Relations
Ken Kaneiwa ... Yuuki Yamanaka
Transactions of the Japanese Society for Artificial Intelligence | VOL. 38
Ken Kaneiwa, et. al.Ken Kaneiwa ... Yuuki Yamanaka
01 Mar 2023
Transactions of the Japanese Society for Artificial Intelligence | VOL. 38

WLeidenRDF: RDF Data Query Method based on Semantic-Enhanced Graph-Clustering Algorithm
Liu Yang ... Zhou Chen
-
Liu Yang, et. al.Liu Yang ... Zhou Chen
01 Dec 2020
01 Dec 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Effective and Efficient MapReduce Algorithm for Computing BFS-Based Traversals of Large-Scale RDF Graphs

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Algorithms