MuSe: a multi-level storage scheme for big RDF data using MapReduce

Tanvi Chawla,Girdhari Singh,Emmanuel S Pilli

doi:10.1186/s40537-021-00519-6

Tanvi Chawla, Girdhari Singh + Show 1 more

Open Access

https://doi.org/10.1186/s40537-021-00519-6

Copy DOI

Abstract

Resource Description Framework (RDF) model owing to its flexible structure is increasingly being used to represent Linked data. The rise in amount of Linked data and Knowledge graphs has resulted in an increase in the volume of RDF data. RDF is used to model metadata especially for social media domains where the data is linked. With the plethora of RDF data sources available on the Web, scalable RDF data management becomes a tedious task. In this paper, we present MuSe—an efficient distributed RDF storage scheme for storing and querying RDF data with Hadoop MapReduce. In MuSe, the Big RDF data is stored at two levels for answering the common triple patterns in SPARQL queries. MuSe considers the type of frequently occuring triple patterns and optimizes RDF storage to answer such triple patterns in minimum time. It accesses only the tables that are sufficient for answering a triple pattern instead of scanning the whole RDF dataset. The extensive experiments on two synthetic RDF datasets i.e. LUBM and WatDiv, show that MuSe outperforms the compared state-of-the art frameworks in terms of query execution time and scalability.

Highlights

IntroductionThe Linked data can be understood and accessed and; is represented by the Semantic Web [2]
Semantic Web is an outcome of the vision of W3C of a ‘Web of Linked data’ [1]
We have carried out extensive experiments on two popular Resource Description Framework (RDF) benchmark datasets i.e. Lehigh University Benchmark (LUBM) and Waterloo SPARQL Diversity Test Suite (WatDiv) to verify the efficiency and scalability of Multi‐Level Big RDF Storage Scheme (MuSe) and compared it with the state-of-the-art SHARD and PigSPARQL frameworks

Summary

Introduction

The Linked data can be understood and accessed and; is represented by the Semantic Web [2]. Semantic Web provides ease of access to all information available on the World Wide Web (WWW) and represents it in a format that is understandable to both humans and machines. The Semantic Web is being put to good use for information retrieval [3]. The technologies like Web Ontology Language (OWL), RDF, and SPARQL Protocol and RDF Query Language (SPARQL) empower Linked data [4, 5]. Semantic Web has established RDF as the standard model for data interchange. The flexible nature of RDF is a result of its underlying graph-based model that makes it a popular and standard choice for data interchange on the Semantic Web. RDF is a key data representation

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Big Data	Publication Date: Oct 9, 2021
Citations: 5	License type: open-access

R Discovery Prime

R Discovery Prime

MuSe: a multi-level storage scheme for big RDF data using MapReduce

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Big Data

Lead the way for us

Similar Papers

Benchmarking over a Semantic Repository
Pranjul Yadav ... Vinith Samala
-
Pranjul Yadav, et. al.Pranjul Yadav ... Vinith Samala
01 Dec 2010
01 Dec 2010

RDF packages: a scheme for efficient reasoning and querying over large‐scale RDF data
Shohei Ohsawa ... Toshiyuki Amagasa
International Journal of Web Information Systems | VOL. 8
Shohei Ohsawa, et. al.Shohei Ohsawa ... Toshiyuki Amagasa
15 Jun 2012
International Journal of Web Information Systems | VOL. 8

Fast Processing SPARQL Queries on Large RDF Data
Guang Yang ... Pingpeng Yuan
-
Guang Yang, et. al.Guang Yang ... Pingpeng Yuan
01 Aug 2016
01 Aug 2016

Aggregation Path Search using Multiple Large RDF Datasets with Equivalence Relations
Ken Kaneiwa ... Yuuki Yamanaka
Transactions of the Japanese Society for Artificial Intelligence | VOL. 38
Ken Kaneiwa, et. al.Ken Kaneiwa ... Yuuki Yamanaka
01 Mar 2023
Transactions of the Japanese Society for Artificial Intelligence | VOL. 38

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

MuSe: a multi-level storage scheme for big RDF data using MapReduce

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Big Data