A Hybrid Approach Combining R*-Tree and k-d Trees to Improve Linked Open Data Query Performance

Yuxiang Sun,Seulgi Yoon,Yongju Lee,Tianyi Zhao

doi:10.3390/app11052405

Abstract

Semantic Web has recently gained traction with the use of Linked Open Data (LOD) on the Web. Although numerous state-of-the-art methodologies, standards, and technologies are applicable to the LOD cloud, many issues persist. Because the LOD cloud is based on graph-based resource description framework (RDF) triples and the SPARQL query language, we cannot directly adopt traditional techniques employed for database management systems or distributed computing systems. This paper addresses how the LOD cloud can be efficiently organized, retrieved, and evaluated. We propose a novel hybrid approach that combines the index and live exploration approaches for improved LOD join query performance. Using a two-step index structure combining a disk-based 3D R*-tree with the extended multidimensional histogram and flash memory-based k-d trees, we can efficiently discover interlinked data distributed across multiple resources. Because this method rapidly prunes numerous false hits, the performance of join query processing is remarkably improved. We also propose a hot-cold segment identification algorithm to identify regions of high interest. The proposed method is compared with existing popular methods on real RDF datasets. Results indicate that our method outperforms the existing methods because it can quickly obtain target results by reducing unnecessary data scanning and reduce the amount of main memory required to load filtering results.

Highlights

The evolution of the Linked Open Data (LOD) cloud has made a strong wave of research approaches in Big Data [1]
We propose an efficient join query algorithm based on the two-step index structure for various SPARQL query types and a hot-cold segment identification algorithm that determines regions of high interest
Spurred by efforts such as the LOD project [22], large amounts of semantic data are published in the resource description framework (RDF) format in several diverse fields such as publishing, life sciences, social networking, internet of things (IOT), and healthcare

Summary

Introduction

The evolution of the Linked Open Data (LOD) cloud has made a strong wave of research approaches in Big Data [1]. The second approach is based on accessing distributed data on the fly using a recursive URI lookup process; we call this the live exploration approach This approach performs queries over multiple SPARQL endpoints offed by publishers for their LOD datasets [4]. This approach has several advantages, such as synchronizing copied data is not required, searching is more dynamic with up-to-date data, and new resources can be added without a time lag for indexing and integrating data.

Overview of Linked Open Data

Hybrid Storage Structure

Related Work

Two-Step SPARQL Query Processing

Performance of Hot-Cold Segment Identification Method

Conclusions and Future Work

Findings

22. Linking Open Data

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied sciences	Publication Date: Mar 8, 2021
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A Hybrid Approach Combining R*-Tree and k-d Trees to Improve Linked Open Data Query Performance

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied sciences

Lead the way for us

Similar Papers

Linked Open Data for Learning Object Discovery: Adaptive e-Learning Systems
Burasakorn Yoosooka ... Vilas Wuwongse
-
Burasakorn Yoosooka, et. al.Burasakorn Yoosooka ... Vilas Wuwongse
01 Nov 2011
01 Nov 2011

Linked Open Data for Context-aware Services: Analysis, Classification and Context Data Discovery
Moritz Von Hoffen ... Abdulbaki Uzun
International Journal of Semantic Computing | VOL. 08
Moritz Von Hoffen, et. al.Moritz Von Hoffen ... Abdulbaki Uzun
01 Dec 2014
International Journal of Semantic Computing | VOL. 08

Analyzing the Applicability of the Linking Open Data Cloud for Context-Aware Services
Moritz Von Hoffen ... Abdulbaki Uzun
-
Moritz Von Hoffen, et. al.Moritz Von Hoffen ... Abdulbaki Uzun
01 Jun 2014
01 Jun 2014

Mid-Ontology Learning from Linked Data
Lihua Zhao ... Ryutaro Ichise
-
Lihua Zhao, et. al.Lihua Zhao ... Ryutaro Ichise
01 Jan 2012
01 Jan 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Hybrid Approach Combining R*-Tree and k-d Trees to Improve Linked Open Data Query Performance

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied sciences