Ontario: Federated Query Processing Against a Semantic Data Lake

Kemele M Endris,Sören Auer,Philipp D Rohde,Maria-Esther Vidal

doi:10.1007/978-3-030-27615-7_29

Abstract

Data lakes enable flexible knowledge discovery and reduce the overhead of materialized data integration. Albeit effective for data storage, query execution over data lakes may be expensive, being demanded novel techniques to generate plans able to exploit the main characteristics of data lakes. We devise Ontario, a federated query processing approach tailored for large-scale heterogeneous data. Ontario provides efficient and effective query processing over a federation of heterogeneous data sources in a data lake. Ontario resorts to source descriptions named RDF Molecule Templates, i.e., abstract descriptions of the properties of the entities in a unified schema and their implementation in a data lake. We empirically evaluate the effectiveness of the Ontario optimization techniques over state-of-the-art benchmarks. The observed results suggest that Ontario can effectively select plans composed of subqueries that can be efficiently executed against heterogeneous data sources in a data lake.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Ontario: Federated Query Processing Against a Semantic Data Lake

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Personalised Exploration Graphs on Semantic Data Lakes
Ada Bagozi ... Michele Melchiori
-
Ada Bagozi, et. al.Ada Bagozi ... Michele Melchiori
01 Jan 2019
01 Jan 2019

Design an efficient data driven decision support system to predict flooding by analysing heterogeneous and multiple data sources using Data Lake
Sreepathy H V ... Mohan Kumar J
MethodsX | VOL. 11
Sreepathy H V, et. al.Sreepathy H V ... Mohan Kumar J
22 Jun 2023
MethodsX | VOL. 11

An Analysis of Confidentiality Issues in Data Lakes
João Luiz Monteiro Joaquim ... Ronaldo Dos Santos Mello
-
João Luiz Monteiro Joaquim, et. al.João Luiz Monteiro Joaquim ... Ronaldo Dos Santos Mello
30 Nov 2020
30 Nov 2020

Data Lakes: A Panacea for Big Data Problems, Cyber Safety Issues, and Enterprise Security
A N M Bazlur Rashid ... Mohiuddin Ahmed
-
A N M Bazlur Rashid, et. al.A N M Bazlur Rashid ... Mohiuddin Ahmed
25 Feb 2022
25 Feb 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Ontario: Federated Query Processing Against a Semantic Data Lake

Abstract

Talk to us

Similar Papers