Using Semantic Web Resources for Data Quality Management

Christian Fürber,Martin Hepp

doi:10.1007/978-3-642-16438-5_15

Abstract

The quality of data is a critical factor for all kinds of decision-making and transaction processing. While there has been a lot of research on data quality in the past two decades, the topic has not yet received sufficient attention from the Semantic Web community. In this paper, we discuss (1) the data quality issues related to the growing amount of data available on the Semantic Web, (2) how data quality problems can be handled within the Semantic Web technology framework, namely using SPARQL on RDF representations, and (3) how Semantic Web reference data, e.g. from DBPedia, can be used to spot incorrect literal values and functional dependency violations. We show how this approach can be used for data quality management of public Semantic Web data and data stored in relational databases in closed settings alike. As part of our work, we developed generic SPARQL queries to identify (1) missing datatype properties or literal values, (2) illegal values, and (3) functional dependency violations. We argue that using Semantic Web datasets reduces the effort for data quality management substantially. As a use-case, we employ Geonames, a publicly available Semantic Web resource for geographical data, as a trusted reference for managing the quality of other data sources.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Using Semantic Web Resources for Data Quality Management

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Using SPARQL and SPIN for Data Quality Management on the Semantic Web
Christian Fürber ... Martin Hepp
-
Christian Fürber, et. al.Christian Fürber ... Martin Hepp
01 Jan 2009
01 Jan 2009

An Exploratory Study of RDF: A Data Model for Cloud Computing
A Clara Kanmani ... T Chockalingam
-
A Clara Kanmani, et. al.A Clara Kanmani ... T Chockalingam
01 Jan 2017
01 Jan 2017

Towards a vocabulary for data quality management in semantic web architectures
Christian Fürber ... Martin Hepp
-
Christian Fürber, et. al.Christian Fürber ... Martin Hepp
25 Mar 2011
25 Mar 2011

Effective and efficient semantic web data management over DB2
Li Ma ... Yue Pan
-
Li Ma, et. al.Li Ma ... Yue Pan
09 Jun 2008
09 Jun 2008

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Using Semantic Web Resources for Data Quality Management

Abstract

Talk to us

Similar Papers