Learning conflict resolution strategies for cross-language Wikipedia data fusion

Volha Bryl,Christian Bizer

doi:10.1145/2567948.2578999

Learning conflict resolution strategies for cross-language Wikipedia data fusion

Volha Bryl, Christian Bizer

https://doi.org/10.1145/2567948.2578999

Copy DOI

Export

Save

Cite

Publication Date: Apr 7, 2014

Citations: 44

Affiliation: University of Mannheim

#Data Fusion #Data Quality Assessment Framework #Fusion Framework #Tools For Data Integration #Large-scale Knowledge Base #Data Fusion Framework #Quality Assessment Framework #Data Integration #Large-scale Knowledge #Tools For Integration

Abstract
Full-Text
Similar Papers

Abstract

Listen

In order to efficiently use the ever growing amounts of structured data on the web, methods and tools for quality-aware data integration should be devised. In this paper we propose an approach to automatically learn the conflict resolution strategies, which is a crucial step in large-scale data integration. The approach is implemented as an extension of the Sieve data quality assessment and fusion framework. We apply and evaluate our approach on the use case of fusing data from 10 language editions of DBpedia, a large-scale structured knowledge base extracted from Wikipedia. We also propose a method for extracting rich provenance metadata for each DBpedia fact, which is later used in data fusion.

Full Text

Published Version

Check institute access

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.

R Discovery Prime

R Discovery Prime

Learning conflict resolution strategies for cross-language Wikipedia data fusion