Abstract

The Linked Open Data Cloud is a project that uses RDF formalism to publish data in the form of a triple on the web under open licence. With the ever increasing amount of data sets available in the LOD Cloud, it is already beyond the human capability to integrate heterogeneous data manually. So far, the task of Linked Data fusion entails a significant amount of time owing to the large number of instances in the data sets from the LOD Cloud. In this paper, we suggest a new system to efficiently combine heterogeneous data from the LOD Cloud. First, we extract similar instances from the LOD Cloud to identify identical or related information. Then, our system collects all predicates and objects of the similar instances to construct a set of trees. Finally, we propose a genetic algorithm to merge data in the constructed trees. In the following, we give an overview of our system architecture and we detail our genetic algorithm. We also evaluate our system using real data sets showing that it can increase the completeness and the conciseness in data fusion. Moreover, we prove that our system is faster when fusing large data sets from the LOD Cloud.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.