Abstract

The digital age is making more datasets available through the Internet, but their interoperability is still limited. The Semantic Web should play a fundamental role in achieving interoperable datasets. The semantic exploitation of data requires its efficient transformation into semantic formats and the integration of heterogeneous sources. Existing tools for the semantic transformation of large volumes of data either have limited scalability or do not provide a semantics-rich representation of the data. The goal of this work was to show how scalable semantic data transformation processes can be designed and implemented, thereby addressing the first of these limitations. Here, we propose an application of high-performance computing techniques to overcome the scalability limitation. The proposed method was implemented as an upgrade of our Semantic Web Integration Tool (SWIT). Additional improvements for supporting the transformation process in SWIT are also described in this paper. We evaluated the new method by using three case studies from the areas of bioinformatics, movies and persons. The results showed a significant speed-up with respect to the original SWIT algorithm and the related tools. The lessons learnt in our work allowed us to configure semantic transformation processes efficiently.
