Abstract

Schema matching plays an important role in many database applications, such as ontology merging, data integration, data warehouse and dataspaces. The problem of schema matching is to find the semantic correspondence between attributes of schemas to be matched. In this paper, we propose multi-schema matching based on clustering techniques. Traditional matching techniques mainly address matching tasks between two attributes, namely pairwise-attribute correspondence. However, there exist lots of applications that require the semantic correspondence among multiple attributes. Thus, we will focus on matching multiple attributes, which is more difficult than pairwise-attribute correspondence. We employ the clustering techniques to solve the multi-schema matching problem. We use the well-known TFIDF weighting method to convert each attribute in schemas to a point in the vector space model. Then, these attributes can be partitioned into different clusters each of which has a specific semantics topic. Finally, the attributes partitioned into the same cluster are similar with higher confidence. We validate our approach with an experimental study, the results of which demonstrate that our approach is effective and has good performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.