Abstract
Lexical taxonomies are widely used to foster information retrieval and exchange in several domains and applications. When there are multiple taxonomies, heterogeneity among them is a severe problem for efficient collaboration processes. In this paper, we propose WETA, a domain-independent, knowledge-poor method for automatic taxonomy alignment via word embeddings. WETA associates all the leaf terms of the origin taxonomy to one or many concepts in the destination taxonomy, employing a scoring function, which merges the score of a hierarchical method based on cosine similarity and the score of a classification task. WETA is developed in the context of an EU Grant aiming at bridging the national taxonomies of EU countries towards the European Skills, Competences, Qualifications and Occupations taxonomy (ESCO) using AI Algorithms. The results, validated within the EU project activities for bridging the Italian occupation taxonomy CP and ESCO, confirm the usefulness of WETA in supporting the automatic alignment of national labor taxonomies. WETA reaches a 0.8 accuracy on recommending top-5 occupations and a wMRR of 0.72. WETA reduces the human effort needed for building a mapping from scratch: it would allow domain experts to concentrate on the validation task and decrease the incoherence due to multiple judgments. It would also make the approach reproducible and transparent to policymakers.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.