Learning to Discover Subsumptions between Software Engineering Concepts in Wikipedia

Xiang Dong,Beijun Shen,Jiangang Zhu,Kai Chen

doi:10.18293/seke2016-021

Abstract

Wikipedia contains a large number of concepts and rich semantic information. A number of knowledge base construction projects, such as WikiTaxonomy, DBpedia, and YAGO, have acquired data from Wikipedia. Despite the huge number of relations in Wikipedia, the semantic relations (i.e. subsumptions) between domain concepts are rather sparse, especially in the software engineering (SE) domain. Hence, it is difficult to derive a software engineering knowledge base directly from Wikipedia. Meanwhile, domain knowledge bases have become indispensable to a growing number of applications in software engineering, so discovering the missing semantic relations between software engineering concepts in Wikipedia is essential. In this paper, we propose an approach to automatically discover the missing subsumption relations between software engineering concepts. First, we extract the SE domain concepts from Wikipedia. Second, we design a machine-learning-based algorithm with some novel features to calculate the semantic relevancy between concepts. Third, we use a semi-supervised model to incorporate the features and discover the SE subsumptions. Experimental results show that our approach can effectively find the missing subsumption relations between software engineering concepts. Finally, we build a taxonomy that contains 193,593 concepts together with 357,662 subsumption relations. Compared with the taxonomies extracted from general-purpose knowledge bases such as WikiTaxonomy, YAGO, and Schema.org, our dataset has a larger scale in the software engineering domain.

Index Terms: Subsumption Extraction, Software Engineering, Wikipedia
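
The three steps described in the abstract can be illustrated with a toy sketch. It is not the paper's algorithm: the abstract does not name its features or its model, so the two features used here (a name-suffix match and link-set overlap), the nearest-centroid self-training loop, the `margin` parameter, and all of the data are assumptions chosen only to show how a few labeled seed pairs might be used to label further candidate (child, parent) pairs.

```python
from math import dist


def features(child, parent, links):
    """Two illustrative (assumed) features for a candidate (child, parent) pair."""
    c_tokens, p_tokens = child.lower().split(), parent.lower().split()
    # Feature 1: the parent's name is a proper suffix of the child's name,
    # e.g. "unit testing" ends with "testing".
    is_suffix = len(p_tokens) < len(c_tokens) and c_tokens[-len(p_tokens):] == p_tokens
    # Feature 2: Jaccard overlap of the two concepts' link sets.
    a, b = links.get(child, set()), links.get(parent, set())
    jaccard = len(a & b) / len(a | b) if (a | b) else 0.0
    return (1.0 if is_suffix else 0.0, jaccard)


def centroid(vectors):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    return tuple(sum(v[i] for v in vectors) / len(vectors) for i in range(len(vectors[0])))


def self_train(labeled, unlabeled, links, margin=0.2, max_rounds=5):
    """Grow the label set by repeatedly adopting confident predictions.

    labeled:   dict mapping (child, parent) -> bool; needs at least one
               positive and one negative seed.
    unlabeled: list of (child, parent) candidate pairs.
    """
    labeled = dict(labeled)
    pool = list(unlabeled)
    for _ in range(max_rounds):
        pos = centroid([features(c, p, links) for (c, p), y in labeled.items() if y])
        neg = centroid([features(c, p, links) for (c, p), y in labeled.items() if not y])
        confident = {}
        for pair in pool:
            x = features(pair[0], pair[1], links)
            d_pos, d_neg = dist(x, pos), dist(x, neg)
            # Adopt a label only when one centroid is clearly closer.
            if abs(d_pos - d_neg) >= margin:
                confident[pair] = d_pos < d_neg
        if not confident:
            break
        labeled.update(confident)
        pool = [pair for pair in pool if pair not in confident]
    return labeled


# Hypothetical toy data: concept -> set of linked concepts.
links = {
    "testing": {"test case", "bug", "assertion"},
    "unit testing": {"test case", "assertion", "mock object"},
    "regression testing": {"test case", "bug"},
    "compiler": {"parser", "lexer"},
    "scrum": {"sprint", "backlog"},
}
seeds = {("unit testing", "testing"): True, ("compiler", "testing"): False}
candidates = [("regression testing", "testing"), ("scrum", "testing")]

result = self_train(seeds, candidates, links)
for pair, is_subsumption in result.items():
    print(pair, is_subsumption)
```

A real system would need far richer features and a properly evaluated model; the point of the sketch is only the shape of the pipeline: candidate pairs, per-pair feature vectors, and a small labeled set that is extended iteratively.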
