IFTA: Iterative filtering by using TF-AICL algorithm for Chinese encyclopedia knowledge refinement

Ting Wang,Zhuang Wu,Jiale Guo,Tiansheng Xu

doi:10.1007/s10489-021-02220-w

Abstract

The influence of inaccurate knowledge still exists in the Semantic Web. The problem of knowledge inaccuracy in Knowledge Bases (KBs) is one of the largest obstacles that limit the development of Linked Open Data (LOD) and Knowledge Graphs (KGs). To solve the semantic ambiguity and improper classification of knowledge triples in the process of constructing Chinese online encyclopedia KBs, first, a new TF-AICL algorithm is proposed to calculate the concentration level of predicates in each top-category. Second, the predicate which can best represent the features of a top-category is selected, and the related predicate candidate set is extracted. Third, based on the positive and negative examples counting strategy, the predicate candidate set is used as the comparison group to filter each entity. Finally, based on the TF-AICL algorithm, this paper proposes a new iterative filtering method called IFTA. IFTA adopts a new predicate feature extraction method, TF-AICL, which considers the hierarchical features of the predicate. In addition, IFTA can automatically prune, filter and refine large-scale online encyclopedia knowledge in an iterative way. The precision, recall and F-measure results on the BaiduBaike and Hudong datasets indicate that the refining effects on open-domain Chinese encyclopedia KBs by the IFTA method outperform the state-of-the-art methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

IFTA: Iterative filtering by using TF-AICL algorithm for Chinese encyclopedia knowledge refinement

Abstract

Talk to us

Similar Papers

More From: Applied Intelligence

Lead the way for us

Journal: Applied Intelligence	Publication Date: Feb 22, 2021
Citations: 9

Similar Papers

TriTag-NFPF: Knowledge Denoising for Chinese Encyclopedia based on Triple Tag-Constructed Potential Function
Ting Wang ... Jie Li
IEEE Access | VOL. 7
Ting Wang, et. al.Ting Wang ... Jie Li
01 Jan 2019
IEEE Access | VOL. 7

A scalable parallel Chinese online encyclopedia knowledge denoising method based on entry tags and Spark cluster
Ting Wang ... Jiale Guo
Applied Intelligence | VOL. 51
Ting Wang, et. al.Ting Wang ... Jiale Guo
20 Mar 2021
Applied Intelligence | VOL. 51

Semantic Interpretation and Integration of Open Data Tables
A Subramanian ... S Srinivasa
-
A Subramanian, et. al.A Subramanian ... S Srinivasa
01 Jan 2018
01 Jan 2018

Is dc:subject enough? A landscape on iconography and iconology statements of knowledge graphs in the semantic web
Sofia Baroncini ... Aldo Gangemi
Journal of Documentation | VOL. 79
Sofia Baroncini, et. al.Sofia Baroncini ... Aldo Gangemi
30 Mar 2023
Journal of Documentation | VOL. 79

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

IFTA: Iterative filtering by using TF-AICL algorithm for Chinese encyclopedia knowledge refinement

Abstract

Talk to us

Similar Papers

More From: Applied Intelligence