CN-DBpedia: A Never-Ending Chinese Knowledge Extraction System

Bo Xu, Jiaqing Liang, Bin Liang, Chenhao Xie, Wanyun Cui, Yong Xu, Yanghua Xiao

doi:10.1007/978-3-319-60045-1_44

Abstract

Great effort has been dedicated to harvesting knowledge bases from online encyclopedias. These knowledge bases play an important role in enabling machines to understand text. However, most current knowledge bases are in English, and non-English knowledge bases, especially Chinese ones, are still very rare. Many previous systems that extract knowledge from online encyclopedias, although applicable to building a Chinese knowledge base, still suffer from two challenges. The first is that constructing an ontology and building a supervised knowledge extraction model require great human effort. The second is that knowledge bases are updated very slowly. To address these challenges, we propose a never-ending Chinese knowledge extraction system, CN-DBpedia, which can automatically generate a knowledge base that is ever-increasing in size and constantly updated. Specifically, we reduce human costs by reusing the ontology of existing knowledge bases and building an end-to-end fact extraction model. We further propose a smart active update strategy to keep our knowledge base fresh at little human cost. The 164 million API calls to the published services attest to the success of our system.

