Methods and Tools for Summarization of Entities and Facts in Knowledge Bases

Tomasz Tylenda

doi:10.22028/d291-26620

Abstract

Knowledge bases have become key assets for search and analytics over large document corpora. They are used in applications ranging from highly specialized tasks in bioinformatics to general purpose search engines. The large amount of structured knowledge they contain calls for effective summarization and ranking methods. The goal of this dissertation is to develop methods for automatic summarization of entities in knowledge bases, which also involves augmenting them with information about the importance of particular facts on entities of interest. We make two main contributions. First, we develop a method to generate a summary of information about an entity using the type information contained in a knowledge base. We call such a summary a semantic snippet. Our method relies on having importance information about types, which is external to the knowledge base. We show that such information can be obtained using human computing methods, such as Amazon Mechanical Turk, or extracted from the edit history of encyclopedic articles in Wikipedia. Our second contribution is linking facts to their occurrences in supplementary documents. Information retrieval on text uses the frequency of terms in a document to judge their importance. Such an approach, while natural, is difficult for facts extracted from text. This is because information extraction is only concerned with finding any occurrence of a fact. To overcome this limitation we propose linking known facts with all their occurrences in a process we call fact spotting. We develop two solutions to this problem and evaluate them on a real world corpus of biographical documents.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Methods and Tools for Summarization of Entities and Facts in Knowledge Bases

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

HOFD: An Outdated Fact Detector for Knowledge Bases
Shuang Hao ... Ning Wang
IEEE Transactions on Knowledge and Data Engineering | VOL. -
Shuang Hao, et. al.Shuang Hao ... Ning Wang
01 Jan 2023
IEEE Transactions on Knowledge and Data Engineering | VOL. -

Outdated Fact Detection in Knowledge Bases
Shuang Hao ... Chengliang Chai
-
Shuang Hao, et. al.Shuang Hao ... Chengliang Chai
01 Apr 2020
01 Apr 2020

Combining Word and Entity Embeddings for Entity Linking
Jose G Moreno ... Xavier Tannier
-
Jose G Moreno, et. al.Jose G Moreno ... Xavier Tannier
01 Jan 2017
01 Jan 2017

Improving Entity Disambiguation by Reasoning over a Knowledge Base
...
-
, et. al. ...
27 Jun 2022
27 Jun 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Methods and Tools for Summarization of Entities and Facts in Knowledge Bases

Abstract

Talk to us

Similar Papers