Extractive multi-document summarization based on textual entailment and sentence compression via knapsack problem

Ali Naserasadi,Faramarz Sadeghi,Hamid Khosravi

doi:10.1017/s1351324918000414

Abstract

AbstractBy increasing the amount of data in computer networks, searching and finding suitable information will be harder for users. One of the most widespread forms of information on such networks are textual documents. So exploring these documents to get information about their content is difficult and sometimes impossible. Multi-document text summarization systems are an aid to producing a summary with a fixed and predefined length, while covering the maximum content of the input documents. This paper presents a novel method for multi-document extractive summarization based on textual entailment relations and sentence compression via formulating the problem as a knapsack problem. In this approach, sentences of documents are ranked according to the extended Tf-Idf method, then entailment scores of selected sentences are computed. Through these scores, the final score of each sentence is calculated. Finally, by decreasing the lengths of sentences via sentence compression, the problem has been solved by greedy and dynamic Programming approaches to the knapsack problem. Experiments on standard summarization datasets and evaluating the results based on the Rouge system show that the suggested method, according to the best of our knowledge, has increased F-measure of query-based summarization systems by two per cent and F-measure of general summarization systems by five per cent.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Extractive multi-document summarization based on textual entailment and sentence compression via knapsack problem

Abstract

Talk to us

Similar Papers

More From: Natural Language Engineering

Lead the way for us

Journal: Natural Language Engineering	Publication Date: Oct 31, 2018
Citations: 7

Similar Papers

Event-based Multi-document Summarization
Luís Carlos Dos Santos Marujo
ACM SIGIR Forum | VOL. 49
Luís Carlos Dos Santos MarujoLuís Carlos Dos Santos Marujo
29 Jan 2016
ACM SIGIR Forum | VOL. 49

An unsupervised method for extractive multi-document summarization based on centroid approach and sentence embeddings
Salima Lamsiyah ... Saïd El Alaoui Ouatik
Expert Systems with Applications | VOL. 167
Salima Lamsiyah, et. al.Salima Lamsiyah ... Saïd El Alaoui Ouatik
27 Oct 2020
Expert Systems with Applications | VOL. 167

A Fuzzy Approach for Sentences Relevance Assessment in Multi-document Summarization
Eduardo Valladares-Valdés ... Francisco P Romero
-
Eduardo Valladares-Valdés, et. al.Eduardo Valladares-Valdés ... Francisco P Romero
01 May 2019
01 May 2019

Extractive Multi-document Text Summarization Leveraging Hybrid Semantic Similarity Measures
Rajesh Bandaru ... Y Radhika
International Journal of Advanced Computer Science and Applications | VOL. 13
Rajesh Bandaru, et. al.Rajesh Bandaru ... Y Radhika
01 Jan 2021
International Journal of Advanced Computer Science and Applications | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Extractive multi-document summarization based on textual entailment and sentence compression via knapsack problem

Abstract

Talk to us

Similar Papers

More From: Natural Language Engineering