Extractive Multi-Document Summarization Model Based On Different Integrations of Double Similarity Measures

Dheyaa Abdulameer Mohammed,Nasreen J Kadhim

doi:10.24996/ijs.2020.61.6.30

Dheyaa Abdulameer Mohammed, Nasreen J Kadhim

Open Access

https://doi.org/10.24996/ijs.2020.61.6.30

Copy DOI

Journal: IRAQI JOURNAL OF SCIENCE	Publication Date: Jun 27, 2020
License type: cc-by

Affiliation: University of Baghdad

Abstract

Currently, the prominence of automatic multi document summarization task belongs to the information rapid increasing on the Internet. Automatic document summarization technology is progressing and may offer a solution to the problem of information overload.  Automatic text summarization system has the challenge of producing a high quality summary. In this study, the design of generic text summarization model based on sentence extraction has been redirected into a more semantic measure reflecting individually the two significant objectives: content coverage and diversity when generating summaries from multiple documents as an explicit optimization model. The proposed two models have been then coupled and defined as a single-objective optimization problem. Also, for improving the performance of the proposed model, different integrations concerning two similarity measures have been introduced and applied to the proposed model along with the single similarity measures that are based on using Cosine, Dice and similarity measures for measuring text similarity. For solving the proposed model, Genetic Algorithm (GA) has been used. Document sets supplied by Document Understanding Conference 2002 ( ) have been used for the proposed system as an evaluation dataset. Also, as an evaluation metric, Recall-Oriented Understudy for Gisting Evaluation ( ) toolkit has been used for performance evaluation of the proposed method. Experimental results have illustrated the positive impact of measuring text similarity using double integration of similarity measures against single similarity measure when applied to the proposed model wherein the best performance in terms of and has been recorded for the integration of Cosine similarity and similarity.

Highlights

One of the most important challenges facing humans today is the rapid increase in the amount of data generated by users, especially those on the Internet
The main contribution of this paper is to model the multi-document text summarization task as an optimization problem
The proposed method was able to obtain high accuracy and improve the performance compared with the current techniques

Summary

Introduction

One of the most important challenges facing humans today is the rapid increase in the amount of data generated by users, especially those on the Internet. For solving the discrete optimization problem in their work, they created an adaptive Differential Evolution algorithm They implemented their model on the task of multi-document summarization. ALGULIEV et al (2011) proposed an unsupervised model for text summarization which performs generation to a summary by means of an extraction to the significant sentences in given document(s). They modeled TS as an integer linear programming problem. In ALGULIEV et al (2013), a model based on optimization for generic text summarization has been proposed Their proposed model generated a summary through performing an extraction of significant sentences from documents. The optimization problem was solved through developing an adaptive differential evolution algorithm with a new mutation approach [10]

Preliminaries

The proposed model: definitions and formulations

The proposed similarity integrations

Requirements and parameter setting

Evaluation metrics

System performance

Conclusions

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Extractive Multi-Document Summarization Model Based On Different Integrations of Double Similarity Measures

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IRAQI JOURNAL OF SCIENCE

Lead the way for us

Similar Papers

Improving Extractive Multi-Document Text Summarization Through Multi-Objective Optimization
...
IRAQI JOURNAL OF SCIENCE | VOL. 59
, et. al. ...
01 Nov 2018
IRAQI JOURNAL OF SCIENCE | VOL. 59

The impact analysis of language differences on an automatic multilingual text summarization system
Fu Lee Wang ... Christopher C Yang
Journal of the American Society for Information Science and Technology | VOL. 57
Fu Lee Wang, et. al.Fu Lee Wang ... Christopher C Yang
01 Feb 2006
Journal of the American Society for Information Science and Technology | VOL. 57

Automatic multi-documents text summarization by a large-scale sparse multi-objective optimization algorithm
H Abo-Bakr ... S A Mohamed
Complex & Intelligent Systems | VOL. 9
H Abo-Bakr, et. al.H Abo-Bakr ... S A Mohamed
02 Feb 2023
Complex & Intelligent Systems | VOL. 9

A novel automatic text summarization system with feature terms identification
Suneetha Manne ... Shaik Mohammed Zaheer Pervez
-
Suneetha Manne, et. al.Suneetha Manne ... Shaik Mohammed Zaheer Pervez
01 Dec 2011
01 Dec 2011

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Extractive Multi-Document Summarization Model Based On Different Integrations of Double Similarity Measures

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IRAQI JOURNAL OF SCIENCE