Automatic multi-documents text summarization by a large-scale sparse multi-objective optimization algorithm

H Abo-Bakr,S A Mohamed

doi:10.1007/s40747-023-00967-y

Abstract

AbstractDue to the exponential overflow of textual information in various fields of knowledge and on the internet, it is very challenging to extract important information or to generate a summary from some multi-document collection in a specific field. With such a gigantic amount of textual content, human text summarization becomes impractical since it is expensive and consumes a lot of time and effort. So, developing automatic text summarization (ATS) systems is becoming increasingly essential. ATS approaches are either extractive or abstractive. The extractive approach is simpler and faster than the abstractive approach. This work proposes an extractive ATS system that aims to extract a small subset of sentences from a large multi-document text. First, the whole text is preprocessed by applying some natural language processing techniques such as sentences segmentation, words tokenization, removal of stop-words, and stemming to provide a structured representation of the original document collection. Based on this structured representation, the ATS problem is formulated as a multi-objective optimization (MOO) problem that optimizes the extracted summary to maintain the coverage of the main text content while avoiding redundant information. Secondly, an evolutionary sparse multi-objective algorithm is developed to solve the formulated large-scale MOO. The output of this algorithm is a set of non-dominated summaries (Pareto front). A novel criterion is proposed to select the target summary from the Pareto front. The proposed ATS system has been examined using (DUC) datasets, and the output summaries have been evaluated using (ROUGE) metrics and compared with the literature.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Complex & Intelligent Systems	Publication Date: Feb 2, 2023
Citations: 3	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Automatic multi-documents text summarization by a large-scale sparse multi-objective optimization algorithm

Abstract

Talk to us

Similar Papers

More From: Complex & Intelligent Systems

Lead the way for us

Similar Papers

The impact analysis of language differences on an automatic multilingual text summarization system
Fu Lee Wang ... Christopher C Yang
Journal of the American Society for Information Science and Technology | VOL. 57
Fu Lee Wang, et. al.Fu Lee Wang ... Christopher C Yang
01 Feb 2006
Journal of the American Society for Information Science and Technology | VOL. 57

The Design of Automatic Summarization of Indonesian Texts Using a Hybrid Approach
Kania Evita Dewi ... Nelly Indriani Widiastuti
Jurnal Teknologi Informasi dan Pendidikan | VOL. 15
Kania Evita Dewi, et. al.Kania Evita Dewi ... Nelly Indriani Widiastuti
13 May 2022
Jurnal Teknologi Informasi dan Pendidikan | VOL. 15

Extractive Text and Video Summarization using TF-IDF Algorithm
Ajinkya Gothankar ... Samiksha Nehe
International Journal for Research in Applied Science and Engineering Technology | VOL. 10
Ajinkya Gothankar, et. al.Ajinkya Gothankar ... Samiksha Nehe
31 Mar 2022
International Journal for Research in Applied Science and Engineering Technology | VOL. 10

An Evaluation of Automatic Text Summarization of News Articles: The Case of Three Online Arabic Text Summary Generators
Fahad M Alliheibi ... Nasser Al-Horais
International Journal of Advanced Computer Science and Applications | VOL. 12
Fahad M Alliheibi, et. al.Fahad M Alliheibi ... Nasser Al-Horais
01 Jan 2020
International Journal of Advanced Computer Science and Applications | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Automatic multi-documents text summarization by a large-scale sparse multi-objective optimization algorithm

Abstract

Talk to us

Similar Papers

More From: Complex &amp; Intelligent Systems

More From: Complex & Intelligent Systems