A Highly Scalable Method for Extractive Text Summarization Using Convex Optimization

Claudiu Popescu, Lacrimioara Grama, Corneliu Rusu

doi:10.3390/sym13101824

Open Access

https://doi.org/10.3390/sym13101824

Abstract

The paper describes a convex optimization formulation of the extractive text summarization problem and a simple, scalable algorithm to solve it. The optimization program is constructed as a convex relaxation of an intuitive but computationally hard integer programming problem. The objective function is highly symmetric, being invariant under unitary transformations of the text representations. Another key idea is to replace the constraint on the number of sentences in the summary with a convex surrogate. To solve the program, we designed a specific projected gradient descent algorithm and analyzed its performance in terms of execution time and quality of the approximation. Using the DUC 2005 dataset and the Cornell Newsroom Summarization Dataset, we show empirically that the algorithm provides competitive results for single-document summarization and multi-document query-based summarization. On the Cornell Newsroom Summarization Dataset, it ranked second among the unsupervised methods tested. For the more challenging task of multi-document query-based summarization, the method was tested on the DUC 2005 dataset. Our algorithm surpassed the other reported methods with respect to the ROUGE-SU4 metric, and it was within 0.01 of the top-performing algorithms with respect to the ROUGE-1 and ROUGE-2 metrics.
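
The abstract does not reproduce the optimization program itself. As a generic illustration only, the relaxation pattern it describes might look like the following, assuming the convex surrogate is an ℓ1-norm bound (which the introduction's reference to the ℓ1 norm suggests but this page does not confirm). The symbols f, x, n, k, and τ are assumptions for illustration, not the paper's notation: x selects sentences, f is a convex objective, k is the sentence budget, and τ is the surrogate bound.

```latex
\begin{align*}
\text{integer program:}\quad
  & \min_{x \in \{0,1\}^{n}} f(x)
    \quad \text{s.t.} \quad \sum_{i=1}^{n} x_i \le k \\
\text{convex relaxation:}\quad
  & \min_{x \in \mathbb{R}^{n}} f(x)
    \quad \text{s.t.} \quad \lVert x \rVert_{1} \le \tau
\end{align*}
```

In this pattern the binary selection vector is relaxed to a real-valued one, and the count of selected sentences is replaced by a bound on the sum of absolute values, which keeps the feasible set convex.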

Highlights

The process of providing a concise, fluent, and accurate summary starting from a text document or a group of documents is called text summarization [1]
We proceed to evaluate the method on two different tasks: single document summarization and query-based multi-document summarization
We have proposed a new algorithm for extractive text summarization based on some simple and intuitive ideas, and we have tried to establish its properties and performance

Summary

Introduction

The process of providing a concise, fluent, and accurate summary starting from a text document or a group of documents is called text summarization [1]. One method to generate a summary is to extract and recombine the most relevant parts of the original text or texts. This process is known as extractive summarization, and our work focuses on this problem. The method described in this paper is based on minimizing a convex function subject to some constraints and on the properties of the ℓ1 norm [3]. The properties of this norm are well known, and it has many applications in signal processing (compressive sampling [4]) and statistics/machine learning (LASSO regression [5]). For the basic notions from convex optimization and signal processing used in this paper, we refer the reader to Appendix C and the references therein.
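
To make the idea of minimizing a convex function under an ℓ1 constraint more concrete, here is a minimal sketch of projected gradient descent in plain Python. It is not the paper's algorithm: the least-squares objective, the function names `project_l1_ball` and `projected_gradient_descent`, and the parameters `radius` and `step` are all assumptions chosen for illustration, since this page does not give the actual objective or projection. The projection uses the standard sort-and-threshold method for the ℓ1 ball.

```python
import math
from typing import List


def project_l1_ball(v: List[float], radius: float) -> List[float]:
    """Euclidean projection of v onto the set {x : ||x||_1 <= radius}."""
    if radius <= 0:
        raise ValueError("radius must be positive")
    abs_v = [abs(a) for a in v]
    if sum(abs_v) <= radius:
        return list(v)  # already feasible
    # Find the soft threshold theta with sum(max(|v_i| - theta, 0)) == radius.
    theta = 0.0
    cumsum = 0.0
    for j, u_j in enumerate(sorted(abs_v, reverse=True), start=1):
        cumsum += u_j
        candidate = (cumsum - radius) / j
        if u_j - candidate > 0:
            theta = candidate
    # Soft-threshold each entry, keeping its sign.
    return [math.copysign(max(abs(a) - theta, 0.0), a) for a in v]


def projected_gradient_descent(
    A: List[List[float]],
    b: List[float],
    radius: float,
    step: float,
    iters: int = 200,
) -> List[float]:
    """Minimize 0.5 * ||A x - b||^2 subject to ||x||_1 <= radius.

    Illustrative stand-in objective (an assumption): columns of A play the
    role of sentence representations and b the role of a document
    representation.
    """
    n = len(A[0])
    x = [0.0] * n
    for _ in range(iters):
        residual = [
            sum(a_ij * x_j for a_ij, x_j in zip(row, x)) - b_i
            for row, b_i in zip(A, b)
        ]
        grad = [
            sum(A[i][j] * residual[i] for i in range(len(A)))
            for j in range(n)
        ]
        x = project_l1_ball(
            [x_j - step * g_j for x_j, g_j in zip(x, grad)], radius
        )
    return x


# Hypothetical toy problem: identity "sentence" matrix and a target vector.
A = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
b = [3.0, 1.0, 0.2]
weights = projected_gradient_descent(A, b, radius=2.0, step=1.0)
print(weights)
```

On this toy problem the ℓ1 constraint drives the smaller weights to zero, which is the sparsity property that makes the ℓ1 norm a natural surrogate for limiting the number of selected sentences.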

Journal: Symmetry	Publication Date: Sep 30, 2021
Citations: 6	License type: CC BY 4.0

Similar Papers

Extractive Multi-document Text Summarization Leveraging Hybrid Semantic Similarity Measures
Rajesh Bandaru ... Y Radhika
International Journal of Advanced Computer Science and Applications | VOL. 13
01 Jan 2021

An unsupervised method for extractive multi-document summarization based on centroid approach and sentence embeddings
Salima Lamsiyah ... Saïd El Alaoui Ouatik
Expert Systems with Applications | VOL. 167
27 Oct 2020

Analysis of Sentence Scoring Methods for Extractive Automatic Text Summarization
Yogesh Kumar Meena ... Dinesh Gopalani
27 Oct 2014

An Indicator-based Multi-Objective Optimization Approach Applied to Extractive Multi-Document Text Summarization
Jesus M Sanchez-Gomez ... Miguel A Vega-Rodríguez
IEEE Latin America Transactions | VOL. 17
01 Aug 2019
