Post-Editing Extractive Summaries by Definiteness Prediction

Jad Kabbara, Jackie Chi Kit Cheung

doi:10.18653/v1/2021.findings-emnlp.312

Abstract

Extractive summarization has been the mainstay of automatic summarization for decades. Despite all the progress, extractive summarizers still suffer from shortcomings including coreference issues arising from extracting sentences away from their original context in the source document. This affects the coherence and readability of extractive summaries. In this work, we propose a lightweight post-editing step for extractive summaries that centers around a single linguistic decision: the definiteness of noun phrases. We conduct human evaluation studies that show that human expert judges substantially prefer the output of our proposed system over the original summaries. Moreover, based on an automatic evaluation study, we provide evidence for our system’s ability to generate linguistic decisions that lead to improved extractive summaries. We also draw insights about how the automatic system is exploiting some local cues related to the writing style of the main article texts or summary texts to make the decisions, rather than reasoning about the contexts pragmatically.

Highlights

Source Text: The school had to deal with a suspicious package received early in the morning
Original Extractive Summary: The school had to deal with a suspicious package received early in the morning
Post-Edited Pseudo-Extractive Summary: The school had to deal with a suspicious package received early in the morning

Summary

Definiteness Prediction

[Figure: system pipeline. Source Document → Pre-trained Extractive Summarizer → Summary → Definiteness Prediction → Modified Summary]

Given a document D = {s1, ..., sn} with n sentences, a pre-trained extractive summarizer, f, generates a summary S = f(D) ⊂ D, with the length of S [...]. The generated summary is passed to a post-editing step in which decisions are made regarding the definiteness of noun phrases (NPs). A definiteness prediction model g generates a modified summary S' = g(S), which we refer to as a pseudo-extractive summary.

For the second step of predicting the definiteness of NPs, we adopt the methodology of Kabbara. The prediction for each NP is one of three classes: "the", "a" (or "an") and "none". To compare the performance of different learning models on this task, we explore the use of a logistic regression model and a BERT-based (Devlin et al., 2019) neural model, which has shown strong performance across a wide range of NLP tasks (Rogers et al., 2020).
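The post-editing step can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `apply_definiteness` and `post_edit` are hypothetical, the noun phrases are assumed to be already chunked into token lists, and the predictor passed in is a stand-in for the model g (the logistic regression and BERT-based models themselves are not reproduced here). The sketch only shows how a predicted class from the three-way label set could be applied to an NP.

```python
from typing import Callable, List

ARTICLES = {"the", "a", "an"}
LABELS = ("the", "a", "none")  # "a" also covers "an", as in the three-class setup
VOWELS = set("aeiou")


def apply_definiteness(np_tokens: List[str], label: str) -> List[str]:
    """Rewrite the leading article of one noun phrase to match a predicted label.

    Hypothetical helper: casing and punctuation are ignored for brevity.
    """
    if label not in LABELS:
        raise ValueError(f"unknown label: {label}")
    # Drop an existing leading article so the prediction alone decides the outcome.
    if np_tokens and np_tokens[0].lower() in ARTICLES:
        head = list(np_tokens[1:])
    else:
        head = list(np_tokens)
    if label == "none" or not head:
        return head
    if label == "the":
        return ["the"] + head
    # Crude a/an choice by first letter; real usage depends on pronunciation.
    article = "an" if head[0][:1].lower() in VOWELS else "a"
    return [article] + head


def post_edit(
    summary_nps: List[List[str]],
    predict: Callable[[List[str]], str],
) -> List[List[str]]:
    """Apply a definiteness predictor (a stand-in for g) to every NP of a summary."""
    return [apply_definiteness(np, predict(np)) for np in summary_nps]
```

For example, `apply_definiteness(["a", "suspicious", "package"], "the")` yields `["the", "suspicious", "package"]`. In a full system the `predict` callable would be a trained classifier that sees the NP in its summary context rather than the NP tokens alone.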

Model Description

Experimental Setup

Datasets

Input Representation

Training Details

Study 1

Study 2

Methodology

Results

Analyzing the Hyperparameters Effect on Model Performance

23 Oct 2021