Abstract

Inspired by how humans summarize long documents, we propose an accurate and fast summarization model that first selects salient sentences and then rewrites them abstractively (i.e., compresses and paraphrases) to generate a concise overall summary. We use a novel sentence-level policy gradient method to bridge the non-differentiable computation between these two neural networks in a hierarchical way, while maintaining language fluency. Empirically, we achieve the new state-of-the-art on all metrics (including human evaluation) on the CNN/Daily Mail dataset, as well as significantly higher abstractiveness scores. Moreover, by first operating at the sentence-level and then the word-level, we enable parallel decoding of our neural generative model that results in substantially faster (10-20x) inference speed as well as 4x faster training convergence than previous long-paragraph encoder-decoder models. We also demonstrate the generalization of our model on the test-only DUC-2002 dataset, where we achieve higher scores than a state-of-the-art model.
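To make the select-then-rewrite decomposition concrete, below is a minimal, hedged Python sketch of the two-stage pipeline. Here `score_sentence` and `rewrite` are hypothetical stand-ins for the paper's extractor and abstractor networks, and the top-k ranking is a simplification of the actual extraction procedure; the point is only to show why the word-level rewriting step can be parallelized across sentences.

```python
from typing import Callable, List

def summarize(
    sentences: List[str],
    score_sentence: Callable[[str], float],  # extractor stand-in: salience score
    rewrite: Callable[[str], str],           # abstractor stand-in: compress/paraphrase
    k: int = 3,
) -> List[str]:
    """Select the k most salient sentences, then rewrite each one.

    Each selected sentence is rewritten independently, so this word-level
    decoding step can run in parallel across sentences -- the source of
    the inference speedup claimed above.
    """
    selected = sorted(sentences, key=score_sentence, reverse=True)[:k]
    return [rewrite(s) for s in selected]

# Toy usage with dummy components (sentence length as "salience",
# whitespace stripping as "rewriting"):
doc = ["Short filler.", "A long, highly salient sentence about the key results.",
       "Another brief aside."]
print(summarize(doc, score_sentence=len, rewrite=str.strip, k=1))
```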

Highlights

  • The task of document summarization has two main paradigms: extractive and abstractive

  • Abstractive models can be more concise by generating the summary from scratch, but they suffer from slow and inaccurate encoding of very long documents, since the attention model must look at all encoded words when decoding each generated summary word

  • Abstractive models also suffer from redundancy (repetitions), especially when generating multi-sentence summaries. To address both these issues and combine the advantages of both paradigms, we propose a hybrid extractive-abstractive architecture, with policy-based reinforcement learning (RL) to bridge the two networks

Summary

Introduction

The task of document summarization has two main paradigms: extractive and abstractive. The former directly chooses and outputs the salient sentences (or phrases) from the original document (Jing and McKeown, 2000; Knight and Marcu, 2000; Martins and Smith, 2009; Berg-Kirkpatrick et al., 2011). The latter generates the summary from scratch and can thus be more concise, but abstractive models suffer from redundancy (repetitions), especially when generating multi-sentence summaries. To address both these issues and combine the advantages of both paradigms, we propose a hybrid extractive-abstractive architecture, with policy-based reinforcement learning (RL) to bridge together the two networks.
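Since the extractor's sentence selection is a discrete, non-differentiable choice, it cannot be trained end-to-end by backpropagation alone. The hedged PyTorch sketch below illustrates the general idea of a sentence-level policy gradient (REINFORCE) bridge under simplifying assumptions: a single-sentence episode, a frozen abstractor, and a toy unigram-overlap `rouge_reward` standing in for a real ROUGE scorer. It is an illustration of the technique, not the paper's implementation.

```python
import torch

def rouge_reward(rewritten: str, reference: str) -> float:
    # Toy stand-in for ROUGE: unigram overlap with the reference summary.
    ref, hyp = set(reference.split()), set(rewritten.split())
    return len(ref & hyp) / max(len(ref), 1)

def reinforce_step(logits, sentences, rewrite, reference, baseline=0.0):
    """One sentence-level REINFORCE update for the extractor.

    logits   -- extractor scores over document sentences (requires grad)
    rewrite  -- the abstractor, treated as fixed during this update
    baseline -- variance-reducing baseline (e.g., a learned critic)
    """
    dist = torch.distributions.Categorical(logits=logits)
    idx = dist.sample()                       # non-differentiable selection
    summary = rewrite(sentences[int(idx)])    # abstractor rewrites the choice
    reward = rouge_reward(summary, reference)
    # Policy gradient: reinforce selections whose rewrites score well.
    loss = -dist.log_prob(idx) * (reward - baseline)
    loss.backward()
    return reward

# Toy usage with an identity "abstractor":
sents = ["cats sat on mats", "stocks fell sharply today", "the weather was mild"]
logits = torch.zeros(len(sents), requires_grad=True)
reward = reinforce_step(logits, sents, rewrite=lambda s: s,
                        reference="stocks fell today")
print("reward:", reward, "extractor gradient:", logits.grad)
```

Because the reward is computed on the abstractor's output rather than on the raw extracted sentence, the extractor learns to pick sentences that rewrite well, which is what couples the two networks despite the non-differentiable step.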
