Abstract

Under the constraints of the memory capacity of neural networks and document length, it is difficult to generate summaries with adequate salient information. In this work, a self-matching mechanism is incorporated into the extractive summarization system on the encoder side, which allows the encoder to optimize the encoded information at the global level and effectively improves the memory capacity of the conventional LSTM. Inspired by the human coarse-to-fine reading process, localness is modeled by a Gaussian bias to improve contextualization for each sentence and is merged into the self-matching energy. The refined self-matching mechanism not only establishes document-level global attention but also perceives associations with neighboring signals. On the decoder side, a pointer network performs two-hop attention over the context and the extraction state. Evaluations on the CNN/Daily Mail dataset verify that the proposed model outperforms strong baseline models with statistical significance.
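The Gaussian-biased self-matching step described above can be sketched roughly as follows. This is a minimal NumPy illustration, not the paper's implementation: the function name, the dot-product energy, the scaling by the square root of the dimension, and the fixed standard deviation `sigma` are all assumptions made for clarity.

```python
import numpy as np

def self_match_with_localness(H, sigma=2.0):
    """Sketch: self-matching attention over sentence vectors with a
    Gaussian localness bias (hypothetical parameterization).

    H     : (n, d) array of sentence representations.
    sigma : assumed width of the Gaussian localness window.
    """
    n, d = H.shape
    # Self-matching energy: scaled dot-product similarity of every
    # sentence pair, giving each sentence a global view of the document.
    energy = H @ H.T / np.sqrt(d)
    # Gaussian bias: penalize distant sentence pairs so each sentence
    # also attends to its neighborhood (coarse-to-fine localness).
    pos = np.arange(n)
    bias = -((pos[:, None] - pos[None, :]) ** 2) / (2.0 * sigma ** 2)
    energy = energy + bias
    # Row-wise softmax, then re-encode each sentence as a weighted
    # mixture of all sentences.
    weights = np.exp(energy - energy.max(axis=1, keepdims=True))
    weights /= weights.sum(axis=1, keepdims=True)
    return weights @ H
```

Because the bias is added before the softmax, it rescales the attention distribution multiplicatively, so global matches remain reachable while nearby sentences receive a consistent boost.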

Highlights

  • Automatic summarization systems have made great progress in many applications, such as headline generation [1], single- or multi-document summarization [2,3], opinion mining [4], and text categorization

  • Abstractive summarization is more difficult, as it must deal with factual or grammatical errors, semantic incoherence, and the difficulty of obtaining explicit textual paraphrases and generalizations. Extractive methods alleviate these problems by identifying important sentences in the document; summaries generated by extractive methods are generally better than those generated by abstractive methods in terms of grammaticality and factuality

  • All of our recall-oriented understanding for gisting evaluation (ROUGE) scores are reported by the official ROUGE script, with a 95% confidence interval of at most



Introduction

Automatic summarization systems have made great progress in many applications, such as headline generation [1], single- or multi-document summarization [2,3], opinion mining [4], and text categorization. Abstractive summarization is more difficult, as it must deal with factual or grammatical errors, semantic incoherence, and the difficulty of obtaining explicit textual paraphrases and generalizations. Extractive methods alleviate these problems by identifying important sentences in the document; summaries generated by extractive methods are generally better than those generated by abstractive methods in terms of grammaticality and factuality. However, extractive methods may encounter problems such as missing core information and incomplete generalization. With the advantages of simpler computation and higher generation efficiency, numerous empirical comparisons in recent years have shown that state-of-the-art extractive methods usually outperform abstractive ones [5].

