Abstract
With the advent of the information age, excessive information collection leads to information overload. Automatic text summarization technology has become an effective way to solve information overload. This paper proposes an automatic text summarization model, which extends traditional sequence-to-sequence (Seq2Seq) neural text summarization model by using a syntax-augmented encoder and a headline-aware decoder. The encoder encodes both syntactic structure and word information of a sentence in the sentence embedding. A hierarchical attention mechanism is proposed to pay attentions to syntactic units. The decoder is improved by a headline attention mechanism and a Dual-memory-cell LSTM network to achieve a higher quality of generated summaries. We designed experiments to compare the proposed method with baseline models on the CNN/DM datasets. The experiment results show that the proposed method is superior to abstractive baseline models in terms of the scores on ROUGE evaluation metrics, and achieve a summary generation performance comparable to the extractive baseline method. Though qualitative analysis, the summary quality of the propose method is more readable and less redundant, which agrees well with our intuition.
Highlights
A DVANCEMENTS in digital technologies have revolutionized the way information is produced and delivered, and people are confronted with overwhelming information every day, which is far beyond the range that people can efficiently handle and digest
A SYNTAX-AUGMENTED AND HEADLINE-AWARE TEXT SUMMARIZATION MODEL we introduce an abstractive Seq2Seq summarization model, which consists of a syntax-augmented encoder and a headline-aware decoder
Grams occurring in the reference summaries, and the numerator is the number of N-grams shared by a computergenerated summary and reference summaries
Summary
A DVANCEMENTS in digital technologies have revolutionized the way information is produced and delivered, and people are confronted with overwhelming information every day, which is far beyond the range that people can efficiently handle and digest. Extracting and understanding valuable information has become an urgent need, which bring into the birth of automatic text summarization technology. People may get the fist of texts in a shorter time by only reading text summaries. Limited by the computer performance at that time, only statistical information such as word frequency is used to extract sentences in the original text. Automatic text summarization began to attract attention of researches [3]
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.