Abstract

Existing text summarization methods rely mainly on the mapping between manually labeled reference summaries and the original text for feature extraction, often ignoring the internal structure and semantic features of the source document. As a result, summaries produced by existing models suffer from grammatical errors and semantic deviation from the original text. This paper aims to strengthen the model's attention to the inherent features of the source text so that it can more accurately identify the document's grammatical structure and semantics. To that end, we propose a model that combines a multi-head self-attention mechanism with a soft attention mechanism. An improved multi-head self-attention mechanism is introduced in the encoding stage so that correct syntactic and semantic information receives higher weight, making the generated summary more coherent and accurate. In addition, a pointer network is adopted and the coverage mechanism is improved to address out-of-vocabulary words and repetition when generating summaries. We validate the proposed model on the CNN/DailyMail dataset and evaluate it with the ROUGE metric. The experimental results show that the model improves the quality of the generated summaries compared with other models.
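The paper's exact encoder modifications are not reproduced here, but the core building block it improves upon is standard multi-head self-attention. The following is a minimal NumPy sketch of that mechanism; all weight matrices, dimensions, and the random input are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def multi_head_self_attention(X, Wq, Wk, Wv, Wo, num_heads):
    """Scaled dot-product self-attention split across several heads.

    X: (seq_len, d_model) token representations.
    Each head attends over the whole sequence independently,
    letting different heads capture different syntactic/semantic
    relations; outputs are concatenated and mixed by Wo.
    """
    seq_len, d_model = X.shape
    d_head = d_model // num_heads
    Q, K, V = X @ Wq, X @ Wk, X @ Wv

    def split(M):
        # (seq_len, d_model) -> (num_heads, seq_len, d_head)
        return M.reshape(seq_len, num_heads, d_head).transpose(1, 0, 2)

    Qh, Kh, Vh = split(Q), split(K), split(V)
    scores = Qh @ Kh.transpose(0, 2, 1) / np.sqrt(d_head)
    weights = softmax(scores, axis=-1)   # per-head attention distribution
    heads = weights @ Vh                 # (num_heads, seq_len, d_head)
    concat = heads.transpose(1, 0, 2).reshape(seq_len, d_model)
    return concat @ Wo

# Toy demonstration with random weights
rng = np.random.default_rng(0)
d_model, seq_len, num_heads = 8, 5, 2
X = rng.standard_normal((seq_len, d_model))
Wq, Wk, Wv, Wo = (rng.standard_normal((d_model, d_model)) for _ in range(4))
out = multi_head_self_attention(X, Wq, Wk, Wv, Wo, num_heads)
assert out.shape == (seq_len, d_model)
```

The paper's improvement reweights this mechanism so that tokens carrying correct syntactic and semantic information receive higher attention weight; the sketch above shows only the unmodified baseline computation.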

Highlights

  • The internet continuously generates large quantities of text data, and the problem of text information overload is becoming increasingly serious

  • Automatic text summarization extracts a paragraph of content from the original text or generates a paragraph of new content to summarize the main information of the original text

  • In the process of using the sequence-to-sequence model, the researchers found that the model can extract information from the original text, but the text summary generated by the model has out-of-vocabulary and word repetition problems



Introduction

The internet continuously generates large quantities of text data, and the problem of text information overload is becoming increasingly serious. The pointer-generator network uses the traditional soft attention mechanism, which cannot extract the varied semantic and grammatical information within the original text; as a result, the generated summary suffers from grammatical errors and semantic deviation from the original text.
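The pointer-generator framework the paper builds on mixes a generation distribution with a copy distribution, and its coverage mechanism tracks accumulated attention to discourage repetition. A minimal sketch of those two ideas follows; the function names, the tiny vocabulary, and the numbers are illustrative assumptions, not the paper's code.

```python
import numpy as np

def final_distribution(p_gen, vocab_dist, attention, src_ids, extended_vocab_size):
    """Pointer-generator output distribution.

    With probability p_gen the model generates from the fixed vocabulary;
    with probability (1 - p_gen) it copies a source token via attention.
    Source-side out-of-vocabulary tokens get ids beyond the fixed vocab,
    so they remain producible by copying.
    """
    final = np.zeros(extended_vocab_size)
    final[:len(vocab_dist)] = p_gen * vocab_dist
    for pos, tok in enumerate(src_ids):
        final[tok] += (1 - p_gen) * attention[pos]  # copy probability
    return final

def coverage_loss(attention, coverage):
    """Penalize attending again to positions already covered.

    coverage is the running sum of past attention distributions;
    the min() overlap is small only if attention moves to new positions.
    """
    return np.minimum(attention, coverage).sum()

# Toy example: fixed vocab of 4 words; the first source token (id 4) is OOV
vocab_dist = np.array([0.5, 0.2, 0.2, 0.1])
attention = np.array([0.6, 0.3, 0.1])
src_ids = [4, 1, 2]
p = final_distribution(p_gen=0.7, vocab_dist=vocab_dist,
                       attention=attention, src_ids=src_ids,
                       extended_vocab_size=5)
assert abs(p.sum() - 1.0) < 1e-9
assert p[4] > 0  # the OOV source token can still be produced by copying

coverage = np.zeros(3)
coverage += attention  # update running coverage after this decoding step
```

Because the final distribution places probability mass on source-only token ids, out-of-vocabulary words can appear in the summary, and the coverage penalty grows whenever the decoder re-attends to the same positions, which is what suppresses word repetition.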

