Abstract
Summarization is the task of compressing a piece of text into a short version that retains the main information of the original. While previous architecture choices revolved around Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM) recurrent neural networks, self-attention and the transformer have recently been applied to text generation and achieved very good results. However, the transformer lacks local attention information and uses simple calculations as address vectors. In this article, we put forward a new sequence-to-sequence model for generating text summaries called LTABS (Abstractive summarization on LSTM and Transformer), which makes the transformer more suitable for summary generation. In the encoder, we use the hidden-layer output of the LSTM as the position information, and at the same time the transformer serves as the attention matrix of the LSTM. This design allows the transformer to obtain position information and strengthens local attention. We add a copy mechanism to the network to address the out-of-vocabulary (OOV) problem in summarization. We apply our model to the CNN/Daily Mail and XSum datasets; the results show that the LTABS framework is superior to state-of-the-art models in semantic and syntactic structure, and achieves competitive results in manual language-quality assessment.
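A minimal sketch of the encoder idea described above (LSTM hidden states supplying position information to a transformer encoder), written in PyTorch. This is not the authors' released code; the class name, layer sizes, and the additive way of combining the LSTM output with the embeddings are illustrative assumptions.

```python
import torch
import torch.nn as nn

class LTABSEncoderSketch(nn.Module):
    """Illustrative encoder: LSTM hidden states replace fixed positional encodings."""
    def __init__(self, vocab_size, d_model=512, nhead=8, num_layers=6):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        # Bidirectional LSTM whose hidden states carry sequential/position
        # information (assumed combination scheme, not the paper's exact one).
        self.lstm = nn.LSTM(d_model, d_model // 2, batch_first=True,
                            bidirectional=True)
        encoder_layer = nn.TransformerEncoderLayer(d_model, nhead,
                                                   batch_first=True)
        self.transformer = nn.TransformerEncoder(encoder_layer, num_layers)

    def forward(self, src_ids):
        x = self.embed(src_ids)        # (batch, seq, d_model)
        lstm_out, _ = self.lstm(x)     # (batch, seq, d_model)
        # Inject the LSTM's sequential information into the transformer input
        # instead of adding sinusoidal positional encodings.
        return self.transformer(x + lstm_out)

# Usage sketch:
# enc = LTABSEncoderSketch(vocab_size=30000)
# hidden = enc(torch.randint(0, 30000, (2, 40)))   # (2, 40, 512)
```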