Align-then-abstract representation learning for low-resource summarization

Gianluca Moro, Luca Ragazzi

doi:10.1016/j.neucom.2023.126356

Abstract

Generative transformer-based models have achieved state-of-the-art performance in text summarization. Nevertheless, they still struggle in real-world scenarios with long documents when trained on only a few dozen labeled instances, a setting known as low-resource summarization (LRS). This paper bridges the gap by addressing two key research challenges in long document summarization, namely long-input processing and document representation, in one coherent model trained for LRS. Specifically, our novel align-then-abstract representation learning model (Athena) jointly trains a segmenter and a summarizer by maximizing the alignment between the chunk-target pairs produced by the text segmentation. Extensive experiments reveal that Athena outperforms the current state-of-the-art approaches in LRS on multiple long document summarization datasets from different domains.
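
To make the idea of chunk-target alignment concrete, here is a minimal sketch, not the method from the paper. It assumes the document is already split into chunks and pairs each target summary sentence with the chunk it overlaps most, using a simple unigram F1 score. The function names (unigram_f1, align_chunks_to_targets) and the greedy lexical heuristic are illustrative assumptions; Athena learns the segmentation and the alignment jointly with the summarizer.

```python
from collections import Counter


def unigram_f1(candidate: str, reference: str) -> float:
    """ROUGE-1-style F1 between two whitespace-tokenized strings."""
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    overlap = sum((cand & ref).values())  # clipped count of shared tokens
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


def align_chunks_to_targets(chunks, target_sentences):
    """Hypothetical helper: assign each target sentence to its best chunk.

    Returns (chunk, partial_target) pairs in chunk order. Chunks that
    attract no target sentence are dropped. Assumes `chunks` is non-empty.
    """
    assigned = {i: [] for i in range(len(chunks))}
    for sentence in target_sentences:
        # Greedy choice: the chunk with the highest lexical overlap wins.
        best = max(range(len(chunks)),
                   key=lambda i: unigram_f1(chunks[i], sentence))
        assigned[best].append(sentence)
    return [(chunks[i], " ".join(s)) for i, s in assigned.items() if s]


chunks = ["the cat sat on the mat", "stocks fell sharply on monday"]
targets = ["a cat sat", "stocks fell"]
pairs = align_chunks_to_targets(chunks, targets)
```

Each resulting pair could serve as one training instance for a summarizer, so a single long document yields several shorter inputs. That is one plausible reading of why chunk-level alignment helps when only a few dozen labeled documents are available.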

Journal: Neurocomputing
Publication Date: Jun 2, 2023
Citations: 5
License type: CC BY