Data Augmentation for Abstractive Query-Focused Multi-Document Summarization

Ramakanth Pasunuru,Mohit Bansal,Chenyan Xiong,Asli Celikyilmaz,Yizhe Zhang,Jianfeng Gao,Michel Galley

doi:10.1609/aaai.v35i15.17611

Abstract

The progress in Query-focused Multi-Document Summarization (QMDS) has been limited by the lack of sufficient largescale high-quality training datasets. We present two QMDS training datasets, which we construct using two data augmentation methods: (1) transferring the commonly used single-document CNN/Daily Mail summarization dataset to create the QMDSCNN dataset, and (2) mining search-query logs to create the QMDSIR dataset. These two datasets have complementary properties, i.e., QMDSCNN has real summaries but queries are simulated, while QMDSIR has real queries but simulated summaries. To cover both these real summary and query aspects, we build abstractive end-to-end neural network models on the combined datasets that yield new state-of-the-art transfer results on DUC datasets. We also introduce new hierarchical encoders that enable a more efficient encoding of the query together with multiple documents. Empirical results demonstrate that our data augmentation and encoding methods outperform baseline models on automatic metrics, as well as on human evaluations along multiple attributes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Data Augmentation for Abstractive Query-Focused Multi-Document Summarization

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: May 18, 2021
Citations: 18

Similar Papers

Improving classification performance of motor imagery BCI through EEG data augmentation with conditional generative adversarial networks
Sanghyun Choo ... Chang S Nam
Neural Networks | VOL. 180
Sanghyun Choo, et. al.Sanghyun Choo ... Chang S Nam
01 Aug 2024
Neural Networks | VOL. 180

Data Augmentation for Building Footprint Segmentation in SAR Images: An Empirical Study
Sandhi Wangiyana ... Artur Gromek
Remote Sensing | VOL. 14
Sandhi Wangiyana, et. al.Sandhi Wangiyana ... Artur Gromek
22 Apr 2022
Remote Sensing | VOL. 14

Understanding Data Augmentation in Neural Machine Translation: Two Perspectives towards Generalization
Guanlin Li ... Guoping Huang
-
Guanlin Li, et. al.Guanlin Li ... Guoping Huang
01 Jan 2019
01 Jan 2019

A General Multiple Data Augmentation Based Framework for Training Deep Neural Networks
Binyan Hu ... A K Qin
-
Binyan Hu, et. al.Binyan Hu ... A K Qin
18 Jul 2022
18 Jul 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Data Augmentation for Abstractive Query-Focused Multi-Document Summarization

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence