Comparative Document Summarisation via Classification

Umanga Bista, Minjeong Shin, Lexing Xie, Alexander Mathews, Aditya Krishna Menon

doi:10.1609/aaai.v33i01.330120

Abstract

This paper considers extractive summarisation in a comparative setting: given two or more document groups (e.g., separated by publication time), the goal is to select a small number of documents that are representative of each group, and also maximally distinguishable from other groups. We formulate a set of new objective functions for this problem that connect recent literature on document summarisation, interpretable machine learning, and data subset selection. In particular, by casting the problem as a binary classification amongst different groups, we derive objectives based on the notion of maximum mean discrepancy, as well as a simple yet effective gradient-based optimisation strategy. Our new formulation allows scalable evaluations of comparative summarisation as a classification task, both automatically and via crowd-sourcing. To this end, we evaluate comparative summarisation methods on a newly curated collection of controversial news topics over 13 months. We observe that gradient-based optimisation outperforms discrete and baseline approaches in 15 out of 24 different automatic evaluation settings. In crowd-sourced evaluations, summaries from gradient optimisation elicit 7% more accurate classification from human workers than discrete optimisation. Our result contrasts with recent literature on submodular data subset selection that favours discrete optimisation. We posit that our formulation of comparative summarisation will prove useful in a diverse range of use cases such as comparing content sources, authors, related topics, or distinct viewpoints.
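
To make the maximum mean discrepancy (MMD) idea in the abstract concrete, the sketch below computes a biased estimate of squared MMD between two sets of document vectors and greedily picks a summary that stays close to its own group while staying far from the other group. This is an illustration only, not the authors' objective or optimiser: the RBF kernel, the trade-off weight `lam`, the greedy (discrete) selection, and all function names are assumptions made for this example.

```python
import math

def rbf(x, y, gamma=1.0):
    # RBF kernel between two vectors (kernel choice is an assumption here).
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, y)))

def mean_kernel(X, Y, gamma=1.0):
    # Average kernel value over all pairs (x, y) with x in X and y in Y.
    return sum(rbf(x, y, gamma) for x in X for y in Y) / (len(X) * len(Y))

def mmd2(X, Y, gamma=1.0):
    # Biased estimate of squared MMD between samples X and Y.
    return (mean_kernel(X, X, gamma) + mean_kernel(Y, Y, gamma)
            - 2 * mean_kernel(X, Y, gamma))

def greedy_summary(group, others, k, lam=1.0, gamma=1.0):
    # Hypothetical helper: pick k indices from `group` (requires k <= len(group)),
    # rewarding distance from `others` and penalising distance from `group`.
    chosen = []
    for _ in range(k):
        candidates = [i for i in range(len(group)) if i not in chosen]

        def score(i):
            summary = [group[j] for j in chosen + [i]]
            return lam * mmd2(summary, others, gamma) - mmd2(summary, group, gamma)

        chosen.append(max(candidates, key=score))
    return chosen

# Toy document vectors for two groups (made-up values, not data from the paper).
group_a = [[0.0, 0.0], [0.1, 0.0], [0.0, 0.1]]
group_b = [[5.0, 5.0], [5.1, 5.0]]
picked = greedy_summary(group_a, group_b, k=2)
```

Here `picked` holds the indices of the selected documents in `group_a`. The abstract reports that gradient-based optimisation outperformed discrete selection in most of the automatic evaluation settings, so the greedy loop above should be read as the simplest baseline form of the idea, not as the recommended strategy.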


Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Jul 17, 2019
Citations: 6

