On macro- and micro-level information in multiple documents and its influence on summarization

Jiaming Zhan, Han Tong Loh, Ying Liu

doi:10.1016/j.ijinfomgt.2008.04.011

Abstract

A well-known challenge for multi-document summarization (MDS) is that a single best or “gold standard” summary does not exist; that is, it is often difficult to secure a consensus among reference summaries written by different authors. This motivates us to study what the “important information” in multiple input documents is that guides different authors in writing a summary. In this paper, we propose the notions of macro- and micro-level information. Macro-level information refers to the salient topics shared among different input documents, while micro-level information consists of the sentences that elaborate on those salient topics or provide complementary details for them. Experimental studies were conducted to examine the influence of macro- and micro-level information on summarization and its evaluation. Results showed that human subjects relied heavily on macro-level information when writing a summary. The length allowed for summaries is the leading factor affecting summary agreement. Meanwhile, our summarization evaluation approach, based on the proposed macro- and micro-structure information, also suggested that micro-level information offers complementary details for macro-level information. We believe that both levels of information form the “important information” that affects the modeling and evaluation of automatic summarization systems.

Journal: International Journal of Information Management
Publication Date: Feb 1, 2009
Citations: 6
