Abstract

In this paper, we propose a novel update summarization framework based on topic correlation analysis. The topics are first extracted from the two document sets provided in the task of update summarization by means of Latent Dirichlet Allocation (LDA) topic model. Then, the correlation between the new topics and the old topics are identified, based on which we further defined four categories of topic evolution patterns to capture the topic shift between the two document collections. We develop a new sentence ranking algorithm, i.e. CorrRank, which fully incorporates the topic evolution in the process of sentence ranking and sentence selection in update summarization. We choose the DUC 2008 and 2009 query-oriented multi-document update summarization tasks to examine the proposed model. Experimental results show the effectiveness of the LDA topic correlation analysis based update summarization framework.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.