Multi-document summarization finds its application in many downstream information retrieval and natural language processing tasks. In the light of recent developments in social media data mining, Tweet summarization has emerged as a fundamental task of automatically detecting important keyphrases from a set of Tweets about current happenings. In the existing literature, the graph-based keyphrase extraction techniques are well-established unsupervised algorithms to capture summaries from dynamically evolving data. We argue that the traditional multi-tweet summarization technique may or may not capture user’s interest-specific keyphrases during tweet summarization. The nature of user-generated factual short-text is different from well-formed descriptive and perceptual long-text due to their repetitive nature. In this context, we introduce a simple yet effective interest-specific keyphrase extraction technique for tweet summarization as KEST: Key Extraction for Summarization of Tweets using Markov Decision Process (MDP). In this research work, we generate a path as evolving chain of highly interconnected words from sub-components in graph of words. We evaluate the effectiveness of our computationally, inexpensive, graph-based, abstractive keyphrase extraction approach over two datasets which we make publicly available.
Read full abstract