Abstract

Now-a-days, there are increasingly huge amount of user generated comments on the web. The user generated comments usually contains useful and essential information reflecting public’s or customers’ opinions. Since the information in the comments could be used for decision making, production or service improvement, and achieving user satisfaction, the systematic analysis of these comments is an essential need in so many domains including e-commerce, production, and social network analysis. However, the analysis of large volume of comments is a difficult and time-consuming task. Therefore, the need for a system which can convert this massive volume of comments to a useful and efficient summary is felt more and more. Text summarization leads to using more resources at higher speeds and getting richer information. According to numerous studies conducted in the field of multi-document summarization, few studies can be found that have been focused on the user generated comments in Persian language. In this paper, we propose a novel approach to summarize huge amount of comments in Persian, which is enough close to a human summarization. Our approach is based on semantic and lexical similarities and uses a graph-based summarization. We also propose a clustering to deal with multiple aspects (subjects) in a corpus of comments. According to the experiments, the summaries extracted by the proposed approach reached an average score of 8.75 out of 10, which improves the state-of-the-art summarizer’s score about 14 percent.

Highlights

  • With increasing number of user on Web 2 platforms like social networks, weblogs, and online review sites, the user generated comments is dramatically increasing

  • We proposed an approach to summarizing a huge corpus of user generated comments in Persian language

  • Our approach uses a clustering technique to deal with multiple aspects within comments

Read more

Summary

INTRODUCTION

With increasing number of user on Web 2 platforms like social networks, weblogs, and online review sites, the user generated comments is dramatically increasing. The user generated comments contain primordial and useful information about public’s opinion, social interactions, cultural events, customer’s satisfaction, market analysis, etc. These online comments affect the customers’ behavior and could be useful in decision making as well as improving the services or products. 2) Abstract summarization: In the abstract summarization, the structure of sentences are generally changed, this type of summary is an interpretation of the original text In this method, first, the system analyzes the text and expresses its perception of the text in the form of an understandable language for the user [12].

WORK STUDY
COMMENTS SUMMARIZATION
Preprocessing
Clustering
Graph Construction
EXPERIMENTS
Results
Comparing with Ijaz
Findings
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call