Abstract

Before the advent of the World Wide Web, lack of information was a problem. But with the advent of the web today, we are faced with an explosive amount of information in every area of search. This extra information is troublesome and prevents a quick and correct decision. This is the problem of information overload. Multi-document summarization is an important solution for this problem by producing a brief summary containing the most important information from a set of documents in a short time. This summary should preserve the main concepts of the documents. When the input documents are related to a specific domain, for example, medicine or law, summarization faces more challenges. Domain-oriented summarization methods use special characteristics related to that domain to generate summaries. This paper introduces the purpose of multi-document summarization systems and discusses domain-oriented approaches. Various methods have been proposed by researchers for multi-document summarization. This survey reviews the categorizations that authors have made on multi-document summarization methods. We also categorize the multi-document summarization methods into six categories: machine learning, clustering, graph, Latent Dirichlet Allocation (LDA), optimization, and deep learning. We review the different methods presented in each of these groups. We also compare the advantages and disadvantages of these groups. We have discussed the standard datasets used in this field, evaluation measures, challenges and recommendations.

Highlights

  • Unlike in the past, there was often not enough data on every issue, but today we are faced with the issue of information overload

  • The deep Auto Encoder (AE) network is used for this purpose

  • Lakshmi and Rani provide a method for implementing multi-document summarization (MDS) using deep learning and fuzzy logic [35]

Read more

Summary

Introduction

There was often not enough data on every issue, but today we are faced with the issue of information overload. Obtaining the most important information from a huge amount of data is a timeconsuming and difficult task. Various fields such as natural language processing (NLP), text mining and artificial intelligence were used to provide a solution to this problem. Automatic text document summarization is an important solution to this problem. Text summarization obtains a short and compact gist that preserves the main concepts of the original text. The user can understand the concept of a long text in the form of a brief summary. Text summarization methods can be divided into several categories. From the summary producing perspective, summarization is divided into two categories: extractive and abstractive

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.