Abstract

Nowadays, automatic multidocument text summarization systems can successfully retrieve the summary sentences from the input documents. But, it has many limitations such as inaccurate extraction to essential sentences, low coverage, poor coherence among the sentences, and redundancy. This paper introduces a new concept of timestamp approach with Naïve Bayesian Classification approach for multidocument text summarization. The timestamp provides the summary an ordered look, which achieves the coherent looking summary. It extracts the more relevant information from the multiple documents. Here, scoring strategy is also used to calculate the score for the words to obtain the word frequency. The higher linguistic quality is estimated in terms of readability and comprehensibility. In order to show the efficiency of the proposed method, this paper presents the comparison between the proposed methods with the existing MEAD algorithm. The timestamp procedure is also applied on the MEAD algorithm and the results are examined with the proposed method. The results show that the proposed method results in lesser time than the existing MEAD algorithm to execute the summarization process. Moreover, the proposed method results in better precision, recall, and F-score than the existing clustering with lexical chaining approach.

Highlights

  • Data mining is the domain in which rapid changes are evolved in the recent years due to the enormous advances in the software and hardware technology

  • This paper introduces an automatic text summarization approach to overcome the difficulties in the existing summarization approaches

  • This section discussed the experimental results obtained for the proposed Naıve Bayesian based multidocument summarization and the comparative results with the MEAD algorithm

Read more

Summary

Introduction

Data mining is the domain in which rapid changes are evolved in the recent years due to the enormous advances in the software and hardware technology. The advancement leads to the availability of various kinds of data, which is especially suitable for the instance of text data. The software and hardware platforms used for the social networks and web have facilitated the rapid generation of huge repositories of various types of data. The structured data are managed with the help of database system, whereas text data are generally managed by search engine due to the lack of structures. The search engine allows the web user to identify the necessary information from the collected works suitably with the help of keyword query. This paper focuses on developing a multidocument text summarization approach

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call