Abstract

Automated information retrieval and text summarization concept is a difficult process in natural language processing because of the infrequent structure and high complexity of the documents. The text summarization process creates a summary by paraphrasing a long text. Earlier models on information retrieval and summarization are based on a massive labeled dataset by the use of handcrafted features, leveraging on knowledge for a particular domain, and concentrated on the narrow sub-domain to improve efficiency. This paper presents a new deep learning (DL) based information retrieval with a text summarization model. The proposed model involves three major processes namely information retrieval, template generation, and text summarization. Initially, the bidirectional long short term memory (BiLSTM) approach is employed for retrieving the textual data, which assumes each word in a sentence, extracts the information, and embeds it into the semantic vector. Next, the template generation process takes place using the DL model. The deep belief network (DBN) model is employed as a text summarization tool to summarize the textual content. In addition, the image description is generated for the visualized entities that exist in the images. The design of BiLSTM with the DBN model for the text summarization and image captioning process shows the novelty of the work. The performance of the presented method is validated using Giga word corpus and DUC corpus. The experimental results referred that the proposed DBN model outperformed the compared methods with the maximum precision, recall and F-score. The image captions are compared with a predefined set of captions that exists for the image and the performance is evaluated using the BLEU metric.

Highlights

  • In recent times, retrieving data from voluminous content is highly a tedious process because of the rapid development of blogs, articles, reports, and so forth

  • The DUC-2003 corpus is too small to be trained by deep neural networks (DNN)

  • This paper has developed a new deep learning (DL) based information retrieval with a text summarization model

Read more

Summary

INTRODUCTION

In recent times, retrieving data from voluminous content is highly a tedious process because of the rapid development of blogs, articles, reports, and so forth. Abstractive text summarization is a novel approach that has gained maximum attention from the developers as it is capable to produce new words by applying language generationapproaches. This frameworkprovidesthe summary by paraphrasing the text which is composed of original meaning in the content [2]. The abstractive text summarization is learning models find useful in several application areas such highly effective in retaining the semantic coherence but it fails as human activity recognition [4], weather prediction [5], to confirm the syntactic structure of produced summary. Model outperformed the related methods with the maximum precision, recall, and F-score

LITERATURE REVIEW
THE PROPOSED MODEL
Bi-LSTM based information retrieval and template generation
DBN based Text Summarization
Image Caption Generation using deep learning techniques
RESULTS AND DISCUSSION
RESULT
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call