Maximum Marginal Relevance and Vector Space Model for Summarizing Students' Final Project Abstracts

Gunawan Gunawan,Kimiya Fujisawa,Fitria Fitria,Esther Irawati Setiawan

doi:10.17977/um018v6i12023p57-68

Abstract

Automatic summarization is reducing a text document with a computer program to create a summary that retains the essential parts of the original document. Automatic summarization is necessary to deal with information overload, and the amount of data is increasing. A summary is needed to get the contents of the article briefly. A summary is an effective way to present extended information in a concise form of the main contents of an article, and the aim is to tell the reader the essence of a central idea. The simple concept of a summary is to take an essential part of the entire contents of the article. Which then presents it back in summary form. The steps in this research will start with the user selecting or searching for text documents that will be summarized with keywords in the abstract as a query. The proposed approach performs text preprocessing for documents: sentence breaking, case folding, word tokenizing, filtering, and stemming. The results of the preprocessed text are weighted by term frequency-inverse document frequency (tf-idf), then weighted for query relevance using the vector space model and sentence similarity using cosine similarity. The next stage is maximum marginal relevance for sentence extraction. The proposed approach provides comprehensive summarization compared with another approach. The test results are compared with manual summaries, which produce an average precision of 88%, recall of 61%, and f-measure of 70%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Maximum Marginal Relevance and Vector Space Model for Summarizing Students' Final Project Abstracts

Abstract

Talk to us

Similar Papers

More From: Knowledge Engineering and Data Science

Lead the way for us

Journal: Knowledge Engineering and Data Science	Publication Date: Aug 1, 2023
License type: CC BY-SA 4.0

Similar Papers

Automatic Text Summarization using Maximum Marginal Relevance for Health Ethics Protocol Document in Bahasa
Doni Putra Purbawa ... Ratih Nur Esti Anggraini
-
Doni Putra Purbawa, et. al.Doni Putra Purbawa ... Ratih Nur Esti Anggraini
20 Oct 2021
20 Oct 2021

Automatic Extractive Text Summarization for Indonesian News Articles Using Maximal Marginal Relevance and Non-Negative Matrix Factorization
Inggar Riyandi Musyaffanto ... Guntur Budi Herwanto
-
Inggar Riyandi Musyaffanto, et. al.Inggar Riyandi Musyaffanto ... Guntur Budi Herwanto
01 Jul 2019
01 Jul 2019

The Use of MMR and Diversity-Based Reranking in Document Reranking and Summarization
...
-
, et. al. ...
30 Jun 2018
30 Jun 2018

Two-Level Text Summarization Using Topic Modeling
Dhannuri Saikumar ... P Subathra
-
Dhannuri Saikumar, et. al.Dhannuri Saikumar ... P Subathra
11 Aug 2020
11 Aug 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Maximum Marginal Relevance and Vector Space Model for Summarizing Students' Final Project Abstracts

Abstract

Talk to us

Similar Papers

More From: Knowledge Engineering and Data Science