Automatic Text Summarization of Konkani Folk Tales Using Supervised Machine Learning Algorithms and Language Independent Features

Jovi D’Silva,Uzzal Sharma

doi:10.1080/03772063.2021.1987993

Abstract

Automatic text summarization is an emerging field of research in Natural Language Processing. This work is a novel attempt to include a low-resource language to the domain of Automatic Text Summarization. We use supervised machine learning algorithms to perform single document extractive automatic text summarization on documents in a low-resource language, Konkani. In particular, we propose using language independent features to train supervised machine learning algorithms using a Konkani dataset, specifically devised for the experimentation using books on Konkani folktale literature. We approach the automatic text summarization task as a binary classification problem, and the algorithms, once trained, classify the sentences based on their relevance to generate a summary. Thereafter, the performance of popular linear and non-linear supervised machine learning algorithms is evaluated using K-fold cross-validation. The summary generated by the systems is compared with human-generated summaries to verify its effectiveness. The results show that the linear models exhibit better performance in comparison with the non-linear models; however, all the models could beat the baselines. The output produced by the proposed methodology generates promising summaries without the need for any language-specific domain knowledge.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Automatic Text Summarization of Konkani Folk Tales Using Supervised Machine Learning Algorithms and Language Independent Features

Abstract

Talk to us

Similar Papers

More From: IETE Journal of Research

Lead the way for us

Journal: IETE Journal of Research	Publication Date: Oct 19, 2021
Citations: 3

Similar Papers

Survey on Extractive Text Summarization Methods with Multi-Document Datasets
P N Varalakshmi K ... Jagadish S Kallimani
-
P N Varalakshmi K, et. al.P N Varalakshmi K ... Jagadish S Kallimani
01 Sep 2018
01 Sep 2018

An Abstractive Summarization Technique with Variable Length Keywords as per Document Diversity
Muhammad Yahya Saeed ... Muhammad Arif Shah
Computers, Materials & Continua | VOL. 66
Muhammad Yahya Saeed, et. al.Muhammad Yahya Saeed ... Muhammad Arif Shah
01 Jan 2020
Computers, Materials & Continua | VOL. 66

Supervised Automatic Text Summarization of Konkani Texts Using Linear Regression-Based Feature Weighing and Language-Independent Features
Jovi D’Silva ... Uzzal Sharma
-
Jovi D’Silva, et. al.Jovi D’Silva ... Uzzal Sharma
27 Sep 2022
27 Sep 2022

Unsupervised Machine Learning Approach for Extractive Punjabi Text Summarization
Kamal Deep Garg ... Ambuj Kumar Agarwal
-
Kamal Deep Garg, et. al.Kamal Deep Garg ... Ambuj Kumar Agarwal
26 Aug 2021
26 Aug 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Automatic Text Summarization of Konkani Folk Tales Using Supervised Machine Learning Algorithms and Language Independent Features

Abstract

Talk to us

Similar Papers

More From: IETE Journal of Research