Abstract
The volume of data created, captured, copied, and consumed worldwide increased from 2 zettabytes in 2010 to over 97 zettabytes in 2020, and is estimated to reach 181 zettabytes by 2025. Automatic text summarization (ATS) eases the extraction of key points from information and reduces the time needed to understand it. The goal of this paper is therefore to improve ATS performance in summarizing news articles. This work fine-tunes the BART model for abstractive summarization using the IndoSum, Liputan6, and augmented Liputan6 datasets, where the Liputan6 dataset is augmented using ChatGPT. Recall-Oriented Understudy for Gisting Evaluation (ROUGE) is used as the evaluation metric. For the data augmentation, ChatGPT was given 10% of the clean news articles from the Liputan6 training set and generated abstractive summaries from them, yielding over 36 thousand samples for the model's fine-tuning. The BART model fine-tuned on the IndoSum, Liputan6, and augmented Liputan6 datasets achieved the best ROUGE-2 score, outperforming the ORACLE model, although ORACLE retained the best ROUGE-1 and ROUGE-L scores. These results indicate that fine-tuning the BART model on multiple datasets improves its performance on abstractive summarization tasks.
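As a rough illustration of the pipeline the abstract describes, the sketch below (not the authors' code) fine-tunes a BART checkpoint on article-summary pairs with the Hugging Face Transformers library and scores the generated summaries with ROUGE. The checkpoint name, hyperparameters, and the toy in-memory dataset are placeholders standing in for IndoSum, Liputan6, and the ChatGPT-augmented Liputan6 data.

```python
# Hypothetical sketch of the paper's pipeline: fine-tune BART on
# (article, summary) pairs, then evaluate with ROUGE-1/2/L.
# Checkpoint, hyperparameters, and data below are placeholders.
import evaluate
from datasets import Dataset
from transformers import (
    AutoTokenizer,
    BartForConditionalGeneration,
    DataCollatorForSeq2Seq,
    Seq2SeqTrainer,
    Seq2SeqTrainingArguments,
)

checkpoint = "facebook/bart-base"  # placeholder; any BART checkpoint works
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = BartForConditionalGeneration.from_pretrained(checkpoint)

# Toy stand-in for the combined news datasets used in the paper.
pairs = Dataset.from_dict({
    "article": ["Example news article text ..."],
    "summary": ["Example abstractive summary ..."],
})

def preprocess(batch):
    # Tokenize articles as inputs and summaries as generation targets.
    model_inputs = tokenizer(batch["article"], max_length=1024, truncation=True)
    labels = tokenizer(text_target=batch["summary"], max_length=128, truncation=True)
    model_inputs["labels"] = labels["input_ids"]
    return model_inputs

tokenized = pairs.map(preprocess, batched=True, remove_columns=pairs.column_names)

trainer = Seq2SeqTrainer(
    model=model,
    args=Seq2SeqTrainingArguments(
        output_dir="bart-news-sum",
        per_device_train_batch_size=4,
        num_train_epochs=3,
        predict_with_generate=True,
    ),
    train_dataset=tokenized,
    data_collator=DataCollatorForSeq2Seq(tokenizer, model=model),
)
trainer.train()

# Generate summaries and compute ROUGE against the references.
rouge = evaluate.load("rouge")
inputs = tokenizer(pairs["article"], return_tensors="pt",
                   truncation=True, padding=True).to(model.device)
summary_ids = model.generate(**inputs, max_length=128, num_beams=4)
preds = tokenizer.batch_decode(summary_ids, skip_special_tokens=True)
print(rouge.compute(predictions=preds, references=pairs["summary"]))
```

In the paper's setting, the toy dataset would be replaced by the IndoSum and Liputan6 training splits plus the ChatGPT-generated summaries, and the reported ROUGE-1, ROUGE-2, and ROUGE-L scores would be computed on the held-out test set.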