Abstract

Text summarization remains a challenging task in natural language processing despite its many applications in enterprises and daily life. One of the common use cases is the summarization of web pages, which has the potential to provide an overview of web pages to devices with limited features. In fact, despite the increasing penetration rate of mobile devices in rural areas, the bulk of those devices offer limited features, and these areas are often covered only by limited connectivity such as the GSM network. Summarizing web pages into SMS therefore becomes an important task for providing information to such limited devices. This work introduces WATS-SMS, a T5-based French Wikipedia Abstractive Text Summarizer for SMS. It is built through a transfer learning approach: the T5 English pre-trained model is used to generate a French text summarization model by retraining it on 25,000 Wikipedia pages, and the resulting model is then compared with different approaches from the literature. The objective is twofold: (1) to check the assumption made in the literature that abstractive models provide better results than extractive ones; and (2) to evaluate the performance of our model compared to other existing abstractive models. A score based on ROUGE metrics gave us a value of 52% for articles with length up to 500 characters, against 34.2% for Transformer-ED and 12.7% for seq2seq-attention, and a value of 77% for longer articles, against 37% for Transformer-DMCA. Moreover, an architecture including a software SMS gateway has been developed to allow owners of mobile devices with limited features to send requests and to receive summaries through the GSM network.
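
The paper does not include its training code. As a rough sketch under stated assumptions, the snippet below shows how an English T5 checkpoint could be fine-tuned on (article, summary) pairs extracted from French Wikipedia with the Hugging Face transformers library; the checkpoint name, data file, field names, and hyperparameters are illustrative assumptions rather than values reported by the authors.

    # Hypothetical sketch of the transfer-learning step: fine-tuning an
    # English T5 checkpoint on French (article, summary) pairs.
    # Checkpoint, file names, and hyperparameters are assumptions.
    from datasets import load_dataset
    from transformers import (DataCollatorForSeq2Seq, Seq2SeqTrainer,
                              Seq2SeqTrainingArguments,
                              T5ForConditionalGeneration, T5TokenizerFast)

    model_name = "t5-base"  # assumed size of the English pre-trained model
    tokenizer = T5TokenizerFast.from_pretrained(model_name)
    model = T5ForConditionalGeneration.from_pretrained(model_name)

    # Assumed JSON-lines file with "article" and "summary" fields taken
    # from the Wikipedia pages used for retraining.
    dataset = load_dataset("json", data_files={"train": "wiki_fr_train.jsonl"})

    def preprocess(batch):
        # T5 casts every task as text-to-text, so summarization inputs
        # carry an explicit task prefix.
        inputs = tokenizer(["summarize: " + text for text in batch["article"]],
                           max_length=512, truncation=True)
        targets = tokenizer(batch["summary"], max_length=150, truncation=True)
        inputs["labels"] = targets["input_ids"]
        return inputs

    tokenized = dataset["train"].map(preprocess, batched=True,
                                     remove_columns=["article", "summary"])

    args = Seq2SeqTrainingArguments(
        output_dir="wats-sms-t5",       # hypothetical output directory
        per_device_train_batch_size=8,
        num_train_epochs=3,
        learning_rate=3e-4,
    )

    trainer = Seq2SeqTrainer(
        model=model,
        args=args,
        train_dataset=tokenized,
        data_collator=DataCollatorForSeq2Seq(tokenizer, model=model),
    )
    trainer.train()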

Highlights

  • One of the most fascinating advances in the field of artificial intelligence is the ability of computers to understand natural language

  • This paper introduces WATS-SMS, a French Wikipedia Abstractive Text Summarizer that aims to summarize French Wikipedia pages into SMS and to provide summaries directly on the user’s device

  • T5 recasts all Natural Language Processing (NLP) tasks into a unified text-to-text format, which is different from BERT-based models that usually generate either a class label or a span of the input [57] (see the sketch after this list)
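
The following minimal sketch illustrates that interface: the task is selected by a plain-text prefix such as summarize: and the model returns free text rather than a label or a span. The checkpoint name, example text, and generation settings are illustrative assumptions, not the configuration used by WATS-SMS.

    # Minimal sketch of T5's unified text-to-text interface; the checkpoint
    # and generation settings are illustrative assumptions.
    from transformers import T5ForConditionalGeneration, T5TokenizerFast

    tokenizer = T5TokenizerFast.from_pretrained("t5-small")
    model = T5ForConditionalGeneration.from_pretrained("t5-small")

    # The stock English checkpoint handles French poorly; WATS-SMS retrains
    # it on French Wikipedia pages, as described above.
    article = ("La tour Eiffel est une tour de fer puddlé située à Paris, "
               "construite par Gustave Eiffel pour l'Exposition universelle "
               "de 1889.")
    inputs = tokenizer("summarize: " + article, return_tensors="pt",
                       max_length=512, truncation=True)

    # generate() returns free text, not a class label or an input span,
    # which is the contrast with BERT-style models drawn above.
    output_ids = model.generate(**inputs, max_new_tokens=60, num_beams=4)
    print(tokenizer.decode(output_ids[0], skip_special_tokens=True))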


Summary

Introduction

One of the most fascinating advances in the field of artificial intelligence is the ability of computers to understand natural language. This paper introduces WATS-SMS, a French Wikipedia Abstractive Text Summarizer that aims to summarize French Wikipedia pages into SMS and to provide summaries directly on the user's device. It is built by applying a transfer learning technique to fine-tune a pre-trained model on French Wikipedia pages. Although several summarization approaches have been proposed in the literature [26,27,28,29,30,31], they do not take into consideration the limitation in terms of the number of characters of an SMS.
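
The paper's exact post-processing step is not reproduced here. The helper below is a minimal sketch, assuming the standard GSM limits of 160 characters for a single SMS and 153 characters per segment of a concatenated SMS, of how a generated summary could be split into SMS-sized parts before being handed to the SMS gateway; the function and constant names are hypothetical.

    # Hypothetical post-processing helper: split a generated summary into
    # SMS-sized segments. Limits follow the standard GSM-7 encoding:
    # 160 characters for a single SMS, 153 per concatenated segment.
    SINGLE_SMS_LIMIT = 160
    CONCAT_SEGMENT_LIMIT = 153

    def split_for_sms(summary: str) -> list[str]:
        """Split a summary into SMS-sized segments on word boundaries."""
        summary = " ".join(summary.split())  # normalize whitespace
        if len(summary) <= SINGLE_SMS_LIMIT:
            return [summary]

        segments, current = [], ""
        for word in summary.split(" "):
            candidate = f"{current} {word}".strip()
            if len(candidate) <= CONCAT_SEGMENT_LIMIT:
                current = candidate
            else:
                if current:
                    segments.append(current)
                current = word  # an over-long single word is kept whole in this sketch
        if current:
            segments.append(current)
        return segments

    if __name__ == "__main__":
        demo = "Résumé généré par le modèle pour une page Wikipédia. " * 5
        for i, part in enumerate(split_for_sms(demo), start=1):
            print(f"SMS {i} ({len(part)} caractères): {part}")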

The remainder of the paper is organized as follows: related works on text summarization and their classification are first reviewed; Section 3 explains the general architecture of the WATS-SMS system, its summarization process, and the post-processing of the summary; the model is then compared with extractive approaches and with abstractive models, and a demonstration is given; conclusions and future works close the paper.