Bridging Linguistic Gaps: Developing a Greek Text Simplification Dataset

Leonidas Agathos, Andreas Kanavos, Xristiana Kryelesi, Andreas Avgoustis, Despoina Mouratidis, Katia Lida Kermanidis, Aikaterini Makridou, Ilias Tzanis

doi:10.3390/info15080500

Abstract

Text simplification is crucial in bridging the comprehension gap in today’s information-rich environment. Despite advances in English text simplification, languages with intricate grammatical structures, such as Greek, often remain under-explored. The complexity of Greek grammar, characterized by its flexible syntactic ordering, presents unique challenges that hinder comprehension for native speakers, learners, tourists, and international students. This paper introduces a comprehensive dataset for Greek text simplification, containing over 7500 sentences across diverse topics such as history, science, and culture, tailored to address these challenges. We outline the methodology for compiling this dataset, including the collection of texts from Greek Wikipedia, their annotation with simplified versions, and the establishment of robust evaluation metrics. Additionally, the paper details the implementation of quality control measures and the application of machine learning techniques to analyze text complexity. Our experimental results demonstrate the dataset’s initial effectiveness and its potential to reduce linguistic barriers and enhance communication, with preliminary machine learning models showing promising directions for future improvements in classifying text complexity. The development of this dataset marks a significant step toward improving accessibility and comprehension for a broad audience of Greek speakers and learners, fostering a more inclusive society.

Journal: Information
Publication Date: Aug 20, 2024
License type: CC BY 4.0
