The Saudi Novel Corpus: Design and Compilation

Tareq Alfraidi, Abdulmohsen Al-Thubaity, Reyadh Alluhaibi, Mohammad A R Abdeen, Ahmed Yatimi

doi:10.3390/app12136648

Open Access

https://doi.org/10.3390/app12136648

Abstract

Arabic has recently received significant attention from corpus compilers, leading to the creation of many Arabic corpora that cover various genres, most notably newswire. Yet Arabic novels, and specifically those authored by Saudi writers, lack sufficient digital datasets to support corpus-linguistic and stylistic studies of these works. In this respect, Arabic lags behind English and other European languages. In this paper, we present the Saudi Novels Corpus, built to be a valuable resource for the linguistic and stylistic research communities. We present the procedures we followed and the decisions we made in creating the corpus, and we describe the design criteria, data collection methods, annotation process, and encoding. In addition, we present preliminary results from an analysis of the corpus content. We consider this work an initial step toward bridging the existing gap between corpus linguistics and Arabic literary texts. Further work is planned to improve the quality of the corpus by adding advanced features.

Journal: Applied Sciences
Publication Date: Jun 30, 2022
Citations: 3
License type: CC BY 4.0
