Arabic News Research Articles

AbstractText classification is the process of labelling a given set of text documents with predefined classes or categories. Existing Arabic text classifiers are either applying classic Machine Learning algorithms such as k‐NN and SVM or using modern deep learning techniques. The former are assessed using small text collections and their accuracy is still subject to improvement while the latter are efficient in classifying big data collections and show limited effectiveness in classifying small corpora with a large number of categories. This paper proposes a new approach to Arabic text classification to treat small and large data collections while improving the classification rates of existing classifiers. We first demonstrate the ability of analogical proportions (AP) (statements of the form ‘x is to as is to ’), which have recently been shown to be effective in classifying ‘structured’ data, to classify ‘unstructured’ text documents requiring preprocessing. We design an analogical model to express the relationship between text documents and their real categories. Next, based on this principle, we develop two new analogical Arabic text classifiers. These rely on the idea that the category of a new document can be predicted from the categories of three others, in the training set, in case the four documents build together a ‘valid’ analogical proportion on all or on a large number of components extracted from each of them. The two proposed classifiers (denoted AATC1 and AATC2) differ mainly in terms of the keywords extracted for classification. To evaluate the proposed classifiers, we perform an extensive experimental study using five benchmark Arabic text collections with small or large sizes, namely ANT (Arabic News Texts) v2.1 and v1.1, BBC‐Arabic, CNN‐Arabic and AlKhaleej‐2004. We also compare analogical classifiers with both classical ML‐based and Deep Learning‐based classifiers. Results show that AATC2 has the best average accuracy (78.78%) over all other classifiers and the best average precision (0.77) ranked first followed by AATC1 (0.73), NB (0.73) and SVM (0.72) for the ANT corpus v2.1. Besides, AATC1 shows the best average precisions (0.88) and (0.92), respectively for the BBC‐Arabic corpus and AlKhaleej‐2004, and the best average accuracy (85.64%) for CNN‐Arabic over all other classifiers. Results demonstrate the utility of analogical proportions for text classification. In particular, the proposed analogical classifiers are shown to significantly outperform a number of existing Arabic classifiers, and in many cases, compare favourably to the robust SVM classifier.

Fake news has become a serious problem due to many reasons; its rapid spread across the Internet, the difficulty of discovering and distinguishing it from the real news, and the increasing dependence of individuals on social media platforms as the main source. In addition, it has harmful consequences at different levels; individual, community, political, and financial levels. Arabic fake news detection task still needs a considerable amount of effort, due to the lack of datasets and limited research in this field. In this thesis, we investigate detecting the credibility of news in Arabic, based on deep learning methods and transformers models for capturing the hidden information and pattern rather than depending on a set of handcrafted features, that have been addressed in most of the related work in the literature. Also, we release “ArabicFakeNews” which is a manual fake news dataset that contains 2 k Arabic fake news collected from different sources. We also study how the length of news can affect the model performance by applying the deep learning model and transformer-based model on three datasets with different average lengths (title, description, and text dataset). The description dataset performs better in classifying the news, this is due to the fact that the average length of fake news is close to the average length of real news in this dataset. The evaluation result shows that transformers models outperform the deep learning models on the three datasets. The transformer results show that AraBERTv2 on the “description” dataset gets the highest result among all other transformers with an accuracy of 0.97, 0.97 f1-sore, 0.97 precision, and 0.9658 recall.

Arabic News Research Articles

Related Topics

Articles published on Arabic News

Accessibility barriers in arabic news websites for visually impaired users: a mixed-method evaluation approach

Framing the shooting of Al Jazeera journalist Shireen Abu Akleh in English and Arabic news headlines: a critical discourse study

Pre-Trained Language Model Ensemble for Arabic Fake News Detection

Amina: an Arabic multi-purpose integral news articles dataset

Approach for Detecting Arabic Fake News using Deep Learning

Rima Aliterasi dan Asonansi sebagai Sumber Ritma Linguistik Surah Fatihah yang Mensinkroni Gelombang Otak

Kesan Modul Terapi Realiti (MTR) Terhadap Ciri-ciri dan Kemurungan Mangsa Buli dari Perspektif Islam

Transediting the terms used for describing the US dollar in Arabic and English news websites: a triangulational study of transediting strategies adopted in four channels

A Deep Learning-based Classification Model for Arabic News Tweets Using Bidirectional Long Short-Term Memory Networks

Predictive Modeling for Arabic Fake News Detection: Leveraging Language Model Embeddings and Stacked Ensemble

Rumor gatekeepers: Unsupervised ranking of Arabic twitter authorities for information verification

Arabic text classification based on analogical proportions

Survey of machine learning techniques for Arabic fake news detection

A Qualitative Exploration Of The Impact Of Fostering AFL Learners' Vocabulary Skills Using BBC Arabic Live News

An Ensemble Keyword Extraction Model for News Texts with Statistical and Graphical Features

Arabic Fake News Detection in Social Media Context Using Word Embeddings and Pre-trained Transformers

Al-Quds Issues in The Media: A Comparative Study of Aqsa T.V. And Palestine T.V. Coverage During the American Embassy Relocating

Detect Arabic fake news through deep learning models and Transformers

Mediated Clash of Civilizations: Examining the Proximity-Visual Framing Nexus in Al Jazeera Arabic and Fox News’ Coverage of the 2021 Gaza War

Ensemble-Based Machine Learning Approach for Detecting Arabic Fake News on Twitter

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Arabic News Research Articles

Related Topics

Articles published on Arabic News

Accessibility barriers in arabic news websites for visually impaired users: a mixed-method evaluation approach

Framing the shooting of Al Jazeera journalist Shireen Abu Akleh in English and Arabic news headlines: a critical discourse study

Pre-Trained Language Model Ensemble for Arabic Fake News Detection

Amina: an Arabic multi-purpose integral news articles dataset

Approach for Detecting Arabic Fake News using Deep Learning

Rima Aliterasi dan Asonansi sebagai Sumber Ritma Linguistik Surah Fatihah yang Mensinkroni Gelombang Otak

Kesan Modul Terapi Realiti (MTR) Terhadap Ciri-ciri dan Kemurungan Mangsa Buli dari Perspektif Islam

Transediting the terms used for describing the US dollar in Arabic and English news websites: a triangulational study of transediting strategies adopted in four channels

A Deep Learning-based Classification Model for Arabic News Tweets Using Bidirectional Long Short-Term Memory Networks

Predictive Modeling for Arabic Fake News Detection: Leveraging Language Model Embeddings and Stacked Ensemble

Rumor gatekeepers: Unsupervised ranking of Arabic twitter authorities for information verification

Arabic text classification based on analogical proportions

Survey of machine learning techniques for Arabic fake news detection

A Qualitative Exploration Of The Impact Of Fostering AFL Learners' Vocabulary Skills Using BBC Arabic Live News

An Ensemble Keyword Extraction Model for News Texts with Statistical and Graphical Features

Arabic Fake News Detection in Social Media Context Using Word Embeddings and Pre-trained Transformers

Al-Quds Issues in The Media: A Comparative Study of Aqsa T.V. And Palestine T.V. Coverage During the American Embassy Relocating

Detect Arabic fake news through deep learning models and Transformers

Mediated Clash of Civilizations: Examining the Proximity-Visual Framing Nexus in Al Jazeera Arabic and Fox News’ Coverage of the 2021 Gaza War

Ensemble-Based Machine Learning Approach for Detecting Arabic Fake News on Twitter