Arabic News Articles Research Articles

Hypervectors are holographic and random, with independent and identically distributed (i.i.d.) components. A hypervector holds all of its information combined and spread across every component as a holistic representation, so no component is more responsible for storing any piece of information than another. Hypervectors are combined with operations akin to addition, which give computation on the vector space an algebraic structure, and they are compared for similarity using a distance metric over that space. These operations can be composed into interesting computational behavior with unique features that make hypervectors robust and efficient. This paper focuses on an application of hyperdimensional computing: identifying the language of text samples by encoding consecutive letters into hypervectors. Recognizing the language of a given text is the first step in all sorts of language processing, for example text analysis, categorization, and translation. High-dimensional vector models are popular in Natural Language Processing and are used to capture word meaning from word statistics. In this research work, the first task is classification with high-dimensional computing on Arabic datasets, among them three news collections: Arabiya, Khaleej and Akhbarona. High-dimensional computing with N-gram encoding is applied to these datasets to obtain the results. On the SANAD single-label Arabic news articles dataset with 12-gram encoding, high-dimensional computing reaches an accuracy of 0.9665. On the RTA dataset with 6-gram encoding, it reaches an accuracy of 0.6648. On the ANT dataset with 12-gram encoding, it reaches an accuracy of 0.9248. The second task applies high-dimensional computing to Arabic language recognition for Levantine dialects, using three datasets.
The first dataset is the Shami Dialects Corpus (SDC), which contains Jordanian, Lebanese, Palestinian and Syrian dialects; high-dimensional computing with 7-gram encoding reaches an accuracy of 0.8234 on it. The second dataset is PADIC (Parallel Arabic Dialect Corpus), which contains Syrian and Palestinian Arabic dialects; with 5-gram encoding the accuracy is 0.7458. On the third dataset, MADAR (Multi-Arabic Dialect Applications and Resources), high-dimensional computing with 6-gram encoding reaches an accuracy of 0.7800.
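
The encoding described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the dimensionality `D`, the bipolar (+1/-1) representation, the rotate-and-multiply binding of letters, cosine similarity as the comparison, and the toy English/Spanish strings are all assumptions chosen for illustration. The abstract reports N between 5 and 12 on Arabic corpora; the toy example uses N = 3 only because its texts are very short.

```python
import numpy as np

D = 10_000  # hypervector dimensionality (assumed; not given in the abstract)
rng = np.random.default_rng(0)

def make_item_memory(alphabet):
    """Give every symbol a random bipolar hypervector with i.i.d. components."""
    return {ch: rng.choice(np.array([-1, 1]), size=D) for ch in alphabet}

def encode_ngram(gram, item_memory):
    """Bind the letters of one N-gram: rotate each letter by its position, then multiply."""
    hv = np.ones(D, dtype=np.int64)
    n = len(gram)
    for i, ch in enumerate(gram):
        hv *= np.roll(item_memory[ch], n - 1 - i)
    return hv

def encode_text(text, n, item_memory):
    """Bundle (add up) the hypervectors of all N-grams in a text into one profile."""
    profile = np.zeros(D, dtype=np.int64)
    for i in range(len(text) - n + 1):
        profile += encode_ngram(text[i:i + n], item_memory)
    return profile

def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom else 0.0

def classify(text, n, item_memory, class_profiles):
    """Return the label whose profile hypervector is most similar to the query."""
    query = encode_text(text, n, item_memory)
    return max(class_profiles, key=lambda label: cosine(query, class_profiles[label]))

# Toy data (hypothetical, not from the datasets named in the abstract).
train = {
    "english": "the quick brown fox jumps over the lazy dog",
    "spanish": "el rapido zorro marron salta sobre el perro perezoso",
}
query_text = "the lazy dog jumps"

alphabet = sorted(set("".join(train.values()) + query_text))
item_memory = make_item_memory(alphabet)
profiles = {label: encode_text(text, 3, item_memory) for label, text in train.items()}
predicted = classify(query_text, 3, item_memory, profiles)
print(predicted)
```

Because symbols are only dictionary keys, the same sketch accepts Arabic text unchanged; what would differ in practice is the choice of N and the amount of training text bundled into each class profile.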

Multi-label text categorization refers to the problem of assigning each document to a subset of categories by means of multi-label learning algorithms. In contrast to English and most other languages, Arabic lacks benchmark datasets, which prevents evaluating multi-label learning algorithms for Arabic text categorization. As a result, only a few recent studies have dealt with multi-label Arabic text categorization, and they used non-benchmark, inaccessible datasets. Therefore, this work aims to promote multi-label Arabic text categorization by (a) introducing “RTAnews”, a new benchmark dataset of multi-label Arabic news articles for text categorization and other supervised learning tasks, which is publicly available in several formats compatible with existing multi-label learning tools such as MEKA and Mulan; and (b) conducting an extensive comparison of most of the well-known multi-label learning algorithms for Arabic text categorization, in order to establish baseline results and show the effectiveness of these algorithms on RTAnews. The evaluation involves four multi-label transformation-based algorithms (Binary Relevance, Classifier Chains, Calibrated Ranking by Pairwise Comparison and Label Powerset) with three base learners (Support Vector Machine, k-Nearest-Neighbors and Random Forest), and four adaptation-based algorithms (Multi-label kNN, Instance-Based Learning by Logistic Regression Multi-label, Binary Relevance kNN and RFBoost). The reported baseline results show that both RFBoost and Label Powerset with Support Vector Machine as base learner outperformed the other compared algorithms. Results also demonstrated that adaptation-based algorithms are faster than transformation-based algorithms.
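
To make the idea of a transformation-based algorithm concrete, the sketch below shows how two of the named methods, Binary Relevance and Label Powerset, turn multi-label targets into ordinary single-label targets that any base learner can consume. It is a simplified illustration, not code from MEKA, Mulan, or the paper; the function names and the example label sets are hypothetical and are not taken from RTAnews.

```python
def binary_relevance(label_sets, labels):
    """Binary Relevance: one independent 0/1 target vector per label."""
    return {lab: [int(lab in s) for s in label_sets] for lab in labels}

def label_powerset(label_sets):
    """Label Powerset: each distinct label combination becomes one class id."""
    class_ids = {}
    targets = []
    for s in label_sets:
        key = frozenset(s)
        if key not in class_ids:
            class_ids[key] = len(class_ids)  # ids assigned in order of first appearance
        targets.append(class_ids[key])
    return targets, class_ids

# Hypothetical label sets for four documents.
docs_labels = [
    {"politics", "economy"},
    {"sports"},
    {"politics"},
    {"politics", "economy"},
]
all_labels = ["economy", "politics", "sports"]

br_targets = binary_relevance(docs_labels, all_labels)
lp_targets, lp_classes = label_powerset(docs_labels)

print(br_targets)
print(lp_targets)
```

Binary Relevance trains one binary classifier per label and ignores label co-occurrence, whereas Label Powerset trains a single multi-class classifier over label combinations, so it captures co-occurrence at the cost of a class count that grows with the number of distinct combinations.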

Articles published on Arabic News Articles

Computational linguistics and natural language processing techniques for semantic field extraction in Arabic online news

Amina: an Arabic multi-purpose integral news articles dataset

Rumor gatekeepers: Unsupervised ranking of Arabic Twitter authorities for information verification

Hybrid Neural Network Models for Detecting Fake News Articles

Speech Acts Used in Covid-19 English and Arabic News Reports

Arabic News Classification Based on the Country of Origin Using Machine Learning and Deep Learning Techniques

From Eco-Jihad to Politicization: A Corpus-based Eco-linguistic Discourse Analysis of the Arab Media Coverage of the Safer Floating Oil Tanker

Similarity Detection of Time-Sensitive Online News Articles Based on RSS Feeds and Contextual Data

Investigating the relevance of Arabic text classification datasets based on supervised learning

High dimensional autonomous computing on Arabic language classification

Arabic Fake News Detection Based on Textual Analysis

Arabic Fake News Detection Using Deep Learning

Media Coverage of Syrian Female Refugees in Jordan and Lebanon

Multi-label Arabic text categorization: A benchmark and baseline comparison of multi-label learning algorithms

The Representation of Laji’een (Refugees) and Muhajireen (Migrants) in the Headlines of Jordan News Agency (PETRA)

Using corpus linguistic techniques in (critical) discourse studies reduces but does not remove bias: Evidence from an Arabic corpus about refugees

Automated Arabic text classification with P‐Stemmer, machine learning, and a tailored news article taxonomy
