Large Number Of Categories Research Articles

AbstractText classification is the process of labelling a given set of text documents with predefined classes or categories. Existing Arabic text classifiers are either applying classic Machine Learning algorithms such as k‐NN and SVM or using modern deep learning techniques. The former are assessed using small text collections and their accuracy is still subject to improvement while the latter are efficient in classifying big data collections and show limited effectiveness in classifying small corpora with a large number of categories. This paper proposes a new approach to Arabic text classification to treat small and large data collections while improving the classification rates of existing classifiers. We first demonstrate the ability of analogical proportions (AP) (statements of the form ‘x is to as is to ’), which have recently been shown to be effective in classifying ‘structured’ data, to classify ‘unstructured’ text documents requiring preprocessing. We design an analogical model to express the relationship between text documents and their real categories. Next, based on this principle, we develop two new analogical Arabic text classifiers. These rely on the idea that the category of a new document can be predicted from the categories of three others, in the training set, in case the four documents build together a ‘valid’ analogical proportion on all or on a large number of components extracted from each of them. The two proposed classifiers (denoted AATC1 and AATC2) differ mainly in terms of the keywords extracted for classification. To evaluate the proposed classifiers, we perform an extensive experimental study using five benchmark Arabic text collections with small or large sizes, namely ANT (Arabic News Texts) v2.1 and v1.1, BBC‐Arabic, CNN‐Arabic and AlKhaleej‐2004. We also compare analogical classifiers with both classical ML‐based and Deep Learning‐based classifiers. Results show that AATC2 has the best average accuracy (78.78%) over all other classifiers and the best average precision (0.77) ranked first followed by AATC1 (0.73), NB (0.73) and SVM (0.72) for the ANT corpus v2.1. Besides, AATC1 shows the best average precisions (0.88) and (0.92), respectively for the BBC‐Arabic corpus and AlKhaleej‐2004, and the best average accuracy (85.64%) for CNN‐Arabic over all other classifiers. Results demonstrate the utility of analogical proportions for text classification. In particular, the proposed analogical classifiers are shown to significantly outperform a number of existing Arabic classifiers, and in many cases, compare favourably to the robust SVM classifier.

At the present stage of the study, powerlifting is one of the priority sports in Ukraine among various segments of the population. This is a fairly new and modern sport with a huge history, which has played an important role in the formation and implementation of sports activities. The popularity of powerlifting is due to its simplicity, accessibility for a large number of categories of the population, fast results with the right technique and strong-willed qualities of character, it is also important to have a positive impact on the athlete's health. and the psychological state of man. Powerlifting classes increase muscle strength, strengthen ligaments and joints, help to develop endurance, flexibility and other physical qualities, cultivate willpower, self-confidence, increase the efficiency of the whole organism. Strength sports have always been popular among athletes and people who strive to lead a healthy lifestyle. Recently, this area of sport is developing particularly rapidly, such relatively new types as bodybuilding, powerlifting, bench press, arm wrestling, armlifting, workout have become widespread. The main problem faced by athletes and coaches is the almost complete lack of competent training systems for these power sports, as well as in the process of competitive training: the requirements and features of competitive activities. The reason is clear and quite objective - the relative novelty and lack of theoretical basis. Now the only possible way to create a training technique is mechanical borrowing from other sports, mainly from weightlifting. Powerlifting (from English. Powerlifting, power - strength, lifting - lifting) - power triathlon, includes three competitive exercises with a barbell: squats, bench press and traction. Powerlifting sports training is a process aimed at comprehensive and systematic improvement of all physical and moral qualities that are leading in this sport. Sports training in powerlifting includes training in the technique of competitive exercises (squats, bench press, traction position) and the development of physical qualities with special auxiliary exercises with a barbell and other types of weights. Despite the fact that powerlifting does not require special technical skills. The three movements of the competition look simple and, it would seem, with some physical strength you can show significant results. However, as in any other sport, powerlifting has many nuances and factors, despite which it is impossible to talk about any results. First of all, these are the general basics of technique, as well as individually selected techniques and training methods that take into account age, anatomical, biomechanical, psychological characteristics of the athlete, his physical fitness. Since all three movements in powerlifting are associated with a heavy load on the joint and musculoskeletal system, any serious powerlifting is impossible without constant medical supervision (at least once a month). Not the last role is played by the training and competitive equipment meeting the requirements of the international standards and the equipment of the athlete.

Large Number Of Categories Research Articles

Articles published on Large Number Of Categories

Preserving text space integrity for robust compositional zero-shot learning via mixture of pretrained experts

Arabic text classification based on analogical proportions

A novel plant type, leaf disease and severity identification framework using CNN and transformer with multi-label method.

Investigating the Functioning of Rating Scales With Rasch Models.

Finding Discriminative Subsequences Via a Coverage Measure and Mutual Information Selection Strategy for Multi-Class Time Series Classification

Compound Text-Guided Prompt Tuning via Image-Adaptive Cues

Bayesian estimation of the Kullback-Leibler divergence for categorical systems using mixtures of Dirichlet priors.

On clustering levels of a hierarchical categorical risk factor

An improved multi-stage framework for large-scale hierarchical text classification problems using a modified feature hashing and bi-filtering strategy

Image Classification on Fashion Dataset Using Inception V3

Метрика схожості категоріальних розподілів, що враховує спорідненість різних категорій

КОНЦЕПТУАЛЬНАЯ ОБЛАСТЬ PERSONALITY TRAITS: СТРУКТУРА И СРЕДСТВА ЯЗЫКОВОЙ РЕПРЕЗЕНТАЦИИ

Learning From High-Cardinality Categorical Features in Deep Neural Networks

Sequence Embeddings Help Detect Insurance Fraud

A classification method to classify bone marrow cells with class imbalance problem

The number of response categories in ordered response models.

Historical analysis of the development of powerlifting. Powerlifting. Stages of the learning process. Competition texchniques

Identification of Multi-Class Drugs Based on Near Infrared Spectroscopy and Bidirectional Generative Adversarial Networks.

An explainable attention network for fraud detection in claims management

Bringing Context Inside Process Research with Digital Trace Data

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Large Number Of Categories Research Articles

Articles published on Large Number Of Categories

Preserving text space integrity for robust compositional zero-shot learning via mixture of pretrained experts

Arabic text classification based on analogical proportions

A novel plant type, leaf disease and severity identification framework using CNN and transformer with multi-label method.

Investigating the Functioning of Rating Scales With Rasch Models.

Finding Discriminative Subsequences Via a Coverage Measure and Mutual Information Selection Strategy for Multi-Class Time Series Classification

Compound Text-Guided Prompt Tuning via Image-Adaptive Cues

Bayesian estimation of the Kullback-Leibler divergence for categorical systems using mixtures of Dirichlet priors.

On clustering levels of a hierarchical categorical risk factor

An improved multi-stage framework for large-scale hierarchical text classification problems using a modified feature hashing and bi-filtering strategy

Image Classification on Fashion Dataset Using Inception V3

Метрика схожості категоріальних розподілів, що враховує спорідненість різних категорій

КОНЦЕПТУАЛЬНАЯ ОБЛАСТЬ PERSONALITY TRAITS: СТРУКТУРА И СРЕДСТВА ЯЗЫКОВОЙ РЕПРЕЗЕНТАЦИИ

Learning From High-Cardinality Categorical Features in Deep Neural Networks

Sequence Embeddings Help Detect Insurance Fraud

A classification method to classify bone marrow cells with class imbalance problem

The number of response categories in ordered response models.

Historical analysis of the development of powerlifting. Powerlifting. Stages of the learning process. Competition texchniques

Identification of Multi-Class Drugs Based on Near Infrared Spectroscopy and Bidirectional Generative Adversarial Networks.

An explainable attention network for fraud detection in claims management

Bringing Context Inside Process Research with Digital Trace Data