MULTILINGUAL TEXT CLASSIFIER USING PRE-TRAINED UNIVERSAL SENTENCE ENCODER MODEL

O V Orlovskiy,S E Ostapov,K P Hazdyuk,L M Shumylyak,Khalili Sohrab

doi:10.15588/1607-3274-2022-3-10

Abstract

Context. Online platforms and environments continue to generate ever-increasing content. The task of automating the moderation of user-generated content continues to be relevant. Of particular note are cases in which, for one reason or another, there is a very small amount of data to teach the classifier. To achieve results under such conditions, it is important to involve the classifier pre-trained models, which were trained on a large amount of data from a wide range. This paper deals with the use of the pre-trained multilingual Universal Sentence Encoder (USE) model as a component of the developed classifier and the affect of hyperparameters on the classification accuracy when learning on a small data amount (~ 0.05% of the dataset). Objective. The goal of this paper is the investigation of the pre-trained multilingual model and optimal hyperparameters influence for learning the text data classifier on the classification result. Method. To solve this problem, a relatively new approach to few-shot learning has recently been used – learning with a relatively small number of examples. Since text data is still the dominant way of transmitting information, the study of the possibilities of constructing a classifier of text data when learning from a small number of examples (~ 0.002–0.05% of the data set) is an actual problem. Results. It is shown that even with a small number of examples for learning (36 per class) due to the use of USE and optimal configuration in learning can achieve high accuracy of classification on English and Russian data, which is extremely important when it is impossible to collect your own large data set. The influence of the approach using USE and a set of different configurations of hyperparameters on the result of the text data classifier on the example of English and Russian data sets is evaluated. Conclusions. During the experiments, a significant degree of relevance of the correct selection of hyperparameters is shown. In particular, this paper considered the batch size, optimizer, number of learning epochs and the percentage of data from the set taken to train the classifier. In the process of experimentation, the optimal configuration of hyperparameters was selected, according to which 86.46% accuracy of classification on the Russian-language data set and 91.13% on the English-language data, respectively, can be achieved in ten seconds of training (training time can be significantly affected by technical means used).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

MULTILINGUAL TEXT CLASSIFIER USING PRE-TRAINED UNIVERSAL SENTENCE ENCODER MODEL

Abstract

Talk to us

Similar Papers

More From: Radio Electronics, Computer Science, Control

Lead the way for us

Journal: Radio Electronics, Computer Science, Control	Publication Date: Oct 16, 2022
License type: cc-by-sa

Similar Papers

DeepEmotex: Classifying Emotion in Text Messages using Deep Transfer Learning
Maryam Hasan ... Elke Rundensteiner
-
Maryam Hasan, et. al.Maryam Hasan ... Elke Rundensteiner
15 Dec 2021
15 Dec 2021

Few-shot code translation via task-adapted prompt learning
Xuan Li ... Beijun Shen
Journal of Systems and Software | VOL. 212
Xuan Li, et. al.Xuan Li ... Beijun Shen
17 Feb 2024
Journal of Systems and Software | VOL. 212

Automatic Exam Correction Framework (AECF) for the MCQs, Essays, and Equations Matching
Hossam Magdy Balaha ... Mahmoud M Saafan
IEEE Access | VOL. 9
Hossam Magdy Balaha, et. al.Hossam Magdy Balaha ... Mahmoud M Saafan
01 Jan 2020
IEEE Access | VOL. 9

An empirical evaluation of text representation schemes to filter the social media stream
Sandip Modha ... Thomas Mandl
Journal of Experimental & Theoretical Artificial Intelligence | VOL. 34
Sandip Modha, et. al.Sandip Modha ... Thomas Mandl
24 Apr 2021
Journal of Experimental & Theoretical Artificial Intelligence | VOL. 34

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

MULTILINGUAL TEXT CLASSIFIER USING PRE-TRAINED UNIVERSAL SENTENCE ENCODER MODEL

Abstract

Talk to us

Similar Papers

More From: Radio Electronics, Computer Science, Control