PIRAP: A Study on Optimized Multi-Language Classification and Text Categorization Using Supervised Hybrid Machine Learning Approaches

Shweta S Aladakatti,Senthil Kumar Swami Durai

doi:10.1142/s0218843023500041

Abstract

Nowadays, all the records in various languages are accessible with their advanced structures. For simple recovery of these digitized records, these reports should be ordered into a class as indicated by their content. Text Categorization is an area of Text Mining which helps to overcome this challenge. Text Classification is a demonstration of allotting classes to records. This paper investigates Text Classification works done in foreign Languages, regional languages and a list of books’ content. Messages available in different languages force the difficulties of NLP approaches. This study shows that supervised ML algorithms such as Logistic regression, Naive Bayes classifier, [Formula: see text]-Nearest-Neighbor classifier, Decision Tree and SVMs performed better for Text Classification tasks. The automated document classification technique is useful in our day-to-day life to find out the type of language and different department books based on their text content. We have been using different foreign and regional languages here to classify such as Tamil, Telugu, Kannada, Bengali, English, Spanish, French, Russian and German. Here, we utilize one versus all SVMs for multi-characterization with 3-crease Cross Validation in all cases and see that SVMs outperform different classifiers. This implementation is done by using hybrid classifiers and it depicts analyses with delicate edge straight SVMs as well as bit-based SVMs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

PIRAP: A Study on Optimized Multi-Language Classification and Text Categorization Using Supervised Hybrid Machine Learning Approaches

Abstract

Talk to us

Similar Papers

More From: International Journal of Cooperative Information Systems

Lead the way for us

Journal: International Journal of Cooperative Information Systems	Publication Date: May 20, 2024
Citations: 1

Similar Papers

Text Classification in Architecture Field Based on Naive Bayes Algorithm
Xinyi Sun ... Liming Du
-
Xinyi Sun, et. al.Xinyi Sun ... Liming Du
01 Jun 2022
01 Jun 2022

A refinement approach to handling model misfit in text categorization
Haoran Wu ... Xiaoli Li
-
Haoran Wu, et. al.Haoran Wu ... Xiaoli Li
23 Jul 2002
23 Jul 2002

MPEG VBR video traffic classification using Bayesian and nearest neighbor classifiers
Qilian Liang
-
Qilian Liang Qilian Liang
07 Aug 2002
07 Aug 2002

Using Text Classification for Gene Function Annotation
Soumya Raychaudhuri
-
Soumya RaychaudhuriSoumya Raychaudhuri
26 Jan 2006
26 Jan 2006

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

PIRAP: A Study on Optimized Multi-Language Classification and Text Categorization Using Supervised Hybrid Machine Learning Approaches

Abstract

Talk to us

Similar Papers

More From: International Journal of Cooperative Information Systems