A HYBRID FEATURE SELECTION APPROACH FOR ROMAN URDU TEXT CLASSIFICATION

Waqas Azeem, Chakir Aziza

doi:10.36755/jac.v2i1.61

Abstract

Text classification is the task of assigning labels to unlabeled text data. It has several applications, such as sentiment analysis, document classification, and fake news detection. Machine learning (ML) methods have been commonly used for text classification in the last several years. A fundamental problem is that these approaches depend heavily on feature selection methods. This research examines several models and feature selection methods. Several past studies conclude that no single feature selection method works well for all types of classification tasks; in addition, Urdu is a resource-poor language. This study proposes a hybrid feature selection approach for Roman Urdu text that not only reduces the dimensionality of the feature space but also increases the accuracy of ML models. Datasets of 11,000 and 20,000 records were used with a Support Vector Classifier, Naive Bayes, and Decision Tree, which achieved accuracies of 80.81%, 72.94%, and 76.78%, respectively, among the tested methods. The best accuracy for each classifier was achieved with the hybrid features ChiSAE, CorrelationAE, and GainRAE. In future work, text classification will be applied to better understand human self-analysis, and deep learning methods will be used for better authenticity.
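The abstract does not define how the hybrid features are built, so the following is only a minimal sketch of one plausible ingredient: a chi-square filter that ranks terms by how strongly their presence is associated with a class label. The toy Roman Urdu sentences, the labels, and the helper names `chi_square_scores` and `select_top_k` are all hypothetical and are not taken from the paper.

```python
from collections import Counter

def chi_square_scores(docs, labels):
    """Score each term with the chi-square statistic of its 2x2
    presence/class contingency table (maximum over classes)."""
    n = len(docs)
    token_sets = [set(d.lower().split()) for d in docs]
    vocab = set().union(*token_sets)
    class_counts = Counter(labels)
    scores = {}
    for term in vocab:
        best = 0.0
        for cls, n_cls in class_counts.items():
            # a: docs of this class containing the term
            # b: docs of other classes containing the term
            a = sum(1 for t, y in zip(token_sets, labels) if term in t and y == cls)
            b = sum(1 for t, y in zip(token_sets, labels) if term in t and y != cls)
            c = n_cls - a          # docs of this class without the term
            d = (n - n_cls) - b    # docs of other classes without the term
            denom = (a + c) * (b + d) * (a + b) * (c + d)
            if denom:
                best = max(best, n * (a * d - b * c) ** 2 / denom)
        scores[term] = best
    return scores

def select_top_k(scores, k):
    """Return the k highest-scoring terms (ties broken alphabetically)."""
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]

# Hypothetical toy data, for illustration only (not the paper's dataset).
docs = [
    "yeh film bohat acha hai",
    "yeh film bohat bura hai",
    "khana bohat acha tha",
    "khana bohat bura tha",
]
labels = ["pos", "neg", "pos", "neg"]

scores = chi_square_scores(docs, labels)
selected = select_top_k(scores, 2)
print(selected)
```

In this toy set only the two sentiment-bearing words receive a nonzero score, so keeping the top two terms shrinks the vocabulary while retaining the class-discriminating features. A classifier would then be trained on the reduced feature set.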

Journal: Journal of Advancement in Computing
Publication Date: Jan 31, 2024
License type: CC BY 4.0
