Nitelik Çıkarımı Yöntemlerinin Türkçe Metinlerin Sınıflandırılmasına Etkisi

Özge Akdoğan,Selma Ayşe Özel

doi:10.21605/cukurovaummfd.637643

Abstract

Feature extraction is the most important preprocessing step of text classification task. Effects of preprocessing techniques on text mining for English have been extensively studied. However, studies for Turkish are limited and generally belong to a specific problem domain. In this study, we investigate the effects of feature extraction techniques on four different Turkish text classification problems including news classification, spam e-mail detection, sentiment analysis, and author detection to show the differences and similarities among the problems. We also propose a new feature selection method to reduce feature space. The experimental analysis has showed that, stopword removal improves classification performance. However, stemming does not make any positive effect on classification accuracy. The most successful term weighting methods are tf and tf*idf. The proposed feature selection method improves classification performance and has higher accuracy than the well-known methods.&nbsp;

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Nitelik Çıkarımı Yöntemlerinin Türkçe Metinlerin Sınıflandırılmasına Etkisi

Abstract

Talk to us

Similar Papers

More From: Çukurova Üniversitesi Mühendislik-Mimarlık Fakültesi Dergisi

Lead the way for us

Journal: Çukurova Üniversitesi Mühendislik-Mimarlık Fakültesi Dergisi	Publication Date: Sep 30, 2019
Citations: 2

Similar Papers

Analysis and Evaluation of Feature Selection and Feature Extraction Methods
Rubén E Nogales ... Marco E Benalcázar
International Journal of Computational Intelligence Systems | VOL. 16
Rubén E Nogales, et. al.Rubén E Nogales ... Marco E Benalcázar
20 Sep 2023
International Journal of Computational Intelligence Systems | VOL. 16

An Empirical Study of Several Information Theoretic Based Feature Extraction Methods for Classifying High Dimensional Low Sample Size Data
Sheena Leeza Verghese ... Tomas H Maul
IEEE Access | VOL. 9
Sheena Leeza Verghese, et. al.Sheena Leeza Verghese ... Tomas H Maul
01 Jan 2020
IEEE Access | VOL. 9

A New Feature Selection Method for Sentiment Analysis in Short Text
H M Keerthi Kumar ... B S Harish
Journal of Intelligent Systems | VOL. 29
H M Keerthi Kumar, et. al.H M Keerthi Kumar ... B S Harish
04 Dec 2018
Journal of Intelligent Systems | VOL. 29

Research on Feature Selection and kNN Classification Method in Chinese Text Classification
Chao Xiao ... Ping Wu
-
Chao Xiao, et. al.Chao Xiao ... Ping Wu
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Nitelik Çıkarımı Yöntemlerinin Türkçe Metinlerin Sınıflandırılmasına Etkisi

Abstract

Talk to us

Similar Papers

More From: Çukurova Üniversitesi Mühendislik-Mimarlık Fakültesi Dergisi