Türkçe Metinde Topluluk Öğrenme ve Genetik Algoritma Kombinasyonu Tabanlı Yazar Tahmini

Merve Güllü,Hüseyin Polat

doi:10.2339/politeknik.992493

Abstract

The easiness of reaching information through the internet and social media and the expansiveness of opportunities for searching, copying, and spreading data have caused some problems in identifying an author for a specific text. A text carries the characteristic features of the person who wrote it, and these features can be used to identify its author. For this study, we are offering a method that is based on an approach using ensemble learning algorithm (ELA) and genetic algorithm (GA) for author identification in Tur-kish texts. The raw data set, which includes 40 authors and 3269 texts, was created from Turkish news websites and analyzed in pre-processing step. After, syntactic and structural analyses were done on the data and, in total, 6 different data sets were created. Each of the data sets was subjected to the feature selection process by using GA and ELA approach together. Each of the obtained data sets from the previous step was classified by using the ELA's bagging method which contains 5 different classifiers, namely, Naive Bayes, K-Nearest Neighbor, Artificial Neural Networks, Support Vector Machine, and Decision Tree. After applying the aforementioned processes to the raw data, the author identification approach reached 89% accuracy. The combination of ELA and GA has a strong potential to identify the author of a text.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Politeknik Dergisi	Publication Date: Oct 1, 2022
Citations: 3	License type: cc-by-sa

R Discovery Prime

R Discovery Prime

Türkçe Metinde Topluluk Öğrenme ve Genetik Algoritma Kombinasyonu Tabanlı Yazar Tahmini

Abstract

Talk to us

Similar Papers

More From: Politeknik Dergisi

Lead the way for us

Similar Papers

Machine learning in pain research.
Jörn Lötsch ... Alfred Ultsch
Pain | VOL. 159
Jörn Lötsch, et. al.Jörn Lötsch ... Alfred Ultsch
24 Nov 2017
Pain | VOL. 159

評估基於微陣列晶片資料之動態參數基因演算法（GADP）的最適分類器
...
-
, et. al. ...
01 Oct 2012
01 Oct 2012

불균형 데이터 집합의 분류를 위한 하이브리드 SVM 모델
Jae Sik Lee ... Jong Gu Kwon
Journal of Intelligence and Information Systems | VOL. 19
Jae Sik Lee, et. al.Jae Sik Lee ... Jong Gu Kwon
30 Jun 2013
Journal of Intelligence and Information Systems | VOL. 19

Comparison of Ensemble Machine Learning Methods for Soil Erosion Pin Measurements
Kieu Anh Nguyen ... Bor-Shiun Lin
ISPRS International Journal of Geo-Information | VOL. 10
Kieu Anh Nguyen, et. al.Kieu Anh Nguyen ... Bor-Shiun Lin
19 Jan 2021
ISPRS International Journal of Geo-Information | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Türkçe Metinde Topluluk Öğrenme ve Genetik Algoritma Kombinasyonu Tabanlı Yazar Tahmini

Abstract

Talk to us

Similar Papers

More From: Politeknik Dergisi