An ant colony optimization based feature selection for web page classification.

Esra Saraç,Selma Ayşe Özel

doi:10.1155/2014/649260

Esra Saraç, Selma Ayşe Özel

Open Access

https://doi.org/10.1155/2014/649260

Copy DOI

Journal: The Scientific World Journal	Publication Date: Jan 1, 2014
Citations: 74	License type: CC BY 3.0

Affiliation: Cukurova University

Abstract

The increased popularity of the web has caused the inclusion of huge amount of information to the web, and as a result of this explosive information growth, automated web page classification systems are needed to improve search engines' performance. Web pages have a large number of features such as HTML/XML tags, URLs, hyperlinks, and text contents that should be considered during an automated classification process. The aim of this study is to reduce the number of features to be used to improve runtime and accuracy of the classification of web pages. In this study, we used an ant colony optimization (ACO) algorithm to select the best features, and then we applied the well-known C4.5, naive Bayes, and k nearest neighbor classifiers to assign class labels to web pages. We used the WebKB and Conference datasets in our experiments, and we showed that using the ACO for feature selection improves both accuracy and runtime performance of classification. We also showed that the proposed ACO based algorithm can select better features with respect to the well-known information gain and chi square feature selection methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An ant colony optimization based feature selection for web page classification.

Abstract

Talk to us

Similar Papers

More From: The Scientific World Journal

Lead the way for us

Similar Papers

Web Page Classification Based on Novel Black Widow Meta-Heuristic Optimization with Deep Learning Technique
V Gokula Krishnan ... J Deepa
-
V Gokula Krishnan, et. al.V Gokula Krishnan ... J Deepa
01 Jan 2021
01 Jan 2021

Information gain and divergence-based feature selection for machine learning-based text categorization
Changki Lee ... Gary Geunbae Lee
Information Processing & Management | VOL. 42
Changki Lee, et. al.Changki Lee ... Gary Geunbae Lee
03 Aug 2005
Information Processing & Management | VOL. 42

Approach for Dimensionality Reduction in Web Page Classification
Jayant Gadge ... Shraddha Sarode
International Journal of Computer Applications | VOL. 99
Jayant Gadge, et. al.Jayant Gadge ... Shraddha Sarode
20 Aug 2014
International Journal of Computer Applications | VOL. 99

Hybrid dimensionality reduction approach for web page classification
Shraddha Sarode ... Jayant Gadge
-
Shraddha Sarode, et. al.Shraddha Sarode ... Jayant Gadge
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An ant colony optimization based feature selection for web page classification.

Abstract

Talk to us

Similar Papers

More From: The Scientific World Journal