Classification of Malicious Web Pages through a J48 Decision Tree, a Naïve Bayes, a RBF Network and a Random Forest Classifier for WebSpam Detection

Muhammad Iqbal,Syed Hasnain Alam Kazmi,Usman Waheed,Malik Muneeb Abid

doi:10.14257/ijunesst.2017.10.4.05

Abstract

Web spam is a negative practice carried out by spammers to produce fake searchengines results for improving rank position of their Web pages. It is available on arena of World Wide Web (WWW) in different forms and lacks a consistent definition. The search engines are struggling to eliminate spam pages through machine learning (ML) detectors. Mostly, search engines measure the quality of websites by using different factors (signals) such as, number of visitors, body text, anchor text, back link and forward link etc. information and, and spammers try to induce these signals into their desired pages to subvert ranking function of search engines. This study compares the detection efficiencyof different ML classifiers trained and tested on WebSpam UK2007 data set. The results of our study show that random forest has achieve higher score than other well-known classifiers.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Classification of Malicious Web Pages through a J48 Decision Tree, a Naïve Bayes, a RBF Network and a Random Forest Classifier for WebSpam Detection

Abstract

Talk to us

Similar Papers

More From: International Journal of u- and e- Service, Science and Technology

Lead the way for us

Journal: International Journal of u- and e- Service, Science and Technology	Publication Date: Apr 30, 2017
Citations: 1

Similar Papers

An Improved Framework for Content- and Link-Based Web-Spam Detection: A Combined Approach
Asim Shahzad ... Abdullah Khan
Complexity | VOL. 2021
Asim Shahzad, et. al.Asim Shahzad ... Abdullah Khan
15 Nov 2021
Complexity | VOL. 2021

Comparative Study of Web Spam Detection using Data Mining
Chirag Nathwani ... Viralkumar Prajapati
International Journal of Computer Applications | VOL. 68
Chirag Nathwani, et. al.Chirag Nathwani ... Viralkumar Prajapati
18 Apr 2013
International Journal of Computer Applications | VOL. 68

Using Machine Learning for Web Page Classification in Search Engine Optimization
Goran Matošević ... Jasminka Dobša
Future Internet | VOL. 13
Goran Matošević, et. al.Goran Matošević ... Jasminka Dobša
02 Jan 2020
Future Internet | VOL. 13

A new enhanced technique for link farm detection
D Saraswathi ... R Kavitha
-
D Saraswathi, et. al.D Saraswathi ... R Kavitha
01 Mar 2012
01 Mar 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Classification of Malicious Web Pages through a J48 Decision Tree, a Naïve Bayes, a RBF Network and a Random Forest Classifier for WebSpam Detection

Abstract

Talk to us

Similar Papers

More From: International Journal of u- and e- Service, Science and Technology