EFFICIENT METHODOLOGIES TO HANDLE HANGING PAGES USING VIRTUAL NODE

Ashutosh Kumar Singh, P Ravi Kumar, Alex Goh Kwang Leng

doi:10.1080/01969722.2011.634679

Abstract

In this article, we first explain the knowledge extraction (KE) process from the World Wide Web (WWW) using search engines. We then explore the PageRank algorithm of the Google search engine (a well-known link-based search engine) along with its hidden Markov analysis. We also examine one of the problems of link-based ranking algorithms: hanging pages, also called dangling pages (pages without any forward links). The presence of these pages affects the ranking of Web pages, yet some hanging pages may contain important information that a search engine cannot neglect during ranking. We propose methodologies to handle hanging pages and compare them. We also introduce the TrustRank algorithm (an algorithm for handling spamming problems in link-based search engines) and incorporate it into our proposed methods so that they can combat Web spam. We implemented the PageRank and TrustRank algorithms and modified them to implement our proposed methodologies.
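The abstract does not spell out how the virtual node is wired into the link graph, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes every hanging page receives a single forward link to one virtual node, that the virtual node links back to every real page, and that the virtual node is dropped (with ranks renormalised) after the power iteration converges. The function name `pagerank_with_virtual_node`, the `__virtual__` label, and the damping factor of 0.85 are illustrative choices.

```python
def pagerank_with_virtual_node(links, damping=0.85, tol=1e-10, max_iter=200):
    """Power-iteration PageRank with hanging pages routed through a virtual node.

    links: dict mapping each page to a list of pages it links to.
    Returns a dict of page -> rank over the real pages only (sums to 1).
    """
    # Collect every page that appears as a source or a target.
    pages = sorted(set(links) | {t for targets in links.values() for t in targets})
    if not pages:
        return {}

    virtual = "__virtual__"  # hypothetical label for the virtual node

    # Deduplicate forward links while keeping their order.
    out = {p: list(dict.fromkeys(links.get(p, []))) for p in pages}

    # Assumption: each hanging page (no forward links) points to the virtual node.
    for p in pages:
        if not out[p]:
            out[p] = [virtual]

    # Assumption: the virtual node links back to every real page.
    out[virtual] = list(pages)

    nodes = pages + [virtual]
    n = len(nodes)
    rank = {p: 1.0 / n for p in nodes}

    for _ in range(max_iter):
        # Teleportation term, shared equally by all nodes.
        new_rank = {p: (1.0 - damping) / n for p in nodes}
        # Each node splits its damped rank evenly among its forward links.
        for p in nodes:
            share = damping * rank[p] / len(out[p])
            for q in out[p]:
                new_rank[q] += share
        delta = sum(abs(new_rank[p] - rank[p]) for p in nodes)
        rank = new_rank
        if delta < tol:
            break

    # Drop the virtual node and renormalise over the real pages.
    total = sum(rank[p] for p in pages)
    return {p: rank[p] / total for p in pages}


if __name__ == "__main__":
    # Page C has no forward links, so it is a hanging page.
    graph = {"A": ["B", "C"], "B": ["C"], "C": []}
    for page, score in pagerank_with_virtual_node(graph).items():
        print(page, round(score, 4))
```

Because every node in the extended graph has at least one forward link, no rank leaks out of the system during iteration, and the hanging page keeps the rank it earns from its incoming links instead of being removed from the computation.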


Journal: Cybernetics and Systems
Publication Date: Nov 1, 2011
Citations: 11

