A Novel Approach on Focused Crawling With Anchor Text

S Subatra Devi

doi:10.51983/ajcst-2018.7.1.1849

Abstract

A novel approach with focused crawling for various anchor texts is discussed in this paper. Most of the search engines search the web with the anchor text to retrieve the relevant pages and answer the queries given by the users. The crawler usually searches the web pages and filters the unnecessary pages which can be done through focused crawling. A focused crawler generates its boundary to crawl the relevant pages based on the link and ignores the irrelevant pages on the web. In this paper, an effective focused crawling method is implemented to improve the quality of the search. Here, three learning phases are considered namely, content-based, link-based and sibling-based learning are undergone to improve the navigation of the search. In this approach, the crawler crawls through the relevant pages efficiently and more relevant pages are retrieved in an effective way. It is proved experimentally that more number of relevant pages are retrieved for different anchor texts with three learning phases using focused crawling.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Novel Approach on Focused Crawling With Anchor Text

Abstract

Talk to us

Similar Papers

More From: Asian Journal of Computer Science and Technology

Lead the way for us

Similar Papers

A Novel Approach on Focused Crawling with Anchor Text
S Subatra Devi
American Journal of Computer Science and Information Technology | VOL. 05
S Subatra DeviS Subatra Devi
01 Jan 2017
American Journal of Computer Science and Information Technology | VOL. 05

An Advanced Approach on Focused Crawling with Anchor Text
S Subatra Devi
-
S Subatra DeviS Subatra Devi
29 Jun 2021
29 Jun 2021

Automatic Recovery of Broken Links Using Information Retrieval Techniques
Shoaib Hayat ... Muhammad Riaz
-
Shoaib Hayat, et. al.Shoaib Hayat ... Muhammad Riaz
07 Sep 2018
07 Sep 2018

Web Searching With Logarithmic and Probability Measure
P Sheik Abdul Khader ... S Subatradevi
International Journal of Computer Applications | VOL. 64
P Sheik Abdul Khader, et. al.P Sheik Abdul Khader ... S Subatradevi
15 Feb 2013
International Journal of Computer Applications | VOL. 64

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Novel Approach on Focused Crawling With Anchor Text

Abstract

Talk to us

Similar Papers

More From: Asian Journal of Computer Science and Technology