Deep web performance enhance on search engine

Deepak Kumar,Rajesh Mishra

doi:10.1109/icscti.2015.7489619

Abstract

Due to digital preservation and new generation technology Deep Web increasing faster than Surface Web, it's necessary to public accessible content also retrieving on general search engine. Huge amounts of data like documents, unstructured, distributed, multi-media available on HTML forms or hidden or invisible or difficult to access known as Deep Web has becoming one of the most valuable resources. Surface Web and Deep Web are two types of Web. The traditional or general search engines like — Google, Yahoo, MSN, Bing etc. better crawling Surface Web only whose pages are directly indexed by general search engines. In this paper proposed on Deep Web public access content that hidden data indexing enhance by general search engine crawler. Sitemap, search engine crawler's Robot.txt and meta data (data about data) technique implemented on a particular developed website. Where all documents data store in databases that access by HTML forms. For the result comparing purpose developed similar Deep Web website which has not implements aforesaid technique. Google webmaster tool used for result analysis. Quicker result found on which have technique implemented that is indexed by Google crawler.

Full Text