Abstract
Search interfaces are only the mode to retrieve the information on the web, which is becoming enormous day by day. One has to key in the set of keywords in search form to retrieve pages from the website. These hidden web pages do not have static links[1], hence, search engines are unable to get and index such pages and thus no result is returned. As per recent studies many of the deep webs are of tremendous high quality, which can be extremely useful to the users. These pages are often referred to as the Hidden Web or the Deep Web. A powerful web crawler can discover and download hidden web pages. The main challenge that a hidden web crawler has to face is to make meaningful queries to issue to the site as the only entry point to the hidden website is query interface. Here a theoretical framework to investigate the query generation problem is provided for the hidden web and effective policies for generating queries automatically studies.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have