A Query based Approach to Reduce the Web Crawler Traffic using HTTP Get Request and Dynamic Web Page

Anurag Jain,Dr A.K Sachan,Shekhar Mishra

doi:10.5120/1826-2406

A Query based Approach to Reduce the Web Crawler Traffic using HTTP Get Request and Dynamic Web Page

Anurag Jain, Dr A.K Sachan + Show 1 more

Open Access

PDF Available

https://doi.org/10.5120/1826-2406

Copy DOI

Export

Save

Cite

Journal: International Journal of Computer Applications	Publication Date: Jan 12, 2011
Citations: 5

#Web Crawler #HTTP GET Request #Dynamic Web Page #HTTP GET #GET Request #Web Page #List Of Updates #Web Site #Web Page Request #Dynamic HTTP

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

The functions of Web crawler download information from web for search engine. Web pages changed without any notice. Web crawler has to revisit web site to download updated and new web pages. It is estimated 40% of current web traffic is generated by web crawler. This paper proposes query based approach to inform updates on web site to web crawler using Dynamic web page and HTTP GET Request. Dynamic web page generates HTML based response having list of updates on web site after crawler last visit. Web crawler only visits updated web pages instead of visiting full web sites for updates. Proposed scheme is tested & results show that it is very promising.

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: International Journal of Computer Applications

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.

R Discovery Prime

A Query based Approach to Reduce the Web Crawler Traffic using HTTP Get Request and Dynamic Web Page