Notice of Violation of IEEE Publication Principles: The Anatomy of a Large-Scale Hyper Textual Web Search Engine

Umesh Sehgal,Kuljeet Kaur,Pawan Kumar

doi:10.1109/iccee.2009.59

Abstract

In this paper, we present Google, a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext. Google is designed to crawl and index the Web efficiently and produce much more satisfying search results than existing systems. To engineer a search engine is a challenging task. Search engines index tens to hundreds of millions of web pages involving a comparable number of distinct terms. They answer tens of millions of queries every day. Despite the importance of large-scale search engines on the web, very little academic research has been done on them. Furthermore, due to rapid advance in technology and web proliferation, creating a web search engine today is very different from three years ago. This paper provides an in-depth description of our large-scale web search engine to define the values and traditional techniques of data in hypertext. Apart from the problems of scaling traditional search techniques to data of this magnitude, there are new technical challenges involved with using the additional information present in hypertext to produce better search results. This paper addresses this question of how to build a practical large-scale system which can exploit the additional information present in hypertext. Also we look at the problem of how to effectively deal with uncontrolled hypertext collections where anyone can publish anything they want.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Notice of Violation of IEEE Publication Principles: The Anatomy of a Large-Scale Hyper Textual Web Search Engine

Abstract

Talk to us

Similar Papers

More From: 2009 Second International Conference on Computer and Electrical Engineering

Lead the way for us

Journal: 2009 Second International Conference on Computer and Electrical Engineering	Publication Date: Dec 1, 2009
Citations: 131

Similar Papers

Reprint of: The anatomy of a large-scale hypertextual web search engine
Sergey Brin ... Lawrence Page
Computer Networks | VOL. 56
Sergey Brin, et. al.Sergey Brin ... Lawrence Page
23 Oct 2012
Computer Networks | VOL. 56

The anatomy of a large-scale hypertextual Web search engine
Sergey Brin ... Lawrence Page
Computer Networks and ISDN Systems | VOL. 30
Sergey Brin, et. al.Sergey Brin ... Lawrence Page
01 Apr 1998
Computer Networks and ISDN Systems | VOL. 30

Mining query logs to optimize index partitioning in parallel web search engines
...
-
, et. al. ...
06 Jun 2007
06 Jun 2007

Scalability and Efficiency Challenges in Large-Scale Web Search Engines
B Barla Cambazoglu ... Ricardo Baeza-Yates
-
B Barla Cambazoglu, et. al.B Barla Cambazoglu ... Ricardo Baeza-Yates
02 Feb 2015
02 Feb 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Notice of Violation of IEEE Publication Principles: The Anatomy of a Large-Scale Hyper Textual Web Search Engine

Abstract

Talk to us

Similar Papers

More From: 2009 Second International Conference on Computer and Electrical Engineering