Using Visual Clues Concept for Extracting Main Data from Deep Web Pages

Satish J Pusdekar, Shaikh Phiroj Chhaware

doi:10.1109/icesc.2014.39

Abstract

Extracting data from deep Web pages is a challenging problem because of the intricate underlying structure of such pages. Many techniques have been proposed to address this problem, but all of them have inherent limitations because they depend on the programming language of the Web page. The contents of Web pages, however, are always displayed in a regular layout for users to browse. This regularity makes it possible to overcome the limitations of previous work by exploiting common visual features of deep Web pages. In this paper, a vision-based approach that is independent of the Web page programming language is proposed. The approach uses the visual features of deep Web pages to extract data, covering both data record extraction and data item extraction. We also propose a new evaluation measure, revision, to capture the human effort needed to produce an exact extraction. Experiments with our implementation on a large set of Web databases show that the proposed vision-based approach is highly effective for data extraction from deep Web pages.
