Extracting result schema based on query instances in the Deep Web

Tiezheng Nie,Derong Shen,Yue Kou,Ge Yu,Wei Liu

doi:10.1007/s11859-007-0043-7

Abstract

Deep Web sources contain a large of high-quality and query-related structured date. One of the challenges in the Deep Web is extracting result schemas of Deep Web sources. To address this challenge, this paper describes a novel approach that extracts both result data and the result schema of a Web database. The approach first models the query interface of a Deep Web source and fills in it with a specifically query instance. Then the result pages of the Deep Web sources are formatted in the tree structure to retrieve subtrees that contain elements of the query instance. Next, result schema of the Deep Web source is extracted by matching the subtree’ nodes with the query instance, in which, a two-phase schema extraction method is adopted for obtaining more accurate result schema. Finally, experiments on real Deep Web sources show the utility of our approach, which provides a high precision and recall.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Extracting result schema based on query instances in the Deep Web

Abstract

Talk to us

Similar Papers

More From: Wuhan University Journal of Natural Sciences

Lead the way for us

Journal: Wuhan University Journal of Natural Sciences	Publication Date: Sep 1, 2007
Citations: 12

Similar Papers

DWSpyder: a new schema extraction method for a deep web integration system
Yasser Saissi ... Ahmed Zellou
International Journal of Web Engineering and Technology | VOL. 14
Yasser Saissi, et. al.Yasser Saissi ... Ahmed Zellou
01 Jan 2019
International Journal of Web Engineering and Technology | VOL. 14

Extraction of relational schema from deep web sources: a form driven approach
Yasser Saissi ... Ahmed Zellou
-
Yasser Saissi, et. al.Yasser Saissi ... Ahmed Zellou
01 Nov 2014
01 Nov 2014

Quality-based data source selection for web-scale Deep Web data integration
Xue-Feng Xian ... Zhi-Ming Cui
-
Xue-Feng Xian, et. al. Xue-Feng Xian ... Zhi-Ming Cui
01 Jul 2009
01 Jul 2009

Discovering the Deep Web through XML Schema Extraction
Yasser Saissi ... Ali Idri
-
Yasser Saissi, et. al.Yasser Saissi ... Ali Idri
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Extracting result schema based on query instances in the Deep Web

Abstract

Talk to us

Similar Papers

More From: Wuhan University Journal of Natural Sciences