An Efficient Mechanism for Deep Web Data Extraction Based on Tree-Structured Web Pattern Matching

B Bazeer Ahamed, Olfat M Mirza, D Yuvaraj, Aisha Alsobhi, S Shitharth, Ayman Yafoz

doi:10.1155/2022/6335201

Abstract

The World Wide Web comprises huge web databases whose data are searched through web query interfaces. Generally, the World Wide Web maintains a set of databases that store many data records, and the web query interface extracts distinct records in response to user requests. The information held in these web databases is hidden (the deep web) and is retrieved through such queries, even on dynamically scripted pages. Web pages today expose a large amount of structured data, which many recent web applications need. The challenge lies in extracting complicated structured data from deep web pages. Deep web content is generally accessed through web queries, but extracting the structured data from the underlying web database is a complex problem. Moreover, using the retrieved information in combined structures requires significant effort. Few techniques have been established to address the complexity of extracting deep web data from varied web pages. Although several approaches to deep web data extraction have been proposed, very few studies address template-related issues at the page level. For effective web data extraction over a large number of online pages, a unique representation of page generation using tree-based pattern matches (TBPM) is proposed. The performance of the proposed TBPM technique is compared with that of existing techniques in terms of relativity, precision, recall, and time consumption. TBPM achieves about 17-26% higher relativity than the FiVaTech approach.
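
The abstract does not spell out the TBPM algorithm, so the following is only a minimal sketch of the general idea of matching tree-structured page fragments. It uses the classic Simple Tree Matching recurrence as a stand-in, not the authors' method. The `Node` class, the function names, and the normalized similarity score are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    """Hypothetical DOM-like node: a tag name plus ordered children."""
    tag: str
    children: List["Node"] = field(default_factory=list)


def size(node: Node) -> int:
    """Number of nodes in the subtree rooted at `node`."""
    return 1 + sum(size(child) for child in node.children)


def simple_tree_match(a: Node, b: Node) -> int:
    """Size of the largest order-preserving node matching between two trees.

    This is Simple Tree Matching, used here as a stand-in for tree-based
    pattern matching; it is NOT the TBPM algorithm from the paper.
    """
    if a.tag != b.tag:
        return 0
    m, n = len(a.children), len(b.children)
    # dp[i][j]: best matching between the first i children of a
    # and the first j children of b.
    dp = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            dp[i][j] = max(
                dp[i - 1][j],
                dp[i][j - 1],
                dp[i - 1][j - 1]
                + simple_tree_match(a.children[i - 1], b.children[j - 1]),
            )
    return dp[m][n] + 1  # +1 for the matched roots


def similarity(a: Node, b: Node) -> float:
    """Assumed normalization: matched nodes relative to both tree sizes."""
    return 2 * simple_tree_match(a, b) / (size(a) + size(b))


# Two hypothetical data records that share a template but differ by one node.
record_1 = Node("li", [Node("a"), Node("span")])
record_2 = Node("li", [Node("a"), Node("span"), Node("img")])
print(simple_tree_match(record_1, record_2))
print(round(similarity(record_1, record_2), 3))
```

Records generated from the same page template tend to score close to 1 under a measure like this, which is one way a template could be detected at the page level. The threshold for treating two subtrees as the same template would have to be chosen empirically.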


Journal: Wireless Communications and Mobile Computing	Publication Date: May 27, 2022
Citations: 3	License type: CC BY 4.0

Similar Papers

Visual Architecture based Web Information Extraction
Oswalt Manoj S
Bonfring International Journal of Data Mining | VOL. 1
30 Dec 2011

ViDE: A Vision-Based Approach for Deep Web Data Extraction
Wei Liu ... Xiaofeng Meng
IEEE Transactions on Knowledge and Data Engineering | VOL. 22
01 Mar 2010

I-ViDE: An Improved Vision-Based Approach for Deep Web Data Extraction
Mrudula Varade ... Vimla Jethani
IOSR Journal of Computer Engineering | VOL. 16
01 Jan 2014

Automatic discovery of Web Query Interfaces using machine learning techniques
Heidy M. Marin-Castro ... Jose F. Martinez-Trinidad
Journal of Intelligent Information Systems | VOL. 40
23 Aug 2012
