A third-party replication service for dynamic hidden databases

Stefan Hintzen,Yves Liesy,Christian Zirpins

doi:10.1007/s11761-020-00313-x

Abstract

Much data on the web is available in hidden databases. Users browse their contents by sending search queries to form-based interfaces or APIs. Yet, hidden databases just return the top-k result entries and limit the number of queries per time interval. Such access restrictions constrict those tasks that require many/specific queries or need to access many/all data entries. For a temporary solution, an unrestricted local snapshot can be created by crawling the hidden database. Yet, keeping the snapshot permanently consistent is challenging due to the access restrictions of its origin. In this paper, we propose a replication approach providing permanent unrestricted access to the local copy of a hidden database with dynamic changes. To this end, we present an algorithm to effectively crawl hidden databases that outperforms the state of the art. Furthermore, we propose a new way to continuously control the consistency of the replicated database in an efficient manner. We also introduce the cloud-based architecture of a replication service for hidden databases. We show the effectiveness of the approach through a variety of reproducible experimental evaluations.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A third-party replication service for dynamic hidden databases

Abstract

Talk to us

Similar Papers

More From: Service Oriented Computing and Applications

Lead the way for us

Journal: Service Oriented Computing and Applications	Publication Date: Jan 8, 2021
License type: open-access

Similar Papers

Partitioning mechanism based on dynamic Allocation of Data entries for chip multiprocessors
Yan Pei-Xiang ... Yang Xian-Ju
-
Yan Pei-Xiang, et. al. Yan Pei-Xiang ... Yang Xian-Ju
01 Nov 2010
01 Nov 2010

Interpretable Learning and Pattern Mining: Scalable Algorithms and Data-Driven Applications

-

10 Jul 2020
10 Jul 2020

Towards an autonomic Service Oriented Architecture in computational engineering framework
M Agni Catur Bhakti ... Azween B Abdullah
-
M Agni Catur Bhakti, et. al.M Agni Catur Bhakti ... Azween B Abdullah
01 May 2010
01 May 2010

Migrating Legacy Systems to Web Services Architecture
Shing-Han Li ... David C Yen
-
Shing-Han Li, et. al.Shing-Han Li ... David C Yen
01 Jan 2010
01 Jan 2010

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A third-party replication service for dynamic hidden databases

Abstract

Talk to us

Similar Papers

More From: Service Oriented Computing and Applications