An Index Scheme for Similarity Search on Cloud Computing using MapReduce over Docker Container

Dt-Tri Nguyen,Chan Ho Yong,Huu-Quoc Nguyen,Ton Thi Kim Loan,Xuan-Qui Pham,Eui-Nam Huh

doi:10.1145/2857546.2857607

Abstract

We consider the problem of similarity search over the large datasets in the distributed environment. The proposed framework employs the Vp-Tree algorithm that integrated on top of the MapReduce framework to achieve good performance as well as meet the scalability and fault tolerance requirements for the system while data scale up. Since VP-Tree algorithm was implemented initially for partition and searching data in the local disk access, we proposed a new approach to using it in the parallel environment. The key point of the Vp-Tree algorithm is that it distributed the similar data points into groups, thereby reducing number of data need to scan during the searching stage. Consequently, the response time of the entire system has been improved. Otherwise, we used an open source computer vision library OpenCV for detect the similarity among images in the dataset. We evaluate the performance of our proposed framework using a synthetic data to show the positive of our approach. The experiment shows that our proposed framework achieves 57% improvement in response time in comparison with running searching job in tradition Hadoop framework. We also compared our application running time on Docker container against VM-based environment. The result points out that deploy our system over Docker container provide higher performance than VM-based environment in term of response time.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An Index Scheme for Similarity Search on Cloud Computing using MapReduce over Docker Container

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Hill Climb Game Play with Webcam Using OpenCV
Chandan Kumar
International Journal for Research in Applied Science and Engineering Technology | VOL. 10
Chandan KumarChandan Kumar
31 Jan 2022
International Journal for Research in Applied Science and Engineering Technology | VOL. 10

Methodology of Visualization of non-Isothermal Mixing Processes Under the Influence of External Dynamic Forces
Alexandr Alexandrovich Sataev ... Vyacheslav Viktorovich Andreev
-
Alexandr Alexandrovich Sataev, et. al.Alexandr Alexandrovich Sataev ... Vyacheslav Viktorovich Andreev
01 Jan 2021
01 Jan 2021

Design and implementation of high resolution face image acquisition system under low illumination based on the open source computer vision library
Min Luo ... Hui Li
-
Min Luo, et. al. Min Luo ... Hui Li
01 Jun 2017
01 Jun 2017

Injury-Related Reductions in Skilled Visuomotor Learning Revealed by Single Trial Analysis and Response Time Variability
Courtenay Dunn-Lewis ... Jeff S Volek
Medicine & Science in Sports & Exercise | VOL. 49
Courtenay Dunn-Lewis, et. al.Courtenay Dunn-Lewis ... Jeff S Volek
01 May 2017
Medicine & Science in Sports & Exercise | VOL. 49

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Index Scheme for Similarity Search on Cloud Computing using MapReduce over Docker Container

Abstract

Talk to us

Similar Papers