Enabling Massive XML-Based Biological Data Management in HBase.

Jian Liu, Qiuru Liu, Yongzhuang Liu, Shuhui Su, Lei Zhang

doi:10.1109/tcbb.2019.2915811

Abstract

Publishing biological data in XML formats is attractive for organizations that would like to provide their bioinformatics resources in an extensible and machine-readable format. In the era of big data, the management of massive XML-based biological data has emerged as a challenging issue. As XML-based biological data sets continue to grow, traditional declarative query languages often struggle to provide efficient query capabilities in terms of processing speed and scale. In this study, we report a novel platform to store and query massive XML-based biological data collections. A prototype tool for constructing HBase tables from XML-based biological data collections is first developed, and then a formal approach to transform the XML query model into the MapReduce query model is proposed. Finally, an evaluation of the query performance of the proposed approach on existing XML-based biological databases is presented, showing the performance advantages of the proposed solution. The source code of the massive XML-based biological data management platform is freely available at https://github.com/lyotvincent/X2H.
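
The two steps named in the abstract (building HBase-style tables from XML, then answering an XML query in map/reduce form) can be illustrated with a minimal sketch. This is not the X2H implementation: the schema (row key taken from a record's id attribute, column name taken from the element path), the sample data, and every function name below are assumptions made for illustration, and plain Python stands in for both HBase and Hadoop.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical sample collection; not taken from any real database.
SAMPLE = """<entries>
  <entry id="P1"><name>kinase A</name><organism>human</organism></entry>
  <entry id="P2"><name>kinase B</name><organism>mouse</organism></entry>
  <entry id="P3"><name>ligase C</name><organism>human</organism></entry>
</entries>"""


def _walk(node, path):
    """Yield (element path, text) for every element that carries text."""
    text = (node.text or "").strip()
    if text:
        yield path, text
    for child in node:
        yield from _walk(child, f"{path}/{child.tag}")


def xml_to_cells(xml_text, record_tag="entry", key_attr="id"):
    """Flatten each record into (row_key, column, value) cells.

    Assumed schema: one row per record, one column per element path.
    """
    root = ET.fromstring(xml_text)
    cells = []
    for record in root.iter(record_tag):
        row_key = record.get(key_attr)
        for column, value in _walk(record, record.tag):
            cells.append((row_key, column, value))
    return cells


def map_phase(cells, wanted_columns):
    """Map step: keep only the columns the query touches, keyed by row."""
    for row_key, column, value in cells:
        if column in wanted_columns:
            yield row_key, (column, value)


def reduce_phase(pairs, filter_column, filter_value, project_column):
    """Reduce step: regroup cells per row, apply the predicate, project."""
    grouped = defaultdict(dict)
    for row_key, (column, value) in pairs:
        grouped[row_key][column] = value
    return {
        row_key: cols[project_column]
        for row_key, cols in grouped.items()
        if cols.get(filter_column) == filter_value and project_column in cols
    }


def query(xml_text, filter_column, filter_value, project_column):
    """Answer a path query of the form 'project X where Y equals V'."""
    cells = xml_to_cells(xml_text)
    pairs = map_phase(cells, {filter_column, project_column})
    return reduce_phase(pairs, filter_column, filter_value, project_column)


if __name__ == "__main__":
    print(query(SAMPLE, "entry/organism", "human", "entry/name"))
```

Here the path predicate plays the role of a simple XPath-like filter; in a real deployment the map step would scan table rows in parallel and the grouping would be done by the framework's shuffle rather than by a dictionary.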

Journal: IEEE/ACM Transactions on Computational Biology and Bioinformatics
Publication Date: Nov 1, 2020
Citations: 43

Similar Papers

A Blockchain-Assisted Massive IoT Data Collection Intelligent Framework
Lupeng Zhang ... Fengqi Li
IEEE Internet of Things Journal | VOL. 9
10 Jan 2021

Trauma Outcome Prediction in the Era of Big Data: From Data Collection to Analytics
Shiming Yang ... Peter F Hu
04 Jul 2018

Design of Hadoop-Based Massive Intelligence Data Management System
Guo Lin Zhao ... Zi Yan Shi
Applied Mechanics and Materials | VOL. 608-609
01 Oct 2014

Massive sensor data management framework in Cloud manufacturing based on Hadoop
Yuan Bao ... Yongliang Luo
01 Jul 2012
