Fast Access and Retrieval of Big Data Based on Unique Identification

Wenshun Sheng, Shengli Wu, Aiping Xu

doi:10.32604/iasc.2022.022571

Open Access

Abstract

In big data applications, data are usually stored in data files whose file structures, field structures, data types, and field lengths are not uniform. Storing such data in a traditional relational database therefore makes it difficult to meet the requirements of fast storage and access. To solve this problem, we propose a mapping model between the source data file and the target HBase file. The model handles the heterogeneity of the source files and makes the storage conversion general. First, based on the mapping model, we design the “RowKey” generation rules and algorithm. Then, according to the mapping rules between data file fields and HBase table columns, the data in the data file are converted and stored in HBase. Finally, the retrieval keywords are stored in the “RowKey” and used to achieve fast data retrieval by prefix matching or keyword matching. The method has been applied in several projects, which shows that it can convert regular row-store data files to HBase distributed big data storage and that it generalizes well. The method can be widely used in HBase big data storage applications.
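The retrieval idea in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the separator `|`, the key layout (retrieval keywords followed by a unique ID), and the helper names `make_rowkey`, `prefix_scan`, and `keyword_match` are all assumptions made for illustration. The sketch relies only on the fact that HBase keeps rows sorted by row key, so a prefix match can be served as a contiguous range, which a sorted Python list stands in for here.

```python
from bisect import bisect_left

SEP = "|"  # hypothetical separator between RowKey parts


def make_rowkey(keywords, unique_id):
    """Join retrieval keywords and a unique ID into one sortable key (assumed layout)."""
    return SEP.join([*keywords, unique_id])


def prefix_scan(sorted_keys, prefix):
    """Return keys that start with `prefix`, walking the sorted order like a range scan."""
    start = bisect_left(sorted_keys, prefix)
    matches = []
    for key in sorted_keys[start:]:
        if not key.startswith(prefix):
            break  # keys are sorted, so no later key can match
        matches.append(key)
    return matches


def keyword_match(keys, keyword):
    """Return keys in which any RowKey part equals `keyword` (checks every key)."""
    return [k for k in keys if keyword in k.split(SEP)]


# Sorted list standing in for HBase's ordered row keys (made-up sample values).
keys = sorted([
    make_rowkey(["sensorA", "20210101"], "000001"),
    make_rowkey(["sensorA", "20210102"], "000002"),
    make_rowkey(["sensorB", "20210101"], "000003"),
])

print(prefix_scan(keys, "sensorA" + SEP))
print(keyword_match(keys, "20210101"))
```

In this sketch the prefix match touches only the matching range, while the keyword match has to inspect every key, which is why the order of the keywords inside the key matters for retrieval speed.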

Highlights

In big data applications, massive data are stored in files by rows [1,2].
The method has been applied in several projects, which shows that it can convert regular row-store data files to HBase distributed big data storage and that it generalizes well.
This paper studies how to convert data files stored by row into the HBase distributed database, and how to retrieve and access the resulting big data in HBase quickly.

Summary

Introduction

Massive data are stored in files by rows [1,2]. With the continuing development and application of distributed database technology, converting these data files into distributed storage can provide a more convenient application environment [3,4]. Reference [20] describes a storage and retrieval method based on associated multi-attributes of massive data, which solves the secondary index problem for multi-condition queries on HBase dynamic properties [21]. To address the problem described above, this paper studies how to convert data files stored by row into the HBase distributed database, and how to retrieve and access the resulting big data in HBase quickly. The work belongs to the field of big data storage. Its goal is a general-purpose tool for the distributed storage, conversion, and retrieval of big data.
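As a rough illustration of converting a row-store file into HBase-style rows, the sketch below maps each source field to a `family:qualifier` column and builds a row key from selected fields plus a line number. The field names, the mapping table `FIELD_TO_COLUMN`, the key fields in `ROWKEY_FIELDS`, and the CSV input format are hypothetical; the paper's actual mapping model and generation rules are not reproduced here, and no HBase client is called.

```python
import csv
import io

# Hypothetical mapping: source field name -> (column family, qualifier).
FIELD_TO_COLUMN = {
    "name": ("info", "name"),
    "value": ("data", "value"),
}
# Hypothetical fields whose values become the leading parts of the row key.
ROWKEY_FIELDS = ["station", "date"]


def row_to_put(record, line_no):
    """Turn one source record into (rowkey, {"family:qualifier": value})."""
    keywords = [record[f] for f in ROWKEY_FIELDS]
    # Zero-padded line number keeps keys unique and sortable (an assumption).
    rowkey = "|".join(keywords + [f"{line_no:08d}"])
    columns = {
        f"{family}:{qualifier}": record[field]
        for field, (family, qualifier) in FIELD_TO_COLUMN.items()
    }
    return rowkey, columns


def convert(text):
    """Convert delimited text with a header row into a list of put-like tuples."""
    reader = csv.DictReader(io.StringIO(text))
    return [row_to_put(rec, n) for n, rec in enumerate(reader, start=1)]


sample = "station,date,name,value\nS01,20210101,temp,21.5\nS02,20210101,temp,19.0\n"
puts = convert(sample)
for rowkey, columns in puts:
    print(rowkey, columns)
```

Keeping the mapping in a plain table rather than in code is one way to reuse the same converter for files with different field structures; only the table and the key fields would change.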

The Specific Issue to Solve of This Research

The Expression of “Rowkey” and the Generating Algorithm of “Rowkey”

Objective

Test and Analysis of Big Data Retrieval

Conversion in Different Types of Data File

Conclusion

Journal: Intelligent Automation & Soft Computing
Publication Date: Jan 1, 2022
Citations: 1
License type: cc-by
