Abstract

The volume of scientific data has been increasing dramatically over the past decades. Due to its large data size, it becomes rather hard to efficiently organize the archived scientific data so as to provide fast data access capability. The indexing techniques, especially the multidimensional indexing techniques which can handle complex data, are heavily demanded and thus have been playing a critical role in many modern scientific applications. Bitmap index is a type of technique which aims to efficiently organize multidimensional data objects for fast data access. Modern scientific data often involve massive data which are stored in a distributed cluster, while most of the existing bitmap indexing approaches are designed for single server. Therefore, there are still many technical challenges in big scientific data based on bitmap indexing. In this paper, aiming at accelerating the bitmap index construction, we explore a methodology to efficiently build the representative bitmap technique, e.g., FastBit, in a distributed environment. A series of evaluations have been conducted over large real astronomical data sets. Those evaluations verify that our approach outperformed the conventional FastBit index construction in both small and large data sizes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.