Abstract

Traditional data warehousing approaches must adapt to new needs and data structures. In this context, NoSQL technology is steadily gaining ground in both research and industry. This paper proposes an approach for building a NoSQL document-oriented warehouse (DocW). The approach comprises two methods: 1) Document Warehouse Builder and 2) NoSQL-Converter. The first generates the DocW schema as a galaxy model, whereas the second translates the generated galaxy into a document-oriented NoSQL model. This translation relies on two types of rules: structure rules and hierarchical rules. Furthermore, to help users understand the textual results of analytical queries on the NoSQL-DocW, the paper defines two semantic operators, S-Drill-Up and S-Drill-Down, to aggregate or expand the terms of a query. The implementation uses MongoDB and Talend. The experiments use the medical collection Clef-2007 and two metrics, write request latency and read request latency, to evaluate the loading time and the query response time, respectively.
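As a rough illustration of what aggregating and expanding query terms could look like, the following minimal Python sketch rolls term counts up a small term hierarchy and expands a term into its children. It is not the paper's definition of S-Drill-Up and S-Drill-Down: the hierarchy `PARENT`, the function names `s_drill_up` and `s_drill_down`, and the assumption that drill-up aggregates while drill-down expands are all hypothetical choices made for this example.

```python
from collections import defaultdict

# Hypothetical term hierarchy (child -> parent); illustrative only,
# not taken from the paper or from the Clef-2007 collection.
PARENT = {
    "bronchitis": "respiratory disease",
    "pneumonia": "respiratory disease",
    "respiratory disease": "disease",
    "gastritis": "digestive disease",
    "digestive disease": "disease",
}


def s_drill_up(term_counts, parent=PARENT):
    """Aggregate term counts one level up the hierarchy.

    Terms without a known parent are kept unchanged.
    """
    rolled = defaultdict(int)
    for term, count in term_counts.items():
        rolled[parent.get(term, term)] += count
    return dict(rolled)


def s_drill_down(term, parent=PARENT):
    """Expand a term into its direct children, or itself if it is a leaf."""
    children = sorted(c for c, p in parent.items() if p == term)
    return children or [term]


if __name__ == "__main__":
    counts = {"bronchitis": 3, "pneumonia": 2, "gastritis": 1}
    print(s_drill_up(counts))
    print(s_drill_down("respiratory disease"))
```

In this sketch the hierarchy is a plain dictionary; a real system would more plausibly draw it from a domain ontology or thesaurus, which is again an assumption rather than something stated in the abstract.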

Highlights

  • Documents contain valuable information and embody knowledge relevant to decision-making processes

  • This paper proposes an approach for building a NoSQL document-oriented warehouse (DocW)

  • A Document Warehouse (DocW) is modeled as a Star schema (Tseng et al., 2006; Ben Mefteh et al., 2016) or as a Galaxy (Ben Messaoud et al., 2015; Feki et al., 2013; Pujolle et al., 2011), which is a variant of the Star schema


Keywords

Document Warehouse, Galaxy Model, Hierarchical Rules, MongoDB, NoSQL, Semantic OLAP, Structure Rules, Transformation Rules
