Abstract
The traditional data warehousing approaches should adapt to take into consideration novel needs and data structures. In this context, NoSQL technology is progressively gaining a place in the research and industry domains. This paper proposes an approach for building a NoSQL document-oriented warehouse (DocW). This approach has two methods, namely 1) document warehouse builder and 2) NoSQL-Converter. The first method generates the DocW schema as a galaxy model whereas the second one translates the generated galaxy into a document-oriented NoSQL model. This relies on two types of rules: structure and hierarchical rules. Furthermore, in order to help understanding the textual results of analytical queries on the NoSQL-DocW, the authors define two semantic operators S-Drill-Up and S-Drill-Down to aggregate/expand the terms of query. The implementation of our proposals uses MangoDB and Talend. The experiment uses the medical collection Clef-2007 and two metrics called write request latency and read request latency to evaluate respectively the loading time and the response time to queries.
Highlights
Documents contain valued information and incarnate pertinent knowledge for decisional processes
This paper proposes an approach for building a NoSQL document-oriented warehouse (DocW)
A Document Warehouse (DocW) is modeled as a Star schema (Tseng et al, 2006; Ben Mefteh et al, 2016), or as a Galaxy (Ben Messaoud et al, 2015; Feki et al, 2013; Pujolle et al, 2011) that is a variant of the Star schema
Summary
Documents contain valued information and incarnate pertinent knowledge for decisional processes. This paper proposes an approach for building a NoSQL document-oriented warehouse (DocW). Keywords Document Warehouse, Galaxy Model, Hierarchical Rules, MangoDB, NoSQL, Semantic OLAP, Structure Rules, Transformation Rules
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: International Journal of Operations Research and Information Systems
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.