Abstract

Leveraging Big Data (BD) processing frameworks to process large-scale Resource Description Framework (RDF) datasets holds a great interest in optimizing query performance. Modern BD services are complicated data systems, where tuning the configurations notably affects the performance. Benchmarking different frameworks and configurations provides the community with best practices towards selecting the most suitable configurations. However, most of these benchmarking efforts are classified as descriptive or diagnostic analytics. Moreover, there is no standardization for comparing and contrasting these benchmarks based on quantitative ranking techniques. This paper aims to fill this timely research gap by proposing ranking criteria (called Bench-ranking) that provide prescriptive analytics via ranking functions. In particular, Bench-ranking starts by describing the current state-of-the-art single-dimensional ranking limitations. Next, we discuss the recent benchmarking requirements for sophisticated approaches over multi-dimensional ranking. Finally, we discuss the ranking criteria goodness by reviewing its conformance and coherence metrics. We validate Bench-ranking by conducting an empirical study using large RDF datasets under a relational BD engine, i.e., Apache Spark-SQL. The proposed ranking techniques provide the practitioners with clear insights to make an informed decision, especially with experimental trade-offs for such complex solution space.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.