Abstract

Many entrepreneurship applications use data as the core concept of their business to better understand the needs of their customers. However, as the databases used by these applications grow and as more users access data through various interactive interfaces, obtaining the result of a top-k query may take a long time if the query matches millions of tuples in the database. Traditionally, layer-based indexing methods have been the representative approach to processing top-k queries efficiently. These methods organize tuples into a list of layers in which the ith layer holds the tuples that can be the top-i answer, so top-k answers can be obtained by accessing at most k layers. Most of these methods achieve high query-answer accuracy at the expense of longer index construction time. However, the balance between accuracy and index construction time can be adjusted to achieve optimal performance. In this paper, we therefore propose a method, called the adaptive convex skyline (AdaptCS), for efficient processing of top-k queries in entrepreneurship applications. AdaptCS first prunes the data with a virtual threshold point and finds the skyline points over the pruned data; adjusting the virtual threshold is what allows performance to be tuned. Then, AdaptCS divides the skyline into m subregions with a projection partitioning method and constructs the convex hull m times, once per subregion, using virtual objects. Lastly, AdaptCS combines the objects obtained by computing the convex hulls. The experimental results show that the proposed method outperforms the existing methods.
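The layer property described above (top-k answers lie within the first k layers) can be illustrated with a minimal sketch. It uses plain skyline layers rather than the AdaptCS index, and it assumes that smaller attribute values are better and that the scoring function is a linear sum with non-negative weights. The function names `dominates`, `skyline`, `build_layers`, and `top_k` are hypothetical and chosen only for this illustration.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def dominates(a: Point, b: Point) -> bool:
    """a dominates b if a is no worse on every axis and strictly better on one."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def skyline(points: Sequence[Point]) -> List[Point]:
    """Naive O(n^2) skyline: keep the points that no other point dominates."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


def build_layers(points: Sequence[Point]) -> List[List[Point]]:
    """Peel off skylines repeatedly; layer i is the skyline of what remains."""
    remaining = list(dict.fromkeys(points))  # drop duplicates, keep order
    layers: List[List[Point]] = []
    while remaining:
        layer = skyline(remaining)
        layers.append(layer)
        layer_set = set(layer)
        remaining = [p for p in remaining if p not in layer_set]
    return layers


def top_k(layers: List[List[Point]], weights: Sequence[float], k: int) -> List[Point]:
    """Answer a top-k query (lowest weighted sum) by reading only the first k layers."""
    candidates = [p for layer in layers[:k] for p in layer]
    return sorted(candidates, key=lambda p: sum(w * x for w, x in zip(weights, p)))[:k]
```

Because every tuple in layer i+1 is dominated by some tuple in layer i, a tuple beyond layer k always has at least k tuples that score no worse under a monotone scoring function, which is why `top_k` never needs to look further than `layers[:k]`.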

Highlights

  • Entrepreneurship is considered to be an ability to establish new companies using new concepts

  • The intersection of the minimum values on the ith axis is defined as a virtual threshold object and is used for pruning data that fall outside its boundaries

  • We have proposed the adaptive convex skyline (AdaptCS), a threshold-based, projection-partitioned, layer-based index for efficient processing of top-k queries
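The effect of threshold-based pruning on the skyline can be sketched as follows. This is an illustration under assumptions, not the AdaptCS procedure: the threshold point is supplied by the caller rather than derived from the data, "outside its boundaries" is read as "exceeding the threshold on at least one axis", and smaller values are assumed to be better. The names `prune` and `skyline` are hypothetical.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def prune(points: Sequence[Point], threshold: Point) -> List[Point]:
    """Keep only points inside the box bounded by the threshold on every axis.

    Assumption: a point is 'outside' if any coordinate exceeds the threshold.
    """
    return [p for p in points if all(x <= t for x, t in zip(p, threshold))]


def skyline(points: Sequence[Point]) -> List[Point]:
    """Naive O(n^2) skyline under minimization."""
    def dominates(a: Point, b: Point) -> bool:
        return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

    return [p for p in points if not any(dominates(q, p) for q in points)]


def pruned_skyline(points: Sequence[Point], threshold: Point) -> List[Point]:
    """Skyline computed only over the points that survive pruning."""
    return skyline(prune(points, threshold))
```

With a loose threshold the pruned skyline matches the full skyline while fewer points are compared; a tighter threshold discards more points, which reduces work further but can drop genuine skyline points. That is the accuracy versus construction-time trade-off in miniature.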


Summary

Introduction

Entrepreneurship is considered to be the ability to establish new companies using new concepts. Healthcare, one of the well-known entrepreneurship application areas, generates huge amounts of data, which often come from electronic health records (EHRs) in the form of medical history, laboratory test results, and general information such as age and weight. When this data is large and high-dimensional, obtaining the result of a query may take a long time if the query matches millions of tuples in the database. We propose a method, called the adaptive convex skyline (AdaptCS), for efficient processing of top-k queries in entrepreneurship applications. AdaptCS divides the skyline into m subregions with a projection partitioning method and constructs the convex hull m times, once per subregion, using virtual objects. We compare the index construction time and precision of AdaptCS with those of existing methods that use the approximate convex skyline, the convex skyline, and the skyline.
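To make the difference between the skyline and the convex skyline concrete, here is a minimal two-dimensional sketch. It assumes minimization and linear scoring functions with positive weights, and it does not include the projection partitioning or the virtual objects that AdaptCS uses. The function name `convex_skyline_2d` is hypothetical.

```python
from typing import List, Sequence, Tuple

Point2D = Tuple[float, float]


def cross(o: Point2D, a: Point2D, b: Point2D) -> float:
    """Cross product of vectors o->a and o->b; positive means a left turn."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def convex_skyline_2d(points: Sequence[Point2D]) -> List[Point2D]:
    """Return the lower-left convex hull vertices of a 2D point set.

    These are the points that can be the top-1 answer for some linear
    scoring function with positive weights, assuming smaller is better.
    """
    pts = sorted(set(points))
    if not pts:
        return []
    # Lower hull via Andrew's monotone chain, scanning left to right.
    hull: List[Point2D] = []
    for p in pts:
        while len(hull) >= 2 and cross(hull[-2], hull[-1], p) <= 0:
            hull.pop()
        hull.append(p)
    # Keep only the descending part: once y stops decreasing, the
    # remaining hull vertices are dominated by the lowest vertex.
    result = [hull[0]]
    for p in hull[1:]:
        if p[1] < result[-1][1]:
            result.append(p)
        else:
            break
    return result
```

For the points (1, 9), (2, 4), (4, 3), and (7, 1), all four are skyline points, but (4, 3) lies above the segment joining (2, 4) and (7, 1), so no positively weighted linear score can rank it first and it is excluded from the convex skyline. This is why a convex-hull-based layer is typically smaller than a skyline layer.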

Related work
Proposed method
Skyline step
Partition step
Combine step
Experimental environment
Experimental results
Conclusion