An Efficient Technique for Processing Multiple Group-by Queries for Big Data Analysis in MapReduce

Eunju Park, Junho Shim, Sohyun Oh, Ki Yong Lee, Sojeong Park, Hyejin Choi

doi:10.5626/ktcp.2015.21.5.387

Journal: KIISE Transactions on Computing Practices
Publication Date: May 15, 2015
Citations: 1

Keywords: Group-by Query, Data Sets In Parallel

Abstract

MapReduce is a framework for processing large data sets in parallel on a large cluster. A group-by query partitions the input data into groups based on the values of specified attributes and then computes a specified aggregate function for each group. In this paper, we propose an efficient method for processing multiple group-by queries using MapReduce. Instead of computing each group-by query independently, the proposed method computes multiple group-by queries in stages, using one or more MapReduce jobs, in order to reduce the total execution cost. We compared the proposed method with a baseline method that computes each group-by query independently. The comparison showed that the proposed method achieves shorter execution times.
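
The contrast between independent and staged evaluation can be illustrated with a small in-memory simulation. This is a minimal sketch of the general idea only, not the authors' algorithm, whose details the abstract does not give. It assumes a distributive aggregate (SUM), so that coarser groupings can be derived from a finer one. The helper `map_reduce_sum`, the sample rows, and the attribute names `region`, `product`, and `amount` are all hypothetical.

```python
from collections import defaultdict


def map_reduce_sum(records, key_fn, value_fn):
    """Simulate one MapReduce job that computes SUM per group (hypothetical helper)."""
    # Map phase: emit one (group key, value) pair per input record.
    pairs = [(key_fn(r), value_fn(r)) for r in records]
    # Shuffle phase: collect all values that share a key.
    shuffled = defaultdict(list)
    for key, value in pairs:
        shuffled[key].append(value)
    # Reduce phase: apply the aggregate function to each group.
    return {key: sum(values) for key, values in shuffled.items()}


# Made-up sample input, for illustration only.
rows = [
    {"region": "east", "product": "pen", "amount": 3},
    {"region": "east", "product": "ink", "amount": 5},
    {"region": "west", "product": "pen", "amount": 2},
    {"region": "east", "product": "pen", "amount": 4},
]

# Independent evaluation: every query scans the raw input again.
by_region_naive = map_reduce_sum(
    rows, lambda r: (r["region"],), lambda r: r["amount"])
by_product_naive = map_reduce_sum(
    rows, lambda r: (r["product"],), lambda r: r["amount"])

# Staged evaluation, stage 1: one job over the raw input computes the
# finest grouping (region, product).
stage1 = map_reduce_sum(
    rows, lambda r: (r["region"], r["product"]), lambda r: r["amount"])

# Stage 2: the coarser groupings are derived from the (usually much
# smaller) stage 1 output instead of the raw input.
by_region = map_reduce_sum(
    stage1.items(), lambda kv: (kv[0][0],), lambda kv: kv[1])
by_product = map_reduce_sum(
    stage1.items(), lambda kv: (kv[0][1],), lambda kv: kv[1])

print(by_region)
print(by_product)
```

In this sketch both strategies return the same sums; the potential saving comes from reading the raw input once rather than once per query. Non-distributive aggregates such as MEDIAN cannot be derived from partial results this way.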
