Data Preloading and Data Placement for MapReduce Performance Improving

Anton Spivak,Denis Nasonov

doi:10.1016/j.procs.2016.11.044

Data Preloading and Data Placement for MapReduce Performance Improving

Anton Spivak, Denis Nasonov

Open Access

https://doi.org/10.1016/j.procs.2016.11.044

Copy DOI

Journal: Procedia Computer Science	Publication Date: Jan 1, 2016
Citations: 11	License type: cc-by-nc-nd

Affiliation: ITMO University

#MapReduce Tasks #RAM Capacity + Show 8 more

Abstract
Full-Text
Similar Papers

Abstract

The usage of Hadoop cluster is widely spread in different business and academic spheres. The performance of Hadoop depends on various factors, such as amount and frequency of CPU cores, RAM capacity, throughput of storages, dataflow's intensity, network bandwidth and latency, etc. The heterogeneity of a computing environment raises such problems as the optimization of data distribution across computing and storage resources of Hadoop cluster. In this paper, we propose an approach for the improvement of data placement and suggest an implementation of presented algorithm in Hadoop platform. Proposed method uses HDFS distributed cache to enhance a performance of task's execution. As a result, the introduced algorithm leads to the reduction of overall MapReduce tasks’ execution time and increasing of I/O rates during the map stage.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Procedia Computer Science

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.