Abstract
In this paper we present the design of a modern course in cluster computing and large-scale data processing. The defining differences between this and previously published designs are its focus on processing very large data sets and its use of Hadoop, an open source Java-based implementation of MapReduce and the Google File System, as the platform for programming exercises. Hadoop proved to be a key element in successfully implementing structured lab activities and independent design projects. Through this course, offered at the University of Washington in 2007, we imparted new skills to our students, improving their ability to design systems capable of solving web-scale problems.
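To give a sense of the kind of programming exercise the abstract refers to, the sketch below shows the canonical word-count job in Hadoop MapReduce: the mapper emits (word, 1) pairs and the reducer sums the counts for each word. This example is not taken from the paper; it is a minimal illustration using the org.apache.hadoop.mapreduce API, which postdates the 2007 course (the course would have used Hadoop's original org.apache.hadoop.mapred API), and the class names are our own.

import java.io.IOException;
import java.util.StringTokenizer;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

// Illustrative word-count job, not from the paper itself.
public class WordCount {

  public static class TokenizerMapper
      extends Mapper<Object, Text, Text, IntWritable> {
    private static final IntWritable ONE = new IntWritable(1);
    private final Text word = new Text();

    @Override
    public void map(Object key, Text value, Context context)
        throws IOException, InterruptedException {
      // Emit (word, 1) for every token in the input line.
      StringTokenizer itr = new StringTokenizer(value.toString());
      while (itr.hasMoreTokens()) {
        word.set(itr.nextToken());
        context.write(word, ONE);
      }
    }
  }

  public static class IntSumReducer
      extends Reducer<Text, IntWritable, Text, IntWritable> {
    private final IntWritable result = new IntWritable();

    @Override
    public void reduce(Text key, Iterable<IntWritable> values, Context context)
        throws IOException, InterruptedException {
      // Sum the counts emitted for this word across all mappers.
      int sum = 0;
      for (IntWritable val : values) {
        sum += val.get();
      }
      result.set(sum);
      context.write(key, result);
    }
  }

  public static void main(String[] args) throws Exception {
    Job job = Job.getInstance(new Configuration(), "word count");
    job.setJarByClass(WordCount.class);
    job.setMapperClass(TokenizerMapper.class);
    job.setCombinerClass(IntSumReducer.class); // local pre-aggregation
    job.setReducerClass(IntSumReducer.class);
    job.setOutputKeyClass(Text.class);
    job.setOutputValueClass(IntWritable.class);
    FileInputFormat.addInputPath(job, new Path(args[0]));
    FileOutputFormat.setOutputPath(job, new Path(args[1]));
    System.exit(job.waitForCompletion(true) ? 0 : 1);
  }
}

Hadoop hides the cluster-level concerns (data distribution, scheduling, fault tolerance) behind this two-function interface, which is what made structured lab assignments on very large data sets feasible in a single course.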