Parallelism by design

Robert Griesemer

doi:10.1145/1356058.1356089

Abstract

Very large data sets - telephone call records, network logs, high-resolution satellite images, or web document repositories - are not easily analyzed using traditional database techniques. They may be simply too large, grow too fast, or may not fit well in a database schema. They tend to span multiple disks and machines. On the other hand, these large data sets often have a flat and regular structure that permits distributed filtering and aggregation.We present a system and language for such analyses*. Altering phase, in which a query is expressed using the procedural programming language Sawzall, emits data to an aggregation phase. Both phases are distributed over hundreds or even thousands of computers. The language constructs and execution model of Sawzall have been devised to enable parallel execution without the need for complex dependency analysis. Even with our fairly traditional implementation of the Sawzall execution engine we observe nearly perfect scalability as we add more machines.*Joint work with Sean Dorward, Rob Pike, and Sean Quinlan.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Parallelism by design

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Interpreting the Data: Parallel Analysis with Sawzall
Rob Pike ... Sean Dorward
Scientific Programming | VOL. 13
Rob Pike, et. al.Rob Pike ... Sean Dorward
01 Jan 2004
Scientific Programming | VOL. 13

Distributed information gain theoretic feature selector using spark
Bakshi Rohit Prasad ... Unmesh Kishor Bendale
-
Bakshi Rohit Prasad, et. al.Bakshi Rohit Prasad ... Unmesh Kishor Bendale
01 Dec 2016
01 Dec 2016

Stroke Width Transform for Linear Structure Detection: Application to River and Road Extraction from High-Resolution Satellite Images
Moslem Ouled Sghaier ... Richard Lepage
-
Moslem Ouled Sghaier, et. al.Moslem Ouled Sghaier ... Richard Lepage
01 Jan 2017
01 Jan 2017

Robust Radiometric Normalization of Multitemporal Satellite Images Via Block Adjustment Without Master Images
Kunbo Liu ... Tao Ke
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 13
Kunbo Liu, et. al.Kunbo Liu ... Tao Ke
01 Jan 2020
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Parallelism by design

Abstract

Talk to us

Similar Papers