Efficient 2D and 3D watershed on graphics processing unit: block-asynchronous approaches based on cellular automata

Pablo Quesada-Barriuso,Dora B Heras,Francisco Argüello

doi:10.1016/j.compeleceng.2013.04.020

Pablo Quesada-Barriuso, Dora B Heras + Show 1 more

https://doi.org/10.1016/j.compeleceng.2013.04.020

Copy DOI

Abstract

The watershed transform is a method for non-supervised image segmentation. In this paper we show that a watershed algorithm based on a cellular automaton is a good choice for the recent GPU architectures, especially when the synchronization rules are relaxed. In particular, we propose a block-asynchronous computation strategy that maps the cellular automaton on the thread blocks of the GPU. This method reduces the number of points of global synchronization allowing efficient exploitation of the memory hierarchy of the GPU. We also avoid the artifacts produced in the watershed lines by the block-asynchronous updating scheme by correcting the data propagation speed among the blocks. The proposals are compared to an OpenMP multithreaded code. The high speedups indicate the potential of this kind of algorithm for new architectures based on hundreds of cores. The method is tuned to be applied to 3D volumes obtaining similar results.

Full Text