Abstract
Owing to the many cores in its architecture, the graphics processing unit (GPU) offers promise for parallel execution of the particle filter. A stage of the particle filter that is particularly challenging to parallelize is resampling. Parallel resampling algorithms exist in the literature, such as Metropolis resampling, which does not require a collective operation such as a cumulative sum over the weights and does not suffer from numerical instability. However, with a large number of particles, Metropolis resampling becomes slow because of the non-coalesced access problem on the global memory of the GPU. In this article, we offer solutions to this problem of Metropolis resampling. We introduce two implementation techniques, named Metropolis-C1 and Metropolis-C2, and compare them with the original Metropolis resampling on an NVIDIA Tesla K40 board. In the first scenario, where these two techniques achieve their fastest execution times, Metropolis-C1 is faster than the others but yields the worst results in quality, whereas Metropolis-C2 is closer to the original Metropolis resampling in quality. In the second scenario, where all three algorithms yield similar quality, Metropolis-C1 and Metropolis-C2 become slower but are still faster than the original Metropolis resampling.
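For orientation, the sketch below shows a baseline Metropolis resampling kernel as it is commonly described in the GPU particle-filter literature: each thread handles one particle and runs B Metropolis steps, accepting a uniformly drawn candidate index j with probability min(1, w[j]/w[k]). This is not the article's Metropolis-C1 or Metropolis-C2 technique; the kernel name, parameter names, and the use of cuRAND device states are illustrative assumptions. The random reads of w[j] illustrate the non-coalesced global-memory access pattern that the abstract identifies as the bottleneck.

// Minimal sketch of baseline Metropolis resampling in CUDA (assumed names).
#include <curand_kernel.h>

// One thread per particle: run B Metropolis steps over the weight array w.
// The random indices j produce scattered (non-coalesced) reads of global memory.
__global__ void metropolis_resample(const float *w, int *ancestor,
                                    int N, int B, unsigned long long seed)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= N) return;

    curandState state;
    curand_init(seed, i, 0, &state);   // independent stream per thread

    int k = i;                          // start the chain at this particle
    for (int b = 0; b < B; ++b) {
        float u = curand_uniform(&state);      // u in (0, 1]
        int j = curand(&state) % N;            // candidate index, uniform over particles
        if (u <= w[j] / w[k])                  // accept with probability min(1, w[j]/w[k])
            k = j;
    }
    ancestor[i] = k;                    // ancestor index chosen for particle i
}

A host-side launch would look like metropolis_resample<<<(N + 255) / 256, 256>>>(d_w, d_ancestor, N, B, seed); the number of iterations B trades off resampling quality against run time, which is the trade-off the two scenarios in the abstract explore.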