Fast, Sub-pixel Accurate Digital Image Correlation Algorithm Powered by Heterogeneous (CPU-GPU) Framework

Mullai Thiagu,Sankara J Subramanian,Rupesh Nasre

doi:10.1007/978-3-319-97481-1_13

Abstract

AbstractDigital Image Correlation (DIC) is a popular non-contact image-based full-field deformation measurement tool widely used in mechanics. In spite of its significant advantages, it is still primarily used as a post-processing tool due to its computational cost. In recent years, parallel computing platforms such as multi-core processors and Graphics Processing Units (GPUs) have been used to improve the speed of the DIC algorithm, with GPUs being well-suited for implementing data-parallel operations. Previous works have performed GPU-based DIC wherein each sub-image (i.e. a collection of a few pixels in the local neighborhood of a point of interest) is allocated to a single thread on the GPU, thus achieving parallelism across sub-images. However, this is not the only type of parallelism that is possible: one can also achieve parallelism within a sub-image as well as across whole images. The aim of this work is to efficiently implement 2D-DIC such that parallelism within a sub-image as well as across sub-images leads to considerable reduction in computation time. We use a heterogeneous framework consisting of an Intel Xeon octa-core CPU and an Nvidia Tesla K20C GPU card in this work. The CPU is used to handle image pre-processing, whereas the GPU is used to process four compute-intensive tasks: affine shape function computation, B-Spline interpolation, residual vector calculation and deformation vector update. Parallelization within and across sub-images is achieved in this work by efficient thread handling and use of pre-compiled BLAS libraries. In order to estimate the speedup provided by the GPU, the same four tasks were also evaluated on the octa-core CPU; a speedup of approximately 7 to 5 times was observed for a single sub-image whose size varies from 21×21 to 61×61 respectively. However, it is expected that for a larger number of sub-images, the GPU speedup will be higher and this is indeed the case: when the affine shape function computation and B-Spline interpolation steps were evaluated on 1869 21×21 pixel sub-images, the speedup was around a more impressive 453 times. Further GPU optimization as well as parallelization across image pairs is currently underway and even faster GPU-assisted DIC seems achievable.KeywordsFull-field displacementSub-imageParallel computingHeterogeneous frameworkCompute Unified Device Architecture (CUDA)ThreadKernel

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Fast, Sub-pixel Accurate Digital Image Correlation Algorithm Powered by Heterogeneous (CPU-GPU) Framework

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Architecture-Aware Mapping and Optimization on a 1600-Core GPU
Mayank Daga ... Wu-Chun Feng
-
Mayank Daga, et. al.Mayank Daga ... Wu-Chun Feng
01 Dec 2011
Architecture-Aware Mapping and Optimization on a 1600-Core GPU
Mayank Daga ... Wu-Chun Feng

Computational Benefit of GPU Optimization for the Atmospheric Chemistry Modeling
Jian Sun ... Jack Dongarra
Journal of Advances in Modeling Earth Systems | VOL. 10
Jian Sun, et. al.Jian Sun ... Jack Dongarra
01 Aug 2018
Journal of Advances in Modeling Earth Systems | VOL. 10

General Purpose Computation on Graphics Processing Units Using OpenCL

-

01 Jan 2013
01 Jan 2013

GPU accelerated volumetric lattice Boltzmann model for image-based hemodynamics in portal hypertension
Bo Shang ... Huidan(Whitney) Yu
Computers & Fluids | VOL. 266
Bo Shang, et. al.Bo Shang ... Huidan(Whitney) Yu
23 Aug 2023
Computers & Fluids | VOL. 266

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Fast, Sub-pixel Accurate Digital Image Correlation Algorithm Powered by Heterogeneous (CPU-GPU) Framework

Abstract

Talk to us

Similar Papers