Abstract

Tensor Cores are specialized hardware units added to recent NVIDIA GPUs to speed up matrix-multiplication-heavy tasks, such as convolutions and densely connected layers in neural networks. Due to their specific hardware implementation and programming model, Tensor Cores cannot be straightforwardly applied to applications outside machine learning. In this paper, we demonstrate the feasibility of using NVIDIA Tensor Cores to accelerate a non-machine-learning application: iterative Computed Tomography (CT) reconstruction. For large CT images and real-time CT scanning, the reconstruction time of many existing iterative methods is relatively high, ranging from seconds to minutes depending on the image size. CT reconstruction is therefore an application area that could benefit from Tensor Core hardware acceleration. We first studied the reconstruction algorithm's performance as a function of the hardware-related parameters and proposed an approach to accelerate reconstruction on Tensor Cores. The results show that the proposed method yields roughly a fivefold speedup and a corresponding energy saving on the NVIDIA RTX 2080 Ti GPU for the parallel projection of 32 images of size 512 × 512. The relative reconstruction error due to the mixed-precision computations was almost equal to that of single-precision (32-bit) floating-point computations. We then presented an approach for real-time and memory-limited applications that exploits the symmetry of the system (i.e., the acquisition geometry). As the proposed approach is based on the conjugate gradient method, it can be generalized to many other research and industrial fields.
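Since the reconstruction is driven by the conjugate gradient method, it is worth spelling out why the projection kernels dominate the cost. The sketch below is standard CGLS applied to the least-squares system min_x ||Ax − b||², where A is the projection operator; it is a generic reminder of the method's structure, not necessarily the exact variant used in the paper:

    r_0 = b - A x_0, \quad p_0 = s_0 = A^\top r_0
    \text{for } k = 0, 1, 2, \ldots:
        q_k     = A p_k                         \quad\text{(forward projection, FP)}
        \alpha_k = \|s_k\|^2 / \|q_k\|^2
        x_{k+1} = x_k + \alpha_k p_k
        r_{k+1} = r_k - \alpha_k q_k
        s_{k+1} = A^\top r_{k+1}                \quad\text{(back-projection, BP)}
        \beta_k  = \|s_{k+1}\|^2 / \|s_k\|^2
        p_{k+1} = s_{k+1} + \beta_k p_k

Each iteration applies A once (one FP) and A^T once (one BP); everything else is cheap vector arithmetic, so any speedup of FP and BP translates almost directly into total reconstruction time.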

Highlights

  • Graphics Processing Units (GPUs), as one of the most widely accessible parallel processors, have proven their power in facilitating research in a wide range of fields, including high-performance computing, data centers, medical imaging, and machine learning

  • We demonstrate the application of NVIDIA Tensor Cores to accelerating the Computed Tomography (CT) forward-projection (FP) and back-projection (BP) algorithms, which are among the most computationally demanding kernels in iterative reconstruction approaches

  • The experimental setup and the image and projection sizes considered are described


Summary

Introduction

Graphics Processing Units (GPUs), as one of the most widely accessible parallel processors, have proven their power in facilitating research in a wide range of fields, including high-performance computing, data centers, medical imaging, and machine learning. GPU-based machine learning applications, particularly deep learning, have grown significantly in recent years [18]. To address this need, NVIDIA introduced a specialized computing unit, the Tensor Core, which speeds up the matrix multiplications at the core of neural network workloads. In CT reconstruction, the distance-driven method converts the projection–backprojection problem into a one-dimensional re-sampling problem by mapping both the image pixel boundaries and the detector cell boundaries onto a common axis; the value of each detector cell is then a linear equation in the pixel values it overlaps, e.g., P0 and P1. This means that the distance-driven projection describes a linear transformation from the image domain to the projection domain.
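To make that linearity concrete, consider a minimal worked case; the overlap lengths l_0 and l_1 and the cell width d below are illustrative symbols, not notation taken from the paper. If a detector cell's footprint on the common axis overlaps two adjacent pixels with values P0 and P1, the distance-driven projection value is

    p = \frac{l_0 \, P_0 + l_1 \, P_1}{d},

which is linear in P0 and P1. Stacking one such equation per detector cell and view yields the system matrix A applied by FP, with BP acting as its transpose A^T.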
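Because FP and BP thus reduce to (tiled) matrix products, they can be staged onto Tensor Cores through CUDA's WMMA API. The kernel below is a minimal sketch of the mixed-precision mode mentioned in the abstract (half-precision inputs, 32-bit accumulation); the kernel name and the fixed 16 × 16 tile are illustrative choices, not the authors' implementation.

    #include <mma.h>
    #include <cuda_fp16.h>
    using namespace nvcuda;

    // One warp multiplies one pair of 16x16 half-precision tiles,
    // accumulating the result in 32-bit float on the Tensor Cores.
    __global__ void tile_mma_16x16(const half *a, const half *b, float *c) {
        wmma::fragment<wmma::matrix_a, 16, 16, 16, half, wmma::row_major> a_frag;
        wmma::fragment<wmma::matrix_b, 16, 16, 16, half, wmma::row_major> b_frag;
        wmma::fragment<wmma::accumulator, 16, 16, 16, float> c_frag;

        wmma::fill_fragment(c_frag, 0.0f);               // start from C = 0
        wmma::load_matrix_sync(a_frag, a, 16);           // leading dimension = 16
        wmma::load_matrix_sync(b_frag, b, 16);
        wmma::mma_sync(c_frag, a_frag, b_frag, c_frag);  // C += A * B
        wmma::store_matrix_sync(c, c_frag, 16, wmma::mem_row_major);
    }

A single-warp launch, tile_mma_16x16<<<1, 32>>>(d_a, d_b, d_c), computes one tile; compile with nvcc -arch=sm_70 or newer, as WMMA requires Volta-class hardware or later (the RTX 2080 Ti used in the paper is sm_75).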


