CUDA Research Articles

The object of this research is the parallelization of artificial neural network training to automate medical image analysis, using the Python programming language, the PyTorch framework, and Compute Unified Device Architecture (CUDA) technology. PyTorch operates on the Define-by-Run model. The study analyzes the cloud technologies available for the task and the algorithms for training artificial neural networks. A modified U-Net architecture from the MedicalTorch library was used. It was chosen because the network must learn effectively from small data sets: in medicine, large datasets are hard to obtain because of the confidentiality requirements for data of this nature. The resulting information system performs the tasks set for it and provides a user-friendly interface and the tools needed to simplify and automate data visualization and analysis. The efficiency of neural network training on the central processing unit (CPU) is compared with training on the graphics processing unit (GPU) using CUDA. Cloud technology was used in the study; Google Colab and Microsoft Azure were the cloud services considered. Colab was used first to build a prototype, and Azure was then used to train the finished neural network architecture efficiently. Measurements were performed in both services. The Adam optimizer was used to train the model. Training time on the CPU was also measured in order to estimate the acceleration obtained from GPU computing with CUDA and from cloud technologies.
The model developed during the research showed satisfactory results on the Jaccard and Dice metrics. Cloud computing services were a key factor in the success of this study.
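
The Jaccard and Dice metrics named in the abstract both measure the overlap between a predicted segmentation mask and a reference mask. The sketch below is illustrative only and is not taken from the paper: the function name `jaccard_and_dice`, the flat 0/1 mask representation, and the convention of returning 1.0 for two empty masks are all assumptions.

```python
def jaccard_and_dice(pred, target):
    """Return (Jaccard, Dice) for two flat binary masks of equal length.

    Hypothetical helper for illustration; masks are sequences of 0/1 values.
    """
    if len(pred) != len(target):
        raise ValueError("masks must have the same length")

    # Indices of the pixels labelled as foreground in each mask.
    pred_set = {i for i, v in enumerate(pred) if v}
    target_set = {i for i, v in enumerate(target) if v}

    intersection = len(pred_set & target_set)
    union = len(pred_set | target_set)
    total = len(pred_set) + len(target_set)

    # Assumed convention: two empty masks count as a perfect match.
    jaccard = intersection / union if union else 1.0
    dice = 2 * intersection / total if total else 1.0
    return jaccard, dice


# Toy masks: 2 shared foreground pixels, 4 pixels in the union.
jaccard, dice = jaccard_and_dice([1, 1, 0, 0, 1], [1, 0, 0, 1, 1])
print(jaccard, dice)
```

The two scores are related by Dice = 2J / (1 + J), so they rank segmentations identically; Dice gives more weight to the overlap and is never lower than Jaccard.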

Spectral CT uses the spectral information of X-ray sources to reconstruct energy-resolved X-ray images and has wide medical applications. Compared with conventional energy-integrating CT scanners, however, spectral CT faces serious hardware difficulties, and its clinical use has therefore been expensive and limited. The goal of this paper is to present a software solution: an implementation of a framelet-based spectral reconstruction algorithm for multi-slice spiral scanning on a conventional energy-integrating CT hardware platform. We implement the framelet-based spectral reconstruction algorithm using Compute Unified Device Architecture (CUDA) with bowtie filtration. The CUDA platform enables fast execution of the program, while the bowtie filter reduces radiation exposure. We also adopt an ordered-subset technique to accelerate convergence. The multi-slice spiral scanning geometry with these additional features makes the framelet-based spectral reconstruction algorithm more powerful for clinical applications. The method provides spectral information from just one scan with a standard energy-integrating detector and produces color CT images, spectral curves of the attenuation coefficient at every point inside the object, and photoelectric images, all of which are valuable imaging tools in cancer diagnosis. The performance of the proposed algorithm is tested on a Catphan 504 phantom and on real patient data sets. In the phantom experiments, the synthesized color image reveals changes in color level and detail, and the yellow color in Teflon indicates a special spectral property that is invisible in regular CT reconstruction. In the experiments with clinical images, the synthesized color images provide extra details that are helpful for clinical diagnosis, for example details of the renal pelvis and lumbar joint.
The numerical studies indicate that the proposed method provides spectral image information that can reveal fine structures in clinical images and that the algorithm is efficient in terms of computational time. The proposed algorithm therefore has great potential for practical application.
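
The ordered-subset idea mentioned in the abstract can be illustrated on a toy linear system. The sketch below is not the paper's framelet-based algorithm: it is a plain gradient (Landweber-type) iteration, and the function name `ordered_subsets_solve`, the interleaved subset layout, and the step-size rule are assumptions chosen for simplicity. Instead of using all measurement rows for each update, the rows are split into subsets and the estimate is updated once per subset, so one sweep over the data yields several updates.

```python
def ordered_subsets_solve(A, b, n_subsets, n_sweeps):
    """Approximately solve A x = b by cycling over interleaved row subsets.

    Hypothetical illustration of an ordered-subsets gradient iteration;
    A is a list of rows, b a list of measurements.
    """
    n_rows, n_cols = len(A), len(A[0])
    # Interleaved subsets, e.g. rows 0, 2, ... and rows 1, 3, ...
    subsets = [list(range(s, n_rows, n_subsets)) for s in range(n_subsets)]
    x = [0.0] * n_cols

    for _ in range(n_sweeps):
        for rows in subsets:
            # Assumed step size: inverse of the squared Frobenius norm of the
            # sub-matrix, which keeps each sub-step non-expansive.
            step = 1.0 / sum(A[i][j] ** 2 for i in rows for j in range(n_cols))
            # Back-project the residual of this subset only.
            grad = [0.0] * n_cols
            for i in rows:
                residual = b[i] - sum(A[i][j] * x[j] for j in range(n_cols))
                for j in range(n_cols):
                    grad[j] += A[i][j] * residual
            x = [x[j] + step * grad[j] for j in range(n_cols)]
    return x


# Consistent toy system whose exact solution is x = [1, 2].
A = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [1.0, -1.0]]
b = [1.0, 2.0, 3.0, -1.0]
print(ordered_subsets_solve(A, b, n_subsets=2, n_sweeps=100))
```

In a CT setting the rows would correspond to projection measurements and the subsets to groups of projection angles; each subset update is also a natural unit of work to run as a GPU kernel.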

Articles published on CUDA

Advances in parallel and distributed computing and its applications

Parallel Dislocation Model Implementation for Earthquake Source Parameter Estimation on Multi-Threaded GPU

Development of software and algorithms of parallel learning of artificial neural networks using CUDA technologies

Parallel Makespan Calculation for Flow Shop Scheduling Problem with Minimal and Maximal Idle Time

Multi-GPU implementation of a cellular automaton model for dendritic growth of binary alloy

Investigation of MHD free convection of power‐law fluids in a sinusoidally heated enclosure using the MRT‐LBM

Fast fringe enhancement by improved bidimensional sinusoids-assisted empirical mode decomposition

Implementation of a parallel high-order WENO-type Euler equation solver using a CUDA PTX paradigm

Performance evaluation of GPU- and cluster-computing for parallelization of compute-intensive tasks

A novel GPGPU-parallelized contact detection algorithm for combined finite-discrete element method

GPU-accelerated multitiered iterative phasing algorithm for fluctuation X-ray scattering

Numerical modelling of interaction between aluminium structure and explosion in soil

Efficient parallelization of SPH algorithm on modern multi-core CPUs and massively parallel GPUs

Development of a real-time magnetic island reconstruction system based on PCIe platform for HL-2A tokamak

An Efficient Method to Compute EM Scattering From Target Covered With Honeycomb Composite Material

Boundary condition enforcement for renormalised weakly compressible meshless Lagrangian methods

Study and evaluation of improved automatic GPU offloading method

Safer and More Efficient Parallel Cryptographic Algorithm and its Implementation in the GPU

Implementation of a Framelet-Based Spectral Reconstruction for Multi-Slice Spiral CT

Tabu Genetic Cat Swarm Algorithm Analysis of Optimization Arrangement on Mistuned Blades Based on CUDA
