Data Layout Transformation Research Articles

In deep learning, convolutional neural networks (CNNs) are a class of artificial neural networks (ANNs), most commonly applied to analyze visual imagery. They are also known as Shift-Invariant or Space-Invariant Artificial Neural Networks (SIANNs), based on the shared-weight architecture of the convolution kernels or filters that slide along input features and provide translation-equivariant responses known as feature maps. Recently, various architectures for CNN based on FPGA platform have been proposed because it has the advantages of high performance and fast development cycle. However, some key issues including how to optimize the performance of CNN layers with different structures, high-performance heterogeneous accelerator design, and how to reduce the neural network framework integration overhead need to be improved. To overcome and improve these problems, we propose dynamic cycle pipeline tiling, data layout optimization, and a pipelined software and hardware (SW–HW)-integrated architecture with flexibility and integration. Some benchmarks have been tested and implemented on the FPGA board for the proposed architecture. The proposed dynamic tiling and data layout transformation improved by 2.3 times in the performance. Moreover, with two-level pipelining, we achieve up to five times speedup and the proposed system is 3.8 times more energy-efficient than the GPU.

Complex tensor contraction expressions arise in accurate electronic structure models in quantum chemistry, such as the coupled cluster method. This paper addresses two complementary aspects of performance optimization of such tensor contraction expressions. Transformations using algebraic properties of commutativity and associativity can be used to significantly decrease the number of arithmetic operations required for evaluation of these expressions. The identification of common subexpressions among a set of tensor contraction expressions can result in a reduction of the total number of operations required to evaluate the tensor contractions. The first part of the paper describes an effective algorithm for operation minimization with common subexpression identification and demonstrates its effectiveness on tensor contraction expressions for coupled cluster equations. The second part of the paper highlights the importance of data layout transformation in the optimization of tensor contraction computations on modern processors. A number of considerations, such as minimization of cache misses and utilization of multimedia vector instructions, are discussed. A library for efficient index permutation of multidimensional tensors is described, and experimental performance data is provided that demonstrates its effectiveness.

Data Layout Transformation Research Articles

Articles published on Data Layout Transformation

Optimizing FPGA-Based Convolutional Neural Network Performance

Synthesis-powered optimization of smart contracts via data type refactoring

Optimizing Graph Processing on GPUs

Reducing I/O variability using dynamic I/O path characterization in petascale storage systems

Data layout optimization for GPGPU architectures

Data Layout Transformation Exploiting Memory-Level Parallelism in Structured Grid Many-Core Applications

Performance Optimization of Tensor Contraction Expressions for Many-Body Methods in Quantum Chemistry

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Data Layout Transformation Research Articles

Articles published on Data Layout Transformation

Optimizing FPGA-Based Convolutional Neural Network Performance

Synthesis-powered optimization of smart contracts via data type refactoring

Optimizing Graph Processing on GPUs

Reducing I/O variability using dynamic I/O path characterization in petascale storage systems

Data layout optimization for GPGPU architectures

Data Layout Transformation Exploiting Memory-Level Parallelism in Structured Grid Many-Core Applications

Performance Optimization of Tensor Contraction Expressions for Many-Body Methods in Quantum Chemistry