This paper presents RL-Scheduling, a novel reinforcement learning (RL)-based framework for optimizing dataflow scheduling in Deep Neural Network (DNN) accelerators. As DNNs grow increasingly complex, efficient hardware accelerators such as TPUs and custom-designed ASICs are essential to meet performance and energy-efficiency demands. However, optimizing dataflow scheduling remains challenging due to the vast design space and dynamic hardware constraints. The proposed framework uses Proximal Policy Optimization (PPO) to dynamically adjust scheduling strategies: after the RL agent selects the rows of the schedule to optimize, a brute-force search finds the optimal solutions for those rows, ensuring that the resulting schedule satisfies both the DNN parameters and the hardware resource constraints. We validated the framework on several DNN models, including YOLO v3, Inception v4, MobileNet v3, and ResNet-50, across multiple accelerator architectures such as Eyeriss, TPU v3, and Simba. The experimental results show substantial improvements, with RL-Scheduling achieving up to a 65.6% reduction in execution cycles and a 59.7% improvement in energy efficiency over existing scheduling algorithms. Additionally, the scheduling algorithm itself executes more efficiently than most existing approaches.
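To make the two-stage idea in the abstract concrete, the following minimal sketch illustrates how an agent-selected subset of schedule rows could be re-optimized by brute-force search under constraint checks. It is not the authors' implementation: the paper uses a trained PPO agent, whereas here `select_rows_with_policy`, the cost model, the constraint check, and the candidate values are all hypothetical stand-ins.

```python
# Minimal sketch (not the paper's code): an RL policy chooses which rows of a
# scheduling table to re-optimize, then a brute-force search finds the best
# feasible values for just those rows. All names and the toy cost/constraint
# models below are assumptions for illustration only.
import itertools
import random

N_ROWS = 4                  # rows of the scheduling table (assumed small here)
CANDIDATES = [1, 2, 4, 8]   # assumed candidate values per row (e.g., tile sizes)

def cost_model(schedule):
    """Stand-in for a cycle/energy estimate of a schedule."""
    return sum((v - 3) ** 2 for v in schedule)

def hardware_ok(schedule):
    """Stand-in for DNN-parameter / hardware-resource constraint checks."""
    return sum(schedule) <= 20

def select_rows_with_policy(schedule, k=2):
    """Placeholder for the PPO agent: the paper trains a policy to pick the
    rows worth optimizing; here we simply sample k rows at random."""
    return random.sample(range(len(schedule)), k)

def brute_force_rows(schedule, rows):
    """Exhaustively try all candidate values for the selected rows and keep
    the feasible assignment with the lowest estimated cost."""
    best, best_cost = list(schedule), cost_model(schedule)
    for combo in itertools.product(CANDIDATES, repeat=len(rows)):
        trial = list(schedule)
        for r, v in zip(rows, combo):
            trial[r] = v
        if hardware_ok(trial) and cost_model(trial) < best_cost:
            best, best_cost = trial, cost_model(trial)
    return best, best_cost

if __name__ == "__main__":
    schedule = [8] * N_ROWS                 # arbitrary initial schedule
    for step in range(5):                   # outer loop the RL agent would drive
        rows = select_rows_with_policy(schedule)
        schedule, cost = brute_force_rows(schedule, rows)
        print(f"step {step}: rows={rows} schedule={schedule} cost={cost}")
```

In the actual framework, the PPO policy would be rewarded based on the improvement in execution cycles or energy reported by the accelerator cost model, so that row selection improves over training rather than being random as in this sketch.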