Agile Optimization Framework: A framework for tensor operator optimization in neural network

Mingwei Zhou, Xuxin Lin, Yanyan Liang

doi:10.1016/j.future.2024.07.019

Abstract

In recent years, as Moore’s Law has gradually slowed and deep learning has advanced, the demand for hardware performance when executing deep learning applications has increased significantly. In this context, deep learning compilers, especially the end-to-end compiler Tensor Virtual Machine (TVM), have been shown to extract more performance from hardware without increasing its computational power. TVM optimizes tensor operators by searching for efficient parallel computing schemes, thereby improving the performance of neural network inference. However, existing TVM-based optimization methods, such as Genetic Algorithms Tuner (GA-Tuner), leave potential untapped and fail to balance optimization performance against optimization time. The long optimization time detracts from TVM’s usability and makes it difficult to adopt more widely in the scientific community. This paper introduces the Agile Optimization Framework (AOF), a novel deep learning compilation optimization framework based on TVM, which incorporates a tuner based on the recent Beluga Whale Optimization (BWO) algorithm. BWO is adept at tackling complex problems characterized by numerous local optima, making it particularly suitable for hardware compilation optimization scenarios. We further propose an Evolving Epsilon Strategy (EES), a search strategy that adaptively adjusts the balance between exploration and exploitation, thereby enhancing the effectiveness of the algorithm. Additionally, we develop a supervised Tuning Accelerator (TA) to reduce the time required for optimization. Comparative experiments demonstrate that AOF achieves an 11.36%–66.20% improvement in performance and a 30.30%–54.60% reduction in optimization time, significantly outperforming the control group.
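The abstract does not define EES, BWO, or the TA in detail, so the following is only a generic illustration of the exploration/exploitation trade-off it mentions, not the paper's algorithm. It is a minimal sketch that assumes a linearly decaying epsilon, a hypothetical two-knob tiling space, and a made-up `toy_cost` function standing in for measured operator latency; all names and parameters here are illustrative assumptions.

```python
import random

# Hypothetical knob values for a two-knob tiling space (an assumption for
# illustration; a real TVM search space is far larger).
CHOICES = [1, 2, 4, 8, 16, 32]


def decaying_epsilon(step, total_steps, eps_start=0.9, eps_end=0.05):
    """Linearly move epsilon from eps_start to eps_end over the run."""
    frac = step / max(total_steps - 1, 1)
    return eps_start + (eps_end - eps_start) * frac


def toy_cost(config):
    """Made-up stand-in for measured latency (lower is better)."""
    tile_x, tile_y = config
    return (tile_x - 16) ** 2 + (tile_y - 8) ** 2


def search(total_steps=200, seed=0):
    """Epsilon-greedy search: explore early, exploit the best config later."""
    rng = random.Random(seed)
    best = (rng.choice(CHOICES), rng.choice(CHOICES))
    best_cost = toy_cost(best)
    for step in range(total_steps):
        eps = decaying_epsilon(step, total_steps)
        if rng.random() < eps:
            # Exploration: sample a fresh random configuration.
            cand = (rng.choice(CHOICES), rng.choice(CHOICES))
        else:
            # Exploitation: nudge one knob of the best config to a neighbor.
            knob = rng.randrange(2)
            idx = CHOICES.index(best[knob]) + rng.choice([-1, 1])
            idx = min(max(idx, 0), len(CHOICES) - 1)
            cand = tuple(CHOICES[idx] if k == knob else best[k] for k in range(2))
        cost = toy_cost(cand)
        if cost < best_cost:
            best, best_cost = cand, cost
    return best, best_cost


if __name__ == "__main__":
    print(search())
```

A high epsilon early in the run favors random sampling across the space, while a low epsilon later concentrates effort around the best configuration found so far. An adaptive strategy such as the one the abstract describes would presumably adjust this schedule during the search rather than fix it in advance.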

Journal: Future Generation Computer Systems
Publication Date: Jul 16, 2024
Citations: 1

