Auto-tuning Fixed-point Precision with TVM on RISC-V Packed SIMD Extension

Chun-Chieh Yang,Yuan-Ming Chang,Jenq-Kuen Lee,Yi-Ru Chen,Hui-Hsin Liao

doi:10.1145/3569939

Abstract

Today, as deep learning (DL) is applied more often in daily life, dedicated processors such as CPUs and GPUs have become very important for accelerating model executions. With the growth of technology, people are becoming accustomed to using edge devices, such as mobile phones, smart watches, and VR devices in their daily lives. A variety of technologies using DL are gradually being applied to these edge devices. However, there is a large number of computations in DL. It faces a challenging problem how to provide solutions in the edge devices. In this article, the proposed method enables a flow with the RISC-V Packed extension (P extension) in TVM. TVM, an open deep learning compiler for neural network models, is growing as a key infrastructure for DL computing. RISC-V is an open instruction set architecture (ISA) with customized and flexible features. The Packed-SIMD extension is a RISC-V extension that enables subword single-instruction multiple-data (SIMD) computations in RISC-V architectures to support fallback engines in AI computing. In the proposed flow, a fixed-point type that is supported by an integer of 16-bit type and saturation instructions is added to replace the original 32-bit float type. In addition, an auto-tuning method is proposed to use a uniform selector mechanism (USM) to find the binary point position for fixed-point type use. The tensorization feature of TVM can be used to optimize specific hardware such as subword SIMD instructions with RISC-V P extension. With our experiment on the Spike simulator, the proposed method with the USM can improve performance by approximately 2.54 to 6.15× in terms of instruction counts with little accuracy loss.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Auto-tuning Fixed-point Precision with TVM on RISC-V Packed SIMD Extension

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Design Automation of Electronic Systems

Lead the way for us

Journal: ACM Transactions on Design Automation of Electronic Systems	Publication Date: Mar 22, 2023
Citations: 6

Similar Papers

Efficient Acceleration of Deep Learning Inference on Resource-Constrained Edge Devices: A Review
Md Maruf Hossain Shuvo ... Syed Kamrul Islam
Proceedings of the IEEE | VOL. 111
Md Maruf Hossain Shuvo, et. al.Md Maruf Hossain Shuvo ... Syed Kamrul Islam
01 Jan 2023
Proceedings of the IEEE | VOL. 111

Distributed Deep Learning in An Edge Computing System
Tanmoy Sen ... Haiying Shen
-
Tanmoy Sen, et. al.Tanmoy Sen ... Haiying Shen
01 Oct 2022
01 Oct 2022

Big Data and Deep Learning Analytics
Nipun Tyagi
Journal of Artificial Intelligence & Cloud Computing | VOL. -
Nipun TyagiNipun Tyagi
30 Sep 2023
Journal of Artificial Intelligence & Cloud Computing | VOL. -

AI Multi-Tenancy on Edge: Concurrent Deep Learning Model Executions and Dynamic Model Placements on Edge Devices
Piyush Subedi ... Lakshmish Ramaswamy
-
Piyush Subedi, et. al.Piyush Subedi ... Lakshmish Ramaswamy
01 Sep 2021
01 Sep 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Auto-tuning Fixed-point Precision with TVM on RISC-V Packed SIMD Extension

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Design Automation of Electronic Systems