Abstract

Edge Intelligence (EI) enables local AI processing at the network edge, protecting privacy and reducing data transmission, but deploying resource-intensive neural networks on edge devices remains challenging. Neural architecture search (NAS), valued for its automation and minimal manual intervention, is a pivotal tool for EI. However, existing methods typically optimize resource consumption for specific hardware, yielding hardware-specific neural architectures with limited generalizability. In response, we propose OnceNAS, a novel method that designs and optimizes on-device inference neural networks for resource-constrained edge devices. OnceNAS jointly optimizes parameter count and inference latency alongside inference accuracy, producing lightweight neural networks without sacrificing inference performance. We also introduce an efficient evaluation strategy that assesses multiple metrics simultaneously. Experimental results demonstrate the effectiveness of OnceNAS, which discovers high-performing architectures with a substantial size reduction (10.49x) and speedup (5.45x). OnceNAS thus offers practical value by generating efficient on-device inference neural architectures for resource-constrained edge devices, facilitating real-world applications such as autonomous driving and smart healthcare. Furthermore, we contribute DARTS-Bench, an open-source dataset that provides candidate architectures with hardware-related information and a user-friendly API, facilitating future research in lightweight NAS.
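
The abstract states that OnceNAS jointly optimizes inference accuracy, parameter count, and inference latency. As a rough illustration of what such a multi-objective ranking criterion can look like, the Python sketch below scores candidate architectures with a budget-penalized combination of the three metrics; the function names, budgets, and weights are hypothetical and do not reflect OnceNAS's actual formulation.

```python
# Illustrative sketch only: the abstract does not specify OnceNAS's objective.
# This shows one generic way to combine the three metrics it names (accuracy,
# parameter count, inference latency) into a single scalar for ranking
# candidates. All names, budgets, and weights below are hypothetical.
from dataclasses import dataclass

@dataclass
class Candidate:
    accuracy: float      # validation accuracy in [0, 1]
    params_m: float      # parameter count, in millions
    latency_ms: float    # measured on-device inference latency

def score(c: Candidate,
          params_budget: float = 5.0,    # hypothetical budget (M params)
          latency_budget: float = 20.0,  # hypothetical budget (ms)
          alpha: float = 0.1,            # weight of the size penalty
          beta: float = 0.1) -> float:   # weight of the latency penalty
    """Higher is better: reward accuracy, penalize budget overshoot."""
    size_penalty = max(0.0, c.params_m / params_budget - 1.0)
    latency_penalty = max(0.0, c.latency_ms / latency_budget - 1.0)
    return c.accuracy - alpha * size_penalty - beta * latency_penalty

# Rank a pool of candidates by the combined score.
pool = [
    Candidate(accuracy=0.93, params_m=4.2, latency_ms=18.0),
    Candidate(accuracy=0.95, params_m=9.8, latency_ms=41.0),
]
best = max(pool, key=score)
print(best)
```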
