A Survey on Efficient Convolutional Neural Networks and Hardware Acceleration

Deepak Ghimire,Dayoung Kil,Seong-Heum Kim

doi:10.3390/electronics11060945

Deepak Ghimire, Dayoung Kil + Show 1 more

Open Access

https://doi.org/10.3390/electronics11060945

Copy DOI

Journal: Electronics	Publication Date: Mar 18, 2022
Citations: 72	License type: CC BY 4.0

Affiliation: Soongsil University

Abstract

Over the past decade, deep-learning-based representations have demonstrated remarkable performance in academia and industry. The learning capability of convolutional neural networks (CNNs) originates from a combination of various feature extraction layers that fully utilize a large amount of data. However, they often require substantial computation and memory resources while replacing traditional hand-engineered features in existing systems. In this review, to improve the efficiency of deep learning research, we focus on three aspects: quantized/binarized models, optimized architectures, and resource-constrained systems. Recent advances in light-weight deep learning models and network architecture search (NAS) algorithms are reviewed, starting with simplified layers and efficient convolution and including new architectural design and optimization. In addition, several practical applications of efficient CNNs have been investigated using various types of hardware architectures and platforms.

Full Text