Abstract

Convolutional neural networks are widely used in human production and life, but due to their large amount of calculation and complex calculation mode, their calculation speed is slow, so it is necessary to design a dedicated hardware accelerator. This paper firstly analyzes the algorithm of the convolutional neural network and decomposes the algorithm into multiple basic operations. For the convolution operation with the largest amount of calculation and complex operation mode, a near calculation storage array is designed according to its operational characteristics. Furthermore, a convolutional neural network accelerator architecture is proposed to realize the fast operation of a convolutional neural network.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call