Abstract

NVDLA (NVIDIA Deep Learning Accelerator) is an open-source CNN (Convolutional Neural Network) accelerator and has broad application prospects. The implement of Lenet-5 based on NVDLA is presented in this paper, which makes the recognition of handwritten numeral more efficient. Under the condition of no compiler for NVDLA is available so far, the function of registers and the working principles of NVDLA were explored. The LeNet-5 runs on NVDLA successfully through configuring the NVDLA registers. This work could also make a base for the implementation of other deep learning networks, such as AlexNet and VGG. The performance of the accelerator is estimated, and the results of the experiment indicate that Rocket SoC + NVDLA is able to achieve higher CNN processing efficiency, up to 4647x times than RISC-V core.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call