Abstract

Lightweight neural networks that employ depthwise convolution have a significant computational advantage over those that use standard convolution because they involve fewer parameters; however, they incur higher latency, even on graphics processing units (GPUs). We propose a Repetition-Reduction Network (RRNet) in which the number of depthwise channels is large enough to reduce GPU latency while remaining small enough to keep computation low. RRNet also reduces power consumption and memory usage, not only in the encoder but also in the residual connections to the decoder. It has two key modules: the Repetition-Reduction (RR) block, a set of repeated lightweight convolutions used for feature extraction in the encoder, and the Condensed Decoding Connection (CDC), which can replace the skip connection, delivering features to the decoder while significantly reducing the channel depth of the decoder layers. We apply RRNet to resource-constrained depth estimation, where it proves to be significantly more efficient than other methods in terms of energy consumption, memory usage, and computation. Experimental results on the KITTI dataset show that RRNet consumes 3.84x less energy and 3.06x less memory than conventional schemes, and that it is 2.21x faster on a commercial mobile GPU without increasing the demand on hardware resources relative to the baseline network. Furthermore, RRNet outperforms state-of-the-art lightweight models such as MobileNets, PyDNet, DiCENet, DABNet, and EfficientNet.

Highlights

  • Depth estimation is crucial for several computer vision applications

  • We propose a Repetition-Reduction Network (RRNet), an energy-efficient encoder–decoder model based on RR blocks and the Condensed Decoding Connection (CDC) that outperforms current state-of-the-art complex and lightweight models in terms of accuracy, run time, and energy consumption on practical mobile graphics processing unit (GPU) hardware

  • We have observed that although depthwise convolution involves a small amount of computation, its GPU latency is higher than that of other convolution operations such as the 3×3 standard convolution and the pointwise convolution, as described in detail in the Bottleneck part of Section III-A
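The computation side of the observation above can be verified with a back-of-the-envelope multiply-accumulate (MAC) count comparing a standard convolution against a depthwise separable one (a depthwise convolution followed by a 1×1 pointwise convolution). The sketch below is illustrative only: the function names and layer sizes are our own assumptions, not values from the paper, and MAC counts capture only arithmetic cost, not the GPU latency effect the authors highlight.

```python
# Illustrative MAC counts: depthwise separable convolution is far cheaper
# arithmetically than standard convolution, which is why its higher GPU
# latency (from low arithmetic intensity) is counterintuitive.

def standard_conv_macs(h, w, c_in, c_out, k=3):
    """MACs for a k x k standard convolution on an h x w feature map."""
    return h * w * c_in * c_out * k * k

def depthwise_separable_macs(h, w, c_in, c_out, k=3):
    """MACs for a k x k depthwise conv followed by a 1x1 pointwise conv."""
    depthwise = h * w * c_in * k * k       # one k x k filter per channel
    pointwise = h * w * c_in * c_out       # 1 x 1 cross-channel mixing
    return depthwise + pointwise

if __name__ == "__main__":
    h = w = 32
    c_in = c_out = 64  # assumed layer sizes, for illustration only
    std = standard_conv_macs(h, w, c_in, c_out)
    dws = depthwise_separable_macs(h, w, c_in, c_out)
    print(f"standard: {std:,} MACs")
    print(f"depthwise separable: {dws:,} MACs ({std / dws:.1f}x fewer)")
```

For this assumed 3×3, 64-channel layer on a 32×32 map, the depthwise separable form needs roughly 8x fewer MACs, yet each depthwise channel performs little arithmetic per memory access, which is what drives up GPU latency in practice.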


Summary

Introduction

Depth estimation is crucial for several computer vision applications. Many technological goals, including localization in augmented reality (AR) or virtual reality (VR), advanced robotics, the reliable operation of autonomous vehicles or drones, and smart factories, cannot be realized without accurate depth estimation. Deep learning approaches [1]–[9] convincingly outperform attempts to solve this problem manually [10], [11]. However, their use in mobile applications that involve a lightweight neural network model and relatively low-end graphics processing units (GPUs) remains limited. As we will show in the subsequent sections of this paper, this can strongly affect performance.

