Abstract

Understanding the surrounding 3D scene is of the utmost importance for many robotic applications. The rapid evolution of machine learning techniques has enabled impressive results when depth is extracted from a single image. However, achieving this level of performance typically requires high-latency networks, rendering them unusable for time-constrained applications. This article introduces a lightweight Convolutional Neural Network (CNN) for depth estimation, NEON, designed to balance accuracy and inference time. Instead of relying solely on visual features, the proposed methodology exploits the motion-parallax effect to combine the apparent motion of pixels with texture. This research demonstrates that motion perception provides crucial insight into the magnitude of each pixel's movement, which also encodes cues about depth, since large displacements usually occur when objects are closer to the imaging sensor. NEON's performance is compared to relevant networks in terms of Root Mean Squared Error (RMSE), the percentage of correctly predicted pixels (δ1) and inference times, using the KITTI dataset. Experiments show that NEON is significantly more efficient than the current top-ranked network, producing predictions 12 times faster while achieving an average RMSE of 3.118 m and a δ1 of 94.5%. Ablation studies demonstrate the relevance of tailoring the network to use motion perception principles in estimating depth from image sequences, since the effectiveness and quality of the estimated depth maps are similar to those of more computationally demanding state-of-the-art networks. Therefore, this research proposes a network that can be integrated into robotic applications, where computational resources and processing time are important constraints, enabling tasks such as obstacle avoidance, object recognition and robotic grasping.
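For reference, the two reported metrics can be written out concretely. The sketch below is a minimal, illustrative implementation of RMSE and δ1 as they are conventionally defined for KITTI depth evaluation; the function name, the valid-pixel masking and the 1.25 threshold follow common practice and are assumptions, not the authors' evaluation code.

```python
import numpy as np

def depth_metrics(pred, gt):
    """Compute RMSE (metres) and delta_1 accuracy for a predicted depth map.

    pred, gt: arrays of predicted and ground-truth depth in metres;
    only pixels with valid (positive) ground truth are evaluated.
    """
    valid = gt > 0
    pred, gt = pred[valid], gt[valid]

    # Root Mean Squared Error over valid pixels.
    rmse = np.sqrt(np.mean((pred - gt) ** 2))

    # delta_1: fraction of pixels whose ratio max(pred/gt, gt/pred) < 1.25.
    ratio = np.maximum(pred / gt, gt / pred)
    delta1 = np.mean(ratio < 1.25)

    return rmse, delta1
```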
